Warning: this adapter is work in progress, contributions welcome!
This plugin can be installed via pip:
$ pip install git+https://github.com/Dandandan/dbt-athena.git
A dbt profile can be configured to run against Athena using the following configuration:
Option | Description | Required? | Example |
---|---|---|---|
s3_staging_dir | The location where Athena stores meta info | Required | s3://bucket/staging |
schema | Specify the schema (athena database) to build models into | Required | dev |
database | Data catalog | Required | awsdatacatalog |
region_name | Specify in which AWS region it should connect | Required | eu-west-1 |
threads | How many threads dbt should use | Optional(default=1 ) |
8 |
max_retry_number | Number for retries for exponential backoff | Optional(default=5 ) |
8 |
max_retry_delay | Maximum delay for exponential backoff in seconds | Optional(default=100 ) |
8 |
Example profiles.yml entry:
athena:
target: athena
outputs:
athena:
type: athena
database: awsdatacatalog
schema: dev
region_name: eu-west-1
threads: 8
s3_staging_dir: s3://athena-staging-bucket/
Due to the nature of Athena, not all core dbt functionality is supported. The following features of dbt are not implemented on Athena:
- Archival
- Incremental models
If you are interested in helping to add support for this functionality in dbt on Athena, please open an issue!
Known issues:
- Quoting is not supported
Everyone interacting in the dbt project's codebases, issue trackers, chat rooms, and mailing lists is expected to follow the PyPA Code of Conduct.