Skip to content

Minimal containerised examples of ETL scripts/flows/components in Python

License

Notifications You must be signed in to change notification settings

DylanBartels/python-etl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python ETL

Minimal examples of ETL scripts/flows/components in Python. All implementations use the same input data and data transformation. Starting point and lookup source for anything date-engineer related.

Requirments

  • docker
  • make

Usage

make [implementation]

List of Implementations

Example

make spark

Will run a etl job through a containzerized spark cluster and put the result on minio. After the etl job is finished running on spark, the result can be seen in the minio console at http://127.0.0.1:8060/buckets/example/browse (user: dev & password: dev_pass)

Inspiration

Inspired by algorithms, and intended to become the etl scripts/flows/components equivalent.

About

Minimal containerised examples of ETL scripts/flows/components in Python

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published