Generic ETL Framework with Apache Iceberg & Python

This ETL framework is designed to extract data from 100+ sources, transform using Pandas/Arrow, and load into Apache Iceberg format on AWS S3.

🔧 Features

python main.py mysql_customers

Or trigger via Prefect UI/CLI:

prefect deployment build flows/etl_flow.py:etl_flow -n etl-deployment
prefect deployment apply etl_flow-deployment.yaml
prefect agent start

See config/etl_config.yaml

Define AWS creds in .env file or via environment variables

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
connectors		connectors
core		core
flows		flows
.env		.env
.gitattributes		.gitattributes
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py