End-to-End-Data-Pipeline

hoangsonww

📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transformation, storage, monitoring, and AI/ML serving with CI/CD automation using Terraform & GitHub Actions.

54 Stars
31 Forks
54 Watchers
Python Language
MIT License
Cost to Build: $1.39M
Market Value: $3.25M

Growth over time

2 data points · 2025-09-16 → 2025-09-20 (series: Stars, Forks, Watchers)

How to clone End-to-End-Data-Pipeline

Clone via HTTPS

git clone https://github.com/hoangsonww/End-to-End-Data-Pipeline.git

Clone via SSH

git clone git@github.com:hoangsonww/End-to-End-Data-Pipeline.git

Download ZIP

Download master.zip
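
The three ways of fetching the repository listed above differ only in the URL. As a rough sketch, the snippet below builds each one from the owner and repository name. The helper `github_urls` is hypothetical (it is not part of the project), the branch name `master` is inferred from the "master.zip" label, and the ZIP path assumes GitHub's usual `archive/refs/heads/<branch>.zip` pattern.

```python
# Hypothetical helper, not part of the project: builds the HTTPS, SSH,
# and ZIP URLs for a GitHub repository from its owner and name.
def github_urls(owner: str, repo: str, branch: str = "master") -> dict[str, str]:
    base = f"https://github.com/{owner}/{repo}"
    return {
        # URL passed to `git clone` over HTTPS
        "https": f"{base}.git",
        # URL passed to `git clone` over SSH (requires an SSH key on the account)
        "ssh": f"git@github.com:{owner}/{repo}.git",
        # Assumed archive URL for the given branch (default inferred from "master.zip")
        "zip": f"{base}/archive/refs/heads/{branch}.zip",
    }


urls = github_urls("hoangsonww", "End-to-End-Data-Pipeline")
for name, url in urls.items():
    print(f"{name}: {url}")
```

The HTTPS form works without any account setup, while the SSH form is usually more convenient for contributors who push changes regularly.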

Found an issue?

Report bugs or request features on the End-to-End-Data-Pipeline issue tracker:

Open GitHub Issues