e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
How to download and setup e2e-data-engineering
Open terminal and run command
git clone https://github.com/airscholar/e2e-data-engineering.git
git clone is used to create a copy or clone of e2e-data-engineering repositories.
You pass git clone a repository URL. it supports a few different network protocols and corresponding URL formats.
Also you may download zip file with e2e-data-engineering https://github.com/airscholar/e2e-data-engineering/archive/master.zip
Or simply clone e2e-data-engineering with SSH
[email protected]:airscholar/e2e-data-engineering.git
If you have some problems with e2e-data-engineering
You may open issue on e2e-data-engineering support forum (system) here: https://github.com/airscholar/e2e-data-engineering/issuesSimilar to e2e-data-engineering repositories
Here you may see e2e-data-engineering alternatives and analogs
discourse gogs Qix netdata NodeBB typeorm dotfiles scala-exercises react-firebase-starter node-pg-migrate mumuki-laboratory docker-django-nginx-uwsgi-postgres-load-balance-tutorial bookbrainz-site rest-api-node-typescript octobox metabase backup sysbench tsung netkiller.github.io postgrest pgcli dbeaver dev-setup redash pgweb patroni stolon docker-compose-healthcheck pgdoctor