3 repositories on SrcLog
Personal Data Engineering Projects
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy