Redfin-Analytics-ETL-using-Amazon-EMR-by-Airflow-on-EC2
PublicThis is an end-to-end AWS Cloud ETL project. This data pipeline uses an Amazon EMR cluster managed by Apache Airflow that is running on an AWS EC2 instance. It demonstrates how to build orchestration that would perform data transformation using Amazon EMR as well as automatic data ingestion into a Snowflake via Snowpipe. It also features Power BI.
amazon-emr-clusterapache-airflowapache-sparkaws-ec2aws-s3business-intelligencedagsdata-visualizationetl-pipelinegoogle-colab-notebook
Creat:2025-04-11T03:25:07
Update:2025-04-11T04:27:11
0
Stars
0
Stars Increase