HomeAI Tutorial

realtime-cdc-pipeline-docker

Public

An End-to-End Real-time Data Pipeline using Debezium (CDC) to stream changes from PostgreSQL to Kafka, processed by Apache Spark (Structured Streaming), and sunk into ClickHouse for analytics. Orchestrated by Airflow and fully containerized with Docker Compose.

Creat2025-11-30T22:06:36
Update2025-11-30T22:50:57
1
Stars
0
Stars Increase

Related projects