Script to extract collections from MongoDB and load it to Parquet
Workflow Engine for Kubernetes
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
The developer first cloud governance platform
PipelineAI
Build data pipelines, the easy way ?️
Curated list of resources about Apache Airflow
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Elyra extends JupyterLab with an AI centric approach.
Node.js library to receive live stream events (comments, gifts, etc.) in realtime from TikTok LIVE.