books-etl-pipeline

Public

The Books ETL Pipeline is a data engineering project that extracts, transforms, and loads data from Goodreads and other sources to analyze book authors and their works. It leverages tools like Airflow for orchestration, MySQL for data storage, and Grafana for visualization.
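The extract, transform, and load stages described above can be illustrated with a minimal sketch. It is not the project's actual code: the sample records, the function names, and the `books` table are hypothetical, and an in-memory SQLite database stands in for MySQL so the example runs without a server. In the real project, each function would presumably be wired up as an Airflow task.

```python
import sqlite3

# Hypothetical sample records standing in for data pulled from Goodreads.
RAW_BOOKS = [
    {"title": " Dune ", "author": "Frank Herbert", "rating": "4.27"},
    {"title": "Emma", "author": "Jane Austen", "rating": "4.05"},
    {"title": "Persuasion", "author": "Jane Austen", "rating": None},
]


def extract():
    """Return raw records; a real pipeline would call an API or read a file."""
    return list(RAW_BOOKS)


def transform(rows):
    """Trim whitespace, cast ratings to float, and drop rows without a rating."""
    cleaned = []
    for row in rows:
        if row["rating"] is None:
            continue
        cleaned.append(
            (row["title"].strip(), row["author"].strip(), float(row["rating"]))
        )
    return cleaned


def load(conn, rows):
    """Create the (assumed) books table if needed and insert the cleaned rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS books (title TEXT, author TEXT, rating REAL)"
    )
    conn.executemany("INSERT INTO books VALUES (?, ?, ?)", rows)
    conn.commit()


def run_pipeline(conn):
    """Run the three stages in order, as an orchestrator would schedule them."""
    load(conn, transform(extract()))


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # SQLite stand-in for MySQL
    run_pipeline(conn)
    query = "SELECT author, COUNT(*) FROM books GROUP BY author ORDER BY author"
    for author, count in conn.execute(query):
        print(author, count)
```

Keeping each stage as a plain function makes it straightforward to test the stages in isolation before handing them to an orchestrator.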

Created: 2025-04-14T17:40:56
Updated: 2025-04-15T00:40:47
Stars: 0
Stars Increase: 0