AIbase

trafilatura

Public

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Creat2019-04-08T19:38:48
Update2025-03-26T19:40:42
https://trafilatura.readthedocs.io
4.6K
Stars
15
Stars Increase