wiki2txt
PublicA tool to extract plain (unformatted) multilingual text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI training / Machine Learning software.
ai-learningai-learning-toolai-trainingdata-for-robotsdata-parser-for-aimachine-learningmachine-learning-toolplaintext-data-for-aitool-for-aitraining-data
Creat:2021-12-02T04:59:39
Update:2025-03-27T09:31:23
6
Stars
0
Stars Increase