HomeAI Tutorial
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AMH-Tokenizer

Public

Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.

Creat2025-10-14T19:42:42
Update2025-10-20T16:54:07
https://pypi.org/project/amharic-tokenizer
95
Stars
1
Stars Increase