Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

bpemb

Public

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Creat2017-10-04T21:03:23
Update2025-03-22T23:42:35
https://nlp.h-its.org/bpemb
1.2K
Stars
0
Stars Increase

Related projects