Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Build-a-LLM-model-from-scratch

Public

LLM pipeline: data→tokenizer→GPT train/eval→instruction FT→sampling. Reproducible, clean configs, RTX-4060 defaults, ready for AMP/LoRA/DDP.

Creat2025-08-15T00:09:39
Update2025-08-15T01:03:19
0
Stars
-1
Stars Increase

Related projects