Tencent Huyaun Opensources HunyuanOCR Model: 1B Parameters Achieve SOTA in Multiple Scenarios, Empowering OCR Applications
Tencent Huyaun opensources the 1 billion parameter OCR model HunyuanOCR, which adopts an end-to-end design, integrating a video encoder, visual adapter, and lightweight language model. It achieves SOTA results in multiple benchmarks, with small size and easy deployment as core advantages, providing an efficient OCR solution.