The Japanese Receipt Vision-Language Model lfm2-450M is a vision-language model specifically designed for understanding and processing Japanese receipts. It is built on LiquidAI's LFM2-VL-450M base model, capable of analyzing receipt images, extracting structured information, answering questions about the receipt content, and providing detailed descriptions in Japanese and English.
Multimodal
TensorboardMultiple Languages