HomeAI Tutorial
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

Multi-modal-Sandtable

Public

Multi-Sensors funsion traffic Sandtable. Micropy with ESP32 connect env sensor and publish to MQTT. Microphone get sounds translate to text, RTSP Cam with YOLO identify The Car, Fingers positions. Using LLM intent recognition and slot filling to concat text question and semantic vision data, could answering mqtt, visual and execute operations.

Creat2025-10-14T14:02:44
Update2025-10-14T21:06:36
5
Stars
0
Stars Increase