HomeAI Tutorial
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

Multi-LLM-Voice-Agent-with-RAG

Public

Voice-first multimodal RAG agent built with Streamlit, LangChain, NVIDIA NIM (Llama 3, Granite, Phi-3 Vision) and Whisper for code, vision, and speech copiloting.

Creat2025-07-26T03:59:02
Update2025-11-19T19:50:32
0
Stars
0
Stars Increase

Related projects