Multi-modal Thermal Object Detector
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
? The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
LlamaIndex is the leading framework for building LLM-powered agents over your data.
?? - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
OpenMMLab Detection Toolbox and Benchmark
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone