A multimodal model for language-guided socially compliant robot navigation.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?
Python sample codes and textbook for robotics algorithms.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.