a lightweight C++ LLaMA inference engine for mobile devices
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?
ClickHouse? is a real-time analytics database management system
Port of OpenAI's Whisper model in C/C++
Making large AI models cheaper, faster and more accessible