Orchestrate and serve ML models with Flyte and Banana
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory!
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Port of OpenAI's Whisper model in C/C++
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.