API serving for your diffusers models
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
21 Lessons, Get Started Building with Generative AI https://microsoft.github.io/generative-ai-for-beginners/
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Making large AI models cheaper, faster and more accessible
🚀 The easiest way to automate building and releasing your iOS and Android apps
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.