Easily deploy inference models to dev, test, and production at scale
X-Ray Vision for your infrastructure!
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
The Cloud Native Application Proxy
A high-throughput and memory-efficient inference and serving engine for LLMs
?????? Awesome cheatsheets for popular programming languages, frameworks and development tools. They include everything you should know in one single file.
Port of OpenAI's Whisper model in C/C++
? The Cloud-Native API Gateway and AI Gateway.
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.