Automated bottleneck detection and solution orchestration
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The fastai deep learning library
A WebGL accelerated JavaScript library for training and deploying ML models.
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
Open3D: A Modern Library for 3D Data Processing
Open deep learning compiler stack for cpu, gpu and specialized accelerators
NVIDIA? TensorRT? is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.