State-of-the-art papers for depth estimation of 360° images.
Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
60+ Implementations/tutorials of deep learning papers with side-by-side notes, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
Port of OpenAI's Whisper model in C/C++
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
OpenMMLab Detection Toolbox and Benchmark
State-of-the-art 2D and 3D Face Analysis Project
SOTA Open Source TTS