Cameras as Relative Positional Encoding
? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
??? 60+ Implementations/tutorials of deep learning papers with side-by-side notes ?; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ? reinforcement learning (ppo, dqn), capsnet, distillation, ... ?
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
? The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A high-throughput and memory-efficient inference and serving engine for LLMs
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Port of OpenAI's Whisper model in C/C++
?? - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more