Use quantized versions of Whisper to speed up inference
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Port of OpenAI's Whisper model in C/C++
OpenMMLab Detection Toolbox and Benchmark
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go