run whisper on CPU/GPU server
Port of OpenAI's Whisper model in C/C++
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Take control over your live stream video by running it yourself. Streaming + chat out of the box.
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
A PyTorch-based Speech Toolkit