JAX Implementations of Descript Audio Codec and EnCodec
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Easily train a good VC model with voice data <= 10 mins!
Cross-platform, customizable ML solutions for live and streaming media.
A Collection of useful Methods in Java
SoftVC VITS Singing Voice Conversion
Deezer source separation library including pretrained models.
GUI for a Vocal Remover that uses Deep Neural Networks.
The free and privacy-friendly screen recorder with no limits ?