A TTS app where you can clone the voices of any person you wish.
PyTorch based Probabilistic Time Series forecasting framework based on GluonTS backend
PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.
A webui for different audio related Neural Networks
a self-hosted webui for 30+ generative ai
Code for training and test machine learning classifiers on MIT-BIH Arrhyhtmia database
Awesome Easy-to-Use Deep Time Series Modeling based on PaddlePaddle, including comprehensive functionality modules like TSDataset, Analysis, Transform, Models, AutoTS, and Ensemble, etc., supporting versatile tasks like time series forecasting, representation learning, and anomaly detection, etc., featured with quick tracking of SOTA deep models.
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Plugin that lets you ask questions about your documents including audio and video files.
A Survey of Spoken Dialogue Models (60 pages)
Fast and simple music and audio analysis using RNN in Python ?️♀️ ?