Meta's AI video tool Vibes launches in Europe, enabling users to create or remix AI-generated videos via text prompts, add music, and share across platforms for a collaborative AI content ecosystem.....
Sandbar, founded by ex-Meta employees, launches Stream smart ring. It acts as a 'voice mouse' for recording ideas, controlling music, and AI interaction, simplifying daily tasks.....
OpenAI's Sora video app launches on Android via Google Play, expanding its global short video creation influence. It introduces a 'paid roles' feature for enhanced user personalization.....
NetEase Cloud Music launches 'AI Mastering' feature for personalized audio optimization using AI to analyze songs and adjust parameters in real-time.....
Chaos is designed for modern creators, integrating all aspects of the creative process into one platform.
Advanced AI transforms lyrics into complete songs with professional vocals and instruments, supporting multiple music genres.
SongGuru AI can create songs, lyrics, and music with the help of AI and also has various audio processing functions.
AI music creation studio that can instantly generate professional, royalty-free music tracks of various types.
nvidia
Audio Flamingo 3 is an advanced, fully open-source large audio language model that can enhance the reasoning and understanding abilities of speech, sound, and music.
ACE-Step
A hybrid rap vocal model focused on improving the generation quality of Chinese rap/hip-hop music
calcuis
ACE-Step-v1-3.5B is a text-to-audio model that supports high-quality audio generation, suitable for music and sound effects creation.
walterheart
Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and sound effects.
sicto
The SICTO Vocal Separator is a high-quality vocal separation model developed based on the PyTorch framework, specifically designed to extract clear vocal parts from music audio. This model is trained on the musdb18hq dataset and can provide professional-level vocal separation effects for music production and audio editing.
dallinmackay
Stable Diffusion model fine-tuned with screenshots from the movie 'Cats (2019)', capable of transforming characters into the style of the musical 'Cats'
HKUSTAudio
AudioX is a unified diffusion transformer model capable of generating audio and music from arbitrary content. It produces high-quality general audio and musical compositions, offers flexible natural language control, and seamlessly handles multimodal inputs.
m-a-p
YuE is a series of open-source foundational models designed for music generation, particularly for converting lyrics into complete songs (lyrics2song).
awsaf49
Advanced model for end-to-end synthetic song detection, capable of identifying AI-generated complete songs (including vocals, music, lyrics, and style)
facebook
Unified automatic quality assessment model for speech, music, and sound
Gyaneshere
This model is an audio classification model fine-tuned on the GTZAN music genre classification dataset based on DistilHuBERT, with an accuracy of 84%.
Alissonerdx
YuE is a groundbreaking open-source foundational model series specifically designed for music generation, particularly for converting lyrics into complete songs (lyrics2song).
sugarblock
Music genre classification model fine-tuned on the GTZAN dataset with 93% accuracy
wkCircle
This model is an audio classification model based on the Audio Spectrogram Transformer (AST) architecture. After pre-training on the Audioset dataset, it was fine-tuned on the GTZAN music genre classification dataset.
duysal
An audio classification model based on the DistilHuBERT architecture, fine-tuned on the GTZAN dataset for music genre classification tasks.
Doctor-Shotgun
A quantized version based on the m-a-p/YuE-s1-7B-anneal-en-cot model using Exllamav2, suitable for text generation tasks, particularly excelling in music-related fields.
Felguk
This model is used to classify audio clips as either 'Suno' music or 'People' music.
YuE is a series of open-source foundational models specifically designed for music generation, particularly for converting lyrics into complete songs (lyrics2song).
FunAudioLLM
InspireMusic is a unified framework focused on music generation, song generation, and audio generation, combining audio tokenization with autoregressive transformers and flow-matching models to support high-quality long-form audio generation.
InspireMusic is a unified framework focused on music generation, song generation, and audio generation, integrating autoregressive transformers with flow-matching models through audio tokenization technology, supporting high-quality long audio generation.
AbletonMCP is an integration tool that connects Ableton Live and Claude AI. It enables two - way communication via the Model Context Protocol (MCP), allowing AI to directly control and operate Ableton Live for music creation and production.
An open - source short - video automatic generation tool that integrates text - to - speech, automatic subtitles, background videos, and music to create professional short videos from simple text input.
The MCP service of Bangumi TV provides access to the BangumiTV API, supporting queries for information on entries such as anime, manga, music, games, and related character and personnel data.
An MCP server for controlling YouTube music playback
An Apple Music API interaction server based on the MCP protocol, providing song search and playback link generation functions.
TIDAL MCP is a personalized music recommendation system that filters the TIDAL music library through LLM combined with user - defined conditions, supporting intelligent recommendation and playlist management based on playback history/playlists.
An MCP server that integrates the Spotify API, supporting playlist management, music search, and recommendation retrieval through Claude
An integration project that enables Claude Desktop to interact with Spotify via the Model Context Protocol (MCP), providing functions such as music search, playback control, and playlist management.
ArtistLens is a powerful MCP server that provides access to the Spotify Web API, supporting functions such as music search, artist information retrieval, and playlist management.
The MCP service of Bangumi TV provides access to the BangumiTV API, supporting queries for information on entries such as anime, manga, music, and games, including entry details, characters, personnel, and related data retrieval functions.
An MCP server based on ableton-js for real-time interaction and control of Ableton Live, assisting music producers in music creation.
A production - ready MCP server that enables AI - driven music generation through Strudel.cc, providing complete browser automation control, real - time audio analysis, and pattern generation functions.
An audio analysis tool based on MCP and librosa, supporting the analysis of local files, YouTube links, and audio links.
A Python - based MCP server project that can collaborate with AI assistants such as Claude to generate local music playlists in .m3u format based on the user's mood or theme and save them to the specified directory.
A QQ Music search test server based on the MCP protocol
An MCP server based on YouTube Music that allows you to search for and play music through an AI assistant.
A service based on the Model Context Protocol (MCP) that allows large language models to search, download, and play YouTube music.
The Navidrome MCP server is an AI music assistant that enables intelligent playlist creation, music discovery, and library management through natural language interaction. It supports integration with AI assistants like Claude and ChatGPT.
Mureka Model Context Protocol (MCP) is an official server that supports interaction with powerful APIs for generating lyrics, songs, and background music. The server allows MCP clients (such as Claude Desktop and OpenAI Agents, etc.) to generate lyrics, songs, and background music (instrumental).
mcp-sonic-pi is a tool that connects MCP clients with Sonic Pi, allowing users to create music through English instructions.