AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
Datasets
EN

AI News

View More

Giant Network Launches Three Multi-Modal Models: Eliminating Video Distortion and Enabling Practical Song Usage Through Voice Conversion

Giant Network AI Lab, in collaboration with Tsinghua University and North-western Polytechnical University, has launched three audio-visual multi-modal generation technologies: YingVideo-MV (music-driven video generation), YingMusic-SVC (zero-shot voice conversion), and YingMusic-Singer (voice synthesis). These technologies will be open-sourced, with YingVideo-MV capable of generating videos using only music and a person's image.

8.7k 6 minutes ago
Giant Network Launches Three Multi-Modal Models: Eliminating Video Distortion and Enabling Practical Song Usage Through Voice Conversion
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map