Tencent ARC Open-Sources AudioStory: Generating Long Audio with Large Language Models
Tencent's AudioStory model uses large language models to generate long-form narrative audio, overcoming the limitation to short audio clips. It unifies audio understanding and generation for tasks such as dubbing and synthesis, and relies on LLM integration to improve coherence.