SoundMind

Public

We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose SoundMind, a rule-based reinforcement learning (RL) algorithm tailored to endow audio language models (ALMs) with deep bimodal reasoning abilities.

Created: 2025-06-13T10:41:16
Updated: 2025-06-19T10:52:38
https://arxiv.org/abs/2506.12935
Stars: 264
Stars increase: 59
