LLaMA-Omni
PublicLLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Creat:2024-09-10T20:21:53
Update:2025-03-27T10:49:49
https://arxiv.org/abs/2409.06666
3.0K
Stars
2
Stars Increase