Amazon SageMaker has deployed the Voxtral model from Mistral AI
Mistral AI launched the Voxtral series of models, integrating text and audio processing capabilities. The series includes two models: Voxtral-Mini-3B-2507 and Voxtral-Small-24B-2507. The former is a 3-billion parameter model, suitable for fast audio transcription and basic multimodal understanding; the latter has 240 billion parameters, supporting advanced audio-text intelligence and multilingual processing, suitable for enterprise applications. Both models support audio context processing of 30 to 40 minutes.