Hume AI has officially released its third-generation voice interaction model, EVI3. The new voice AI has drawn significant industry attention for its emotional understanding and personalized interaction. EVI3 can identify emotions in a user's speech and generate voices with specific styles and personalities based on user preferences, marking a major step forward for voice AI in emotional interaction and natural communication. Below, AIbase brings you the latest news and analysis on EVI3.
Experience it at: https://demo.hume.ai/
EVI3: The Perfect Fusion of Emotional Intelligence and Voice Interaction
EVI3 is Hume AI's third-generation speech-language model, trained on multimodal datasets and integrating speech transcription, reasoning, and voice synthesis in a single system. Compared to its predecessors, EVI3 represents a qualitative leap in emotional understanding, naturalness of voice expression, and personalized customization. According to the official introduction, the model can generate entirely new voices and personality settings in under a second from a simple text prompt, supporting more than 30 complex voice styles and giving the AI distinct "personalities" or "emotions."
For example, a user can describe a character such as an "old-school comedian" or a "wise wizard," and EVI3 not only imitates the specified style precisely but also adjusts its tone and delivery dynamically to the dialogue context. This highly personalized interaction gives EVI3 strong potential in scenarios like customer service, virtual assistants, and content creation.
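To make the prompt-to-persona idea concrete, here is a minimal toy sketch of how a text prompt might map to voice-style parameters. All names and fields (`VoicePersona`, `pitch_shift`, the style table) are illustrative assumptions, not Hume AI's actual API or schema.

```python
from dataclasses import dataclass

@dataclass
class VoicePersona:
    """Hypothetical description of a generated voice.
    Field names are illustrative, not Hume AI's schema."""
    prompt: str
    style: str
    pitch_shift: float  # semitones relative to a neutral baseline

def persona_from_prompt(prompt: str) -> VoicePersona:
    """Toy keyword mapping from a text prompt to persona parameters."""
    styles = {"comedian": ("playful", 2.0), "wizard": ("gravelly", -3.0)}
    for key, (style, shift) in styles.items():
        if key in prompt.lower():
            return VoicePersona(prompt, style, shift)
    return VoicePersona(prompt, "neutral", 0.0)

persona = persona_from_prompt("old-school comedian")
```

A production system would of course condition a generative voice model on the prompt rather than match keywords; the sketch only illustrates the shape of the prompt-to-persona interface.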
Ultra-Low Latency and Intelligent Responses: Comprehensive Technical Leadership
EVI3's inference latency is as low as 300 milliseconds, significantly outperforming OpenAI's GPT-4o, comparable to the emerging voice startup Sesame, and well ahead of Google's Gemini. In a blind test with 1,720 participants, EVI3 surpassed GPT-4o across seven dimensions, including emotional expression, naturalness, voice quality, response speed, and interruption handling, showcasing a clear performance advantage.
Even more impressively, EVI3 can perform real-time search, reasoning, and intelligent responses during a conversation. For instance, mid-dialogue, EVI3 can listen to the user's speech, call external tools for information retrieval in parallel, and weave the answers seamlessly into its reply, greatly enhancing interaction fluidity and practicality. This end-to-end voice processing capability makes EVI3 a benchmark in the current field of voice AI.
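The flow described above — listening, retrieving in the background, and merging the result into the reply — can be sketched as a small async pipeline. Every name here (`search_web`, `transcribe`, `respond`) is an illustrative stand-in, not Hume AI's actual API.

```python
import asyncio

async def search_web(query: str) -> str:
    """Hypothetical tool call; a real system would hit a search backend."""
    await asyncio.sleep(0.05)  # simulate network latency
    return f"top result for '{query}'"

async def transcribe(audio_chunk: str) -> str:
    """Stand-in for real speech-to-text."""
    return audio_chunk

async def respond(user_audio: str) -> str:
    """Listen, kick off retrieval in the background, then weave
    the tool result into the spoken reply."""
    text = await transcribe(user_audio)             # listen
    search = asyncio.create_task(search_web(text))  # retrieve in parallel
    draft = f"Let me check that for you, regarding {text!r}."
    result = await search                           # tool result arrives
    return f"{draft} Here is what I found: {result}."

reply = asyncio.run(respond("EVI3 latency"))
```

The key design point is that retrieval runs concurrently with the rest of the turn, so the tool call does not stall the conversation, mirroring the low-latency behavior the article describes.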
Emotional Recognition: Making AI Understand Humans Better
Another highlight of EVI3 is its powerful emotion recognition. By analyzing the pitch, rhythm, and timbre of the user's speech, EVI3 can accurately capture emotional states and adjust its response tone accordingly, creating a more natural and empathetic human-AI interaction. Compared to traditional voice assistants, EVI3 exhibits finer emotional expression, simulating pauses, tonal shifts, and even natural verbal fillers like "umm" found in human dialogue.
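To illustrate the kind of acoustic features involved, here is a minimal, self-contained sketch of extracting two of them, pitch (via autocorrelation) and loudness (RMS energy), from a speech frame. This is a generic signal-processing illustration; EVI3's actual feature extraction is not public.

```python
import math

def estimate_pitch(samples, sample_rate, fmin=80, fmax=400):
    """Estimate the fundamental frequency of a frame via autocorrelation,
    searching lags that correspond to typical speaking pitch."""
    lo, hi = int(sample_rate / fmax), int(sample_rate / fmin)
    best_lag, best_corr = 0, 0.0
    for lag in range(lo, hi + 1):
        corr = sum(samples[i] * samples[i - lag]
                   for i in range(lag, len(samples)))
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    return sample_rate / best_lag if best_lag else 0.0

def rms_energy(samples):
    """Root-mean-square energy, a rough proxy for vocal intensity."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

# Synthetic 220 Hz tone standing in for a short voiced speech frame.
sr = 8000
frame = [math.sin(2 * math.pi * 220 * n / sr) for n in range(2048)]
pitch = estimate_pitch(frame, sr)   # close to 220 Hz
energy = rms_energy(frame)          # close to 1/sqrt(2) for a sine
```

Real systems compute such features per frame over time and feed their trajectories (plus timbre descriptors) into a learned model; the point here is only that emotional cues are carried by measurable acoustic quantities.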
Hume AI stated that EVI3's pitch, speaking rate, and emotional style were optimized through reinforcement learning, with training data covering more than 100,000 voice samples. This multimodal training approach enables EVI3 to extract the subtle features of human speech from massive datasets and generate more realistic, emotionally engaging voice expression.
Multi-Scenario Applications: Infinite Possibilities from Customer Service to Content Creation
EVI3 is now available to try through Hume AI's iOS app and online demo platform, with an API expected to launch in the coming weeks for developers to integrate into their own applications. Whether used for customer service, health coaching, immersive storytelling, or virtual companions, EVI3 provides a highly personalized, emotionally aware interaction experience.
For example, in customer service, EVI3 can adjust its tone to the user's emotional state and deliver more considerate responses; in content creation, creators can use EVI3 to produce customized audiobooks or voice acting for game characters, greatly expanding creative possibilities. Hume AI plans to further strengthen EVI3's multilingual capabilities, aiming for more proficient support of languages such as French, German, Italian, and Spanish as it expands into global markets.
Hume AI's Vision: Driving the Future of AI with Emotion
Hume AI was founded in 2021 by former DeepMind researcher Alan Cowen and is dedicated to developing AI centered on human emotion and well-being. The release of EVI3 is an important step toward that vision. According to the company, by the end of 2025 it aims to deliver a fully personalized voice AI experience, making voice the primary way humans communicate with AI.
While giants like OpenAI and Anthropic focus on improving general intelligence, Hume AI places greater emphasis on the realism and emotional resonance of voice AI. EVI3's natural-language customization tools let users create their own AI voices without complex technical work, a user-friendly design that should help bring voice AI to a broader audience.
The release of EVI3 undoubtedly injects new vitality into the field of voice AI. Its breakthroughs in emotional recognition, low-latency response, and personalized customization not only challenge the performance limits of existing voice AI models but also point the way forward for future AI interaction methods. AIbase believes that the advent of EVI3 marks a key step for voice AI moving from mechanical voice assistants toward truly "understanding" intelligent companions.