Elon Musk announced on August 11 on the X platform that xAI's Grok V7 base model completed pre-training last week. The biggest highlight of this version is its native multimodal capability: it can ingest raw video and audio bitstreams directly, without any intermediate conversion, to understand their content.
This means Grok V7 can not only understand video imagery but also perceive subtle variations in speech, accurately identifying emotion and tonal emphasis in what is said, and thereby achieve a deeper semantic understanding.
At the same time, Musk announced that the Grok 4 model is now freely available to all users. Free users can make a limited number of queries per day; heavier usage requires a paid subscription. The move aims to expand Grok's user base and make it more accessible to the general public.
The native multimodal capabilities of Grok V7 point to significant advances in video and audio processing and open up more possibilities for future AI applications. Making Grok 4 free, meanwhile, shows xAI balancing technological innovation against market penetration through complementary strategies.