Welcome to the "AI Daily" column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers to help you understand technical trends and innovative AI product applications.

Fresh AI products Click to learn more:https://top.aibase.com/

1. ByteDance Launches End-to-End Simultaneous Interpretation Model Seed LiveInterpret 2.0

The Seed team from ByteDance has launched its latest research achievement - Seed LiveInterpret 2.0, which reaches top industry standards in Chinese-English simultaneous interpretation quality, featuring low latency and real-time voice replication functions, significantly enhancing the naturalness and fluency of cross-language communication.

image.png

AiBase Summary:

🚀 Seed LiveInterpret 2.0 achieves near-human simultaneous interpretation accuracy, with extremely low latency, only 3 seconds.

🎙️ Supports real-time voice replication function, allowing synthesis of "original voice" speech translation without prior voice sample collection.

📊 In professional evaluations, Seed LiveInterpret 2.0 performed excellently in Chinese-English translation tasks, scoring far higher than other systems.

Details link: https://arxiv.org/pdf/2507.17527

2. Mistral Search API Launch: Pricing at 3 cents, Providing Multimodal Search Capabilities

Mistral AI Search officially launched its search API, offering developers a new alternative to Bing Search API. The API is priced at 0.03 yuan per query, supports multimodal search, and has no usage barriers, making it easy to integrate quickly.

image.png

AiBase Summary:

✅ Mistral AI Search API is officially launched, providing developers with a new search alternative.

💡 Priced at 0.03 yuan per query, it has market competitiveness and supports multimodal search.

🚀 Developers can test and use immediately, without complex application processes, improving integration efficiency.

3. Lovart AI Official Version Launched Globally: Full-Chain Intelligent Design Redefines the Creative Experience

The article introduces the global launch of the official version of Lovart AI, emphasizing its innovation as the first AI design agent, and how it redefines industry standards through natural language interaction and full-chain design capabilities. The article also mentions its new features ChatCanvas and the "Xingliu Agent" for the Chinese market, highlighting its profound impact on the design industry.

image.png

AiBase Summary:

🎨 Lovart AI provides high-quality visual asset generation services through natural language interaction and full-chain design capabilities.

🧠 New feature ChatCanvas supports multi-turn dialogue and real-time layout and color adjustments, improving creative efficiency.

🇨🇳 The "Xingliu Agent" optimized for the Chinese market supports Chinese semantics and national style aesthetics, helping local creators create efficiently.

4. Li Mu's Team Releases Higgs Audio v2, Pioneering a New Era in Speech Synthesis

Li Mu's team released Higgs Audio v2, a major breakthrough in the field of speech synthesis, with features such as multilingual dialogue generation, automatic rhythm adjustment, and voice cloning. The model was trained using 10 million hours of speech data and performed excellently in various tests, becoming an industry benchmark.

image.png

AiBase Summary:

🔥 Higgs Audio v2 supports multilingual dialogue generation and voice cloning, achieving complex tasks.

📊 In the EmergentTTS-Eval test, Higgs Audio v2 performed well in emotional and question categories.

🚀 Supports real-time voice chat and audio content creation, suitable for virtual anchors and voice assistants, among other scenarios.

5. Sora2 Comes into View: OpenAI Aims to Regain the Top Spot in Generative AI Video

The article introduces that OpenAI is developing the successor to its text-to-video model Sora, called Sora2, while also mentioning the popularity of Google's Veo3. This indicates that competition in the generative AI video field will become even fiercer.

image.png

AiBase Summary:

🚀 OpenAI is actively developing Sora2 to cope with competition from Google's Veo3.

💡 Sora2 has not been publicly released yet, but more information may be available in the coming weeks.

🌐 Google's Veo3 is now freely available to college students and can be experienced via Google Cloud.

6. OpenAI and Oracle Collaborate to Expand Stargate Project, Creating Thousands of Jobs

OpenAI and Oracle have reached a new agreement to expand the capacity of the Stargate project in U.S. data centers to 4.5 gigawatts, with total capacity exceeding 5 gigawatts. This marks an important step toward OpenAI's goal of reaching 10 gigawatts by 2029. The project aims to position the United States as a leader in global artificial intelligence development and has attracted participation from multiple technology companies and international investors.

image.png

AiBase Summary:

🔥 The capacity of the Stargate project has been expanded to over 5 gigawatts, aiming to reach 10 gigawatts by 2029.

🤝 OpenAI and several technology companies including Oracle are jointly promoting the project, which is expected to create over 100,000 jobs.

💰 The project has received over $1.9 billion in funding and attracted participation from investors around the world.

7. Google Photos Adds AI Features: Photos Instantly Turn into Anime, One-Click Video Generation

Google Photos introduced multiple new AI-based features, including transforming static photos into dynamic videos and converting photos into different artistic styles. These features aim to enhance the user's creative experience and continuously optimize the product through experimental methods.

image.png

AiBase Summary:

📷 The photo-to-video feature uses the Veo2 model, enabling users to easily turn static photos into 6-second dynamic videos.

🎨 The Remix feature is driven by Imagen AI, which can convert ordinary photos into anime, comic, and other artistic styles.

📌 Google has added a 'Create' tab in the Photos app, integrating various creative tools to provide a one-stop creative experience.

8. YouTube Shorts Will Launch New AI Effects: Photos Instantly Become Videos!

YouTube announced a series of revolutionary generative AI features for Shorts creators, including image-to-video conversion and AI effects. These tools can transform static photos into dynamic videos and offer various creative options, significantly lowering the barrier to entry for content creation while enhancing the appeal of the content.

image.png

AiBase Summary:

📷 The image-to-video feature allows static photos to gain life within 6 seconds, improving the efficiency of short video creation.

🎨 AI effects can turn doodles, selfies, and other simple materials into beautiful artworks, inspiring creators' creativity.

🎥 The next-generation Veo3 video generator will simultaneously generate audio, providing a more complete creative solution.

9. Google Launches Aeneas Model: Opening New Paths for Ancient Text Interpretation

Google's Aeneas model offers a new approach to interpreting ancient inscriptions, accelerating historians' work in restoring, identifying, and dating inscriptions through artificial intelligence technology. It can also be extended to other ancient languages and materials, greatly enhancing the efficiency and depth of historical research.

image.png

AiBase Summary:

🧠 Aeneas model is developed by Google DeepMind, aimed at helping historians understand ancient texts.

🗣️ The model can analyze the similarity of ancient texts, fill in missing parts, and reduce the burden on historians.

📜 Aeneas transforms texts into "historical fingerprints," helping historians interpret inscriptions in a broader context.

Details link: https://deepmind.google/discover/blog/aeneas-transforms-how-historians-connect-the-past/

10. GitHub Spark Emerges: Build Web Applications with One Sentence, AI Development Enters a New Era!

GitHub Spark enables developers and non-developers alike to quickly build personalized web applications through natural language processing technology, significantly lowering the programming barrier and providing new possibilities for micro-app development.

image.png

AiBase Summary:

🌟 GitHub Spark allows users to describe their needs in natural language and quickly generate complete web applications.

🚀 Provides a fully managed runtime environment, supporting one-click deployment and PWA adaptation, simplifying the development process.

🔧 Supports multiple model selections and integrates deeply with the GitHub ecosystem, improving development efficiency.

Details link: https://github.blog/changelog/2025-07-23-github-spark-in-public-preview-for-cop ilot-pro-subscribers/

11. Huawei M-Pencil Pro Released: 699 Yuan, Supports One-Touch Activation of Xiaoyi Smart Assistant

Huawei released the new generation of stylus, the HUAWEI M-Pencil Pro, priced at 699 yuan, featuring 16384-level pressure sensitivity, side rotation function, and multiple pen tip choices, while also supporting AI function quick access and Starlight precise positioning function, bringing more convenient and realistic creative experiences for creators.

image.png

AiBase Summary:

✨ HUAWEI M-Pencil Pro has 16384-level pressure sensitivity, accurately sensing changes in force, enhancing the realism of creation.

💡 The pen tail smart key adopts the HarmonyOS Star Ring design's breathing light, allowing one-touch activation of Xiaoyi Smart Assistant, improving operational convenience.

📍 Starlight precise positioning function supports accurate positioning within a 50-meter range, solving the problem of lost styluses.