AI Daily Report - June 30th: Baidu Open Sources the WENXIN Large Model 4.5 Series; Tongyi Qianwen Multimodal Generation Model Qwen VLo

AIbase基地

Published inAI News · 8 min read · Jun 30, 2025

Welcome to the AIbase [AI Daily] column!

Get to know today's major AI events in three minutes a day, helping you understand AI industry trends and innovative AI product applications.

Visit more AI news:https://www.aibase.com/zh

1. Baidu unveils the WENXIN Large Model 4.5 series with ten new models launched!

Baidu officially released the WENXIN Large Model 4.5 series and fully open-sourced it, including ten new models with various parameter configurations. The models were trained and inferred using the PaddlePaddle framework, achieving a FLOPs utilization rate of 47%. They perform excellently in text multimodal benchmark tests and provide a one-stop usage guide and tools, making it easy for developers to fine-tune and deploy. The models have been uploaded to platforms such as Hugging Face and GitHub.

Experience address: https://yiyan.baidu.com

Hugging Face: https://huggingface.co/baidu

GitHub: https://github.com/PaddlePaddle/ERNIE

2. Tongyi Qianwen releases the multimodal unified understanding and generation model Qwen VLo

微信截图_20250628093705.png

The Qwen VLo multimodal large model was released, based on the Qwen-VL series upgrade, using a progressive generation method, accurately understanding the world and creating high-quality content, supporting open instruction editing and modification, multi-language instruction capabilities, and can handle image and text input and output. It is currently in the preview stage, with the experience address being the Qwen Chat platform.

Experience address: chat.qwen.ai

3. Alibaba Ovis-U1震撼发布: Multimodal AI Three-in-One, Open Source Empowers Global Developers

The Alibaba International AI team released the Ovis-U1 multimodal large model, with 3 billion parameters, integrating multimodal understanding, text-to-image generation, and image editing functions. It uses an innovative architecture design and is built using Python3.10 and other technology stacks. Compliance checking algorithms were introduced during training, and code model weights are already public, helping to support applications in multiple fields.

Project: (https://huggingface.co/AIDC-AI/Ovis-U1-3B)

4. Huawei opensources PanGu 7B dense and 72B mixture-of-experts models

Huawei open-sources the PanGu 7B dense model, 72B mixture-of-experts model, and Ascend inference technology, practicing the Ascend ecosystem strategy, promoting research on large model technology and industry applications. Related model weights and code have been uploaded to open-source platforms, inviting developers to download, use, and provide feedback.

5. One picture generates a hit video! Meitu MOKI's "AI Creative Advertising" is temporarily free

微信截图_20250630083834.png

Meitu MOKI launched the "AI Creative Advertising" feature, allowing users to upload images and select templates to generate professional-level videos. It integrates seven mainstream video generation models. The experience address is www.moki.cn, completing the entire process from creativity to final production.

Experience address: www.moki.cn

6. Gemini 2.5 Pro API returns to free tier, developer community responds enthusiastically

Google's Gemini 2.5 Pro API has returned to the free tier of Google AI Studio. This model has strong multimodal and reasoning capabilities, supports multiple input types. This free return provides developers with innovation opportunities, doubling free computing resources, and the community has responded positively.

7. Douyin's "Deep Research" function is now available on Douyin APP, web version, and PC version for testing

微信截图_20250630140622.png

Douyin APP and other platforms have started testing the "Deep Research" function, which can integrate massive deep information to generate research reports or visual web results. Users can get customized reports within minutes by entering instructions and also support一键 converting into podcast formats.

8. Xiaomi's "AI Toolbox" internal testing period ends, service will be suspended starting July 5th

Xiaomi's "AI Toolbox" internal testing has ended, and the service will be suspended starting July 5th. The internal testing collected data feedback, not a discontinued project but rather a strategic planning for data organization. Xiaomi continues to invest in AI exploration and build a multi-layered, full-scenario AI ecosystem.

9. New open-source AI system OmniGen2: Integrates image and text generation like GPT-4o

The Beijing Artificial Intelligence Research Institute launched the open-source system OmniGen2, focusing on text and image generation and editing. It uses an independent decoding path, based on the Qwen2.5-VL-3B transformer, and uses a custom diffusion transformer with a reflection mechanism. Its performance is excellent in multiple benchmark tests and will be released on the Hugging Face platform.

Project: https://huggingface.co/OmniGen2/OmniGen2

10. Zhihu "Direct Answer" upgrades knowledge base function, deeply integrates community content to create an immersive AI Q&A experience

Zhihu "Direct Answer" upgrades the knowledge base function, deeply integrating community content, bringing innovative features such as immersive reading, aiming to provide an immersive AI Q&A experience across multiple scenarios, expanding the influence of answerers' content, and reducing user query costs.

AntGroup Launches Multilingual Visual Large Model Training Framework to Break Language Barriers!

AntGroup introduced a multilingual multimodal large model training framework at the Hong Kong FinTech Festival, breaking through the bottlenecks of multilingual applications. This technology targets small languages such as Egyptian Arabic, and through a language-aware optimization framework, it achieves a 'thinking in the target language' mechanism, improving the training effectiveness for resource-scarce languages.

Apple Siri Will Undergo a Major Transformation! Paying Google to Help with AI Upgrades

Apple faced obstacles in developing its own Siri large model, so it turned to a collaboration with Google, adopting a customized Gemini language model to enhance AI capabilities. The new strategy will adopt an 'edge-cloud collaboration' hybrid model, combining the advantages of cloud-based large models with local data privacy protection, aiming to optimize user experience and address shortcomings in handling complex tasks.

Modern Motor and NVIDIA Collaborate to Build a $3 Billion Artificial Intelligence Factory

Modern Motor and NVIDIA deepen their cooperation to build an AI factory based on the Blackwell architecture. The two companies announced joint development of projects in autonomous driving, smart factories, and robotics at CES. The project has received support from the South Korean government and will be detailed at the 2025 APEC Summit in South Korea.

Large Models Are Disrupting Financial Services: Du Xiaoman CEO Reveals How AI Is Helping Promote Inclusive Finance

The 2025 Hong Kong Fintech Week focuses on the integration of fintech and AI, bringing together guests such as Carrie Lam and Geoffrey Hinton. Zhu Guang, CEO of Du Xiaoman, emphasized the innovative applications of large models in financial services, driving customer service from monthly surveys to real-time responses, achieving a revolutionary transformation centered around customer-centricity.

Yuanbao Integrates WeChat Pay, Adds Three AI Features: Automatic Collection, Copy Editing, etc.

WeChat Pay has launched the "Business Payment Code" feature. After small and medium merchants activate it, they can conveniently collect payments within WeChat, automatically calculate accounts, and receive community copywriting and technical development guidance, simplifying daily business processes and improving efficiency.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Deployment Calculator

AI Dataset Collection

Intelligent Document Recognition

AI Daily Report - June 30th: Baidu Open Sources the WENXIN Large Model 4.5 Series; Tongyi Qianwen Multimodal Generation Model Qwen VLo

AIbase基地

This article is from AIbase Daily

AI News Recommendations

AntGroup Launches Multilingual Visual Large Model Training Framework to Break Language Barriers!

Apple Siri Will Undergo a Major Transformation! Paying Google to Help with AI Upgrades

Modern Motor and NVIDIA Collaborate to Build a $3 Billion Artificial Intelligence Factory

Large Models Are Disrupting Financial Services: Du Xiaoman CEO Reveals How AI Is Helping Promote Inclusive Finance

Yuanbao Integrates WeChat Pay, Adds Three AI Features: Automatic Collection, Copy Editing, etc.

ByteDance AI Programming Tool Trae Removes Claude Model Pro Membership Compensation

AI Daily: Kunlun Tech SkyReels V3 Model Released; Moonshot AI Launches Kimi Linear Model; MiniMax Music 2.0 Released

AI One-Click Transformation PPT Master! New Feature Launches in Gemini Canvas, Instantly Liberating Professionals

Ant Group Launches Multilingual Visual Large Model Training Framework for Efficient Identification of Document Forgery and Logical Contradictions

Wenxin Magic Comic Function Launch: One Sentence, One Image, Two Minutes to Generate a Serial! Everyone Can Be a Cartoonist

GEO Services