Welcome to the 【AI Daily】 column! Here is your daily guide to exploring the world of artificial intelligence. Every day, we present hot content from the AI field, focusing on developers to help you gain insight into technical trends and understand innovative AI product applications.

Discover fresh AI products here: https://top.aibase.com/

1. Alibaba Cloud's Qwen Code Intelligence IDE Officially Launched, Bringing a New Programming Experience

Alibaba Cloud has launched Qwen Code Intelligence IDE, an AI development environment deeply adapted to Qwen3. It offers programming intelligence, long-term memory, in-line suggestion prediction, and in-line dialogue, which significantly improve development efficiency, and it has become one of the most popular programming assistant tools in China.

[AiBase Summary:]

🚀 AI IDE Launch: Alibaba Cloud's Qwen Code Intelligence IDE has officially been released and is free to download.

🧠 Powerful Features: Supports programming intelligence, long-term memory, and in-line suggestion predictions, greatly improving development efficiency and simplifying the programming process.

🌐 Wide Application: The Qwen Code plugin has over 15 million downloads and is widely used by enterprises such as FAW Group and NIO, receiving high praise.

2. Xiaomi Open-Sources Its Multi-Modal Large Model MiMo-VL

MiMo-VL-7B performs strongly across multiple multi-modal tasks: with only 7 billion parameters, it surpasses larger closed-source models. Its visual perception capabilities and innovative training methods make it a standout among open-source models.

[AiBase Summary:]

Xiaomi's self-developed MiMo-VL-7B leads significantly in multi-modal reasoning tasks. With only 7 billion parameters, it surpasses Alibaba's Qwen2.5-VL-72B, a model roughly ten times its size.

Through high-quality pre-training data and mixed online reinforcement learning algorithms, MiMo-VL-7B demonstrates excellent generality across image, video, and language tasks.

The model excels not only in academic competitions but also in practical applications such as complex image reasoning and GUI operations, enhancing user experience.

For more details: https://huggingface.co/XiaomiMiMo

3. Black Forest Labs Releases FLUX.1 Kontext: Edit Images Iteratively via Text and Reference Images

Black Forest Labs' FLUX.1 Kontext is an image generation model that supports multiple rounds of editing through text and reference images. It features character consistency, local editing, style reference, and low latency, giving enterprises a way to iterate rapidly.

[AiBase Summary:]

Contextual generation capabilities make image generation more flexible and efficient, generating based on reference images or context rather than starting from scratch.

Supports local editing driven by text and reference images, maintaining character consistency without affecting the overall image style.

As a flow model, it can start from existing images and achieve instant and flexible editing through simple text instructions.

For more details: https://bfl.ai/announcements/flux-1-kontext

4. Midjourney V7 Major Update: Rendering Speed Up 40%, New Round of User Voting on Feature Development

Midjourney V7 brings several major updates, including a 40% increase in rendering speed, upgraded AI moderator functionality, and the launch of the second round of community roadmap voting activities. These updates not only improve work efficiency but also enhance the user creation experience.

[AiBase Summary:]

🔥 Rendering speed increases by 40%, greatly improving creation efficiency.

🌟 AI moderator functionality upgraded, providing more precise optimization suggestions.

🗳️ The second round of community roadmap voting begins, letting users help decide future feature development.

For more details: https://midjourney.com/ideas

5. DeepSeek Becomes World’s Second Largest AGI Lab

With R1-0528, DeepSeek has made significant gains in model performance and open weights, surpassing xAI, Meta, and Anthropic and tying Google for second place.

[AiBase Summary:]

🌟 With R1-0528, DeepSeek surpasses several top AI labs to rank second among AI labs globally.

📈 Its intelligence index score jumps from 60 to 68, a gain comparable to the step from OpenAI's o1 to o3.

🚀 Establishes leadership in open weights, promoting wider access to the technology and further innovation.

6. Hugging Face Enters Humanoid Robot Market: Launches Open-Source Robot HopeJR for $3,000

Hugging Face has officially entered the robotics hardware sector by releasing two open-source humanoid robots, HopeJR and Reachy Mini, aiming to break the monopoly of big tech companies in robotics technology.

[AiBase Summary:]

Releases HopeJR and Reachy Mini robots, targeting full-size and desktop-level applications respectively.

Robots are open-source and affordable, avoiding the monopolization of robot technology by a few large companies.

Strategic acquisition of Pollen Robotics and long-term ecosystem development have supported product development.

7. ByteDance's Volcano Forge Officially Integrates DeepSeek-R1-0528

ByteDance's Volcano Forge platform has integrated the latest version of DeepSeek-R1-0528, offering a high-performance service system and rich features that give enterprises and developers an efficient and convenient application experience.

[AiBase Summary:]

Volcano Forge achieves per-token inference latency as low as 30 ms through its self-developed xLLM framework, ensuring stability and fluent real-time interaction.

Supports features such as Function Call and networking, covering diverse application scenarios and meeting high-concurrency needs.

Offers new customers a 50% discount and several ways to try the service, helping teams onboard quickly and deploy large model applications easily.
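
To put the 30 ms per token figure in perspective, the minimal sketch below converts per-token latency into throughput and estimates how long a response would take to generate. The function names and the 500-token response length are illustrative assumptions, not figures published for Volcano Forge, and the estimate ignores time to first token and network overhead.

```python
def tokens_per_second(ms_per_token: float) -> float:
    """Convert per-token latency (milliseconds) into throughput (tokens/second)."""
    if ms_per_token <= 0:
        raise ValueError("ms_per_token must be positive")
    return 1000.0 / ms_per_token


def generation_time_seconds(num_tokens: int, ms_per_token: float) -> float:
    """Estimate decode time, ignoring time to first token and network overhead."""
    return num_tokens * ms_per_token / 1000.0


if __name__ == "__main__":
    latency_ms = 30.0       # per-token latency quoted in the summary above
    response_tokens = 500   # hypothetical response length, chosen for illustration
    print(f"{tokens_per_second(latency_ms):.1f} tokens/s")
    print(f"{generation_time_seconds(response_tokens, latency_ms):.1f} s "
          f"for {response_tokens} tokens")
```

At 30 ms per token, this works out to roughly 33 tokens per second, so a 500-token answer would take about 15 seconds of decoding under these assumptions.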

8. Anthropic Announces 'Circuit Tracing' Tool: Unlocking AI's Brain, Deciphering the Entire Decision-Making Process of Large Models

Anthropic has released an open-source tool named 'Circuit Tracing', which generates attribution graphs to display the internal decision-making paths of large language models, improving understanding of AI decision mechanisms and promoting transparency in AI development.

[AiBase Summary:]

✨ The 'Circuit Tracing' tool reveals the internal decision-making paths of large models through generated attribution graphs, making the AI 'thinking' process visible.

🔍 Provides an interactive Neuronpedia front-end, lowering research barriers so that non-specialists can gain an initial understanding of the model's decision-making process.

🌐 Open-source release promotes AI transparency and controllability, helping address ethical and safety challenges such as model hallucinations and biases.

9. Alibaba Open-Sources Autonomous Search AI Agent WebAgent for More Efficient Research

WebAgent is an AI agent that simulates human behavior to actively search, analyze, and make decisions in web environments, greatly improving information retrieval efficiency. Its two clearly defined modules, WebDancer and WebWalker, handle agent training and language model benchmark testing respectively, with WebDancer's multi-step reasoning capability being particularly notable.

[AiBase Summary:]

🔍 WebAgent has end-to-end information retrieval and multi-step reasoning capabilities, enabling active searching, analysis, and decision-making, greatly enhancing research efficiency.

📚 WebAgent achieves complex information retrieval through WebDancer and WebWalker modules, with WebDancer's innovative algorithm significantly improving data efficiency and strategy robustness.

🌐 WebAgent supports multi-domain applications, such as academic research and market analysis, capable of integrating different literature to generate comprehensive research reports.

For more details: https://github.com/Alibaba-NLP/WebAgent

10. Hume Releases Voice Language Model Hume EVI3: Low Latency, High Emotion

Hume has released EVI3, a new voice language model that combines low latency with highly expressive, emotional voice generation, marking a significant advance in voice interaction.

[AiBase Summary:]

Voice-to-voice technology supports voice generation in arbitrary styles and accurately conveys emotion and tone.

Low latency ensures smooth real-time dialogue, enhancing immersion and interaction efficiency.

Applicable to virtual assistants, education, entertainment, and cross-language scenarios, demonstrating strong practical value.

For more details: https://demo.hume.ai

11. Manus Slides Officially Released: One-Click Professional Slide Generation

Manus has launched a new feature, Manus Slides, which quickly generates structured slides from a single prompt. It applies to a variety of scenarios and significantly improves the efficiency of creating presentations.

[AiBase Summary:]

✨ Intelligent generation and efficient editing: Enter a short prompt, and the AI automatically generates and optimizes slide content, with support for immediate adjustments.

🎯 Wide application: Suitable for business, education, and creative fields, helping users quickly produce high-quality presentations.

🌐 Enhanced global competitiveness: AI-driven automation lowers barriers and drives innovation in productivity tools.

12. Turn Phone Photos into Art! Runway Gen-4 References Unlocks New Ways to Play with Your Camera Roll

Runway's Gen-4 References feature now supports mobile devices, allowing users to upload photos from their phones and combine them with natural language prompts to generate artworks in a consistent style, making creation more convenient and varied.

[AiBase Summary:]

📱 Use your phone to upload photos and easily transform everyday shots into art pieces.

🎨 Combine natural language prompts to maintain consistency in people, scenes, and styles.

🌟 Supports various material types, enhancing creativity and realism.