Welcome to the "AI Daily" column! This is your daily guide to the world of artificial intelligence. Each day we bring you the hottest topics in AI, with a developer focus, to help you follow technology trends and innovative AI product applications.
Discover new AI products: https://top.aibase.com/
1. Kuaishou Kling 2.1 Launches a New First-and-Last-Frame Feature
Kuaishou's Kling 2.1 model has launched a new first-and-last-frame feature that significantly improves the quality and smoothness of generated video, with better transitions and closer adherence to text prompts. The model also shows marked gains in motion performance, semantic understanding, and generation efficiency, making it suitable for a range of professional video creation scenarios.
【AiBase summary:】
🎥 Kling 2.1 adds a first-and-last-frame feature, giving precise control over how a video begins and ends.
💡 Supports custom first and last frame images, fixing stiff transitions and suiting professional video creation.
⚡ Generation is faster and cheaper, improving creators' efficiency.
2. Kunlun Wanwei Launches AI Music Model Mureka V7.5 and Introduces MoE-TTS Voice Synthesis Model
Kunlun Wanwei Group launched the Mureka V7.5 model on August 15, 2025, marking the successful conclusion of its SkyWork AI Technology Release Week. The model excels at Chinese song creation, with more authentic and emotionally expressive vocals. Paired with the MoE-TTS voice synthesis framework, it also improves the naturalness and controllability of synthesized speech.
【AiBase summary:】
🎧 Mureka V7.5 excels at Chinese song creation, with improvements in timbre, performance technique, diction, and emotional expression.
🎤 MoE-TTS controls voice characteristics and style precisely through natural language descriptions, addressing cases where complex phrasing caused generated speech to deviate from expectations.
🌐 Kunlun Wanwei demonstrated strong capabilities in AI music creation and voice synthesis, offering new ideas for research and development in related fields.
3. Tencent Cloud Launches AI Development Tool CloudBase AI CLI, Reducing Coding by 80%
Tencent Cloud has launched CloudBase AI CLI, an AI command-line tool deeply integrated with its cloud development platform and designed to give developers a more efficient, convenient workflow. The tool exposes multiple AI programming tools through a single command-line entry point and covers the entire process from code generation to application deployment.
【AiBase summary:】
🔥 CloudBase AI CLI provides a unified command-line interface, simplifying the development process.
🌐 Supports cross-platform compatibility and multi-model collaboration, meeting the needs of different development scenarios.
💡 Provides free trial quotas, lowers the usage threshold, and improves the cost-effectiveness of AI.
Install command: curl https://static.cloudbase.net/cli/install/install.sh -fsS | bash
4. Overseas New Product MuleRun Goes Viral! Each User Gets Their Own Virtual Machine, AI Agent Automatically Plays Games and Does Modeling
MuleRun is a new AI product built around two ideas: a dedicated virtual machine for each user and a community-driven agent ecosystem. Together they show how broadly AI agents can be applied, from playing games to 3D modeling.
【AiBase summary:】
🎮 MuleRun's AI agent can automatically complete game tasks, greatly enhancing the user experience.
💻 MuleRun provides users with a dedicated virtual machine environment, supporting the operation of various software and applications.
🌐 A community-driven agent ecosystem allows ordinary users to easily use automation tools, lowering the technical barrier.
Details link: https://discord.com/invite/kKAAEYay5F
5. Meta Open-Sources DINOv3! A Vision Model That Needs No Manual Annotation, Reshaping the Future of Image Recognition
Meta AI has open-sourced DINOv3, its next-generation general-purpose vision model. Trained with self-supervised learning, it achieves excellent performance without manual annotation and is regarded as a new milestone in AI vision technology. DINOv3 excels at high-resolution feature extraction and multi-task adaptability, applies to fields such as environmental monitoring, healthcare, and autonomous driving, and lowers development barriers by being open source.
【AiBase summary:】
🧠 Self-supervised learning: Extracts features from massive collections of unannotated images, with no manual labeling.
🖼️ High-resolution feature extraction: Captures global information and local details simultaneously, supporting various visual tasks.
🚀 Wide range of applications: Suitable for cross-domain applications such as environmental monitoring, healthcare, and autonomous driving.
Details link: https://github.com/facebookresearch/dinov3
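To make "learning without manual annotation" concrete, here is a minimal sketch of a self-distillation loss in the style of the original DINO line of work: a student network is trained to match a teacher's output on a different view of the same image, so no label is ever needed. This is an illustration only, not DINOv3's actual training code; the function names, temperatures, and toy logits are all assumptions.

```python
import math
from typing import List


def log_softmax(logits: List[float], temperature: float) -> List[float]:
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_total for x in scaled]


def self_distillation_loss(
    student_logits: List[float],
    teacher_logits: List[float],
    center: List[float],
    student_temp: float = 0.1,   # assumed value for illustration
    teacher_temp: float = 0.04,  # assumed value for illustration
) -> float:
    """Cross-entropy between a centered, sharpened teacher and the student."""
    # Teacher target: subtract a running center, then sharpen with a low temperature.
    centered = [t - c for t, c in zip(teacher_logits, center)]
    teacher_probs = [math.exp(lp) for lp in log_softmax(centered, teacher_temp)]
    # Student prediction on another augmented view of the same image.
    student_log_probs = log_softmax(student_logits, student_temp)
    return -sum(p * lp for p, lp in zip(teacher_probs, student_log_probs))


# Toy outputs for two views of one image (hypothetical numbers).
teacher_view = [2.0, 0.5, -1.0]
center = [0.0, 0.0, 0.0]
loss_agree = self_distillation_loss([2.0, 0.5, -1.0], teacher_view, center)
loss_disagree = self_distillation_loss([-1.0, 0.5, 2.0], teacher_view, center)
print(loss_agree < loss_disagree)
```

The loss is small when the student agrees with the teacher and large when it does not; the training signal comes entirely from the images themselves.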
6. Spring Festival Star Wins Again! Unitree H1 Wins the First 1500m Gold Medal in Robot History
Unitree Robotics' humanoid robot H1 won the first-ever 1500m gold medal at a global competition for humanoid robots, showcasing outstanding speed and endurance.
【AiBase summary:】
🏃‍♂️ Unitree's humanoid robot H1 took the first 1500m gold medal ever awarded at a global humanoid robot competition.
🏆 The competition attracted 280 teams and over 500 humanoid robots from 16 countries, showcasing the industry's top level.
🤖 Software optimizations for running speed and endurance let H1 demonstrate breakthroughs in both top speed and stamina.
7. Google Gemini Receives Major Update! New Memory Function and Temporary Chat Mode Added
Google has introduced two new features for its Gemini AI assistant: a memory function and a temporary chat mode. Together they mark significant progress in personalization and privacy protection for AI assistants. The memory function learns about the user over time to provide more accurate help, while the temporary chat mode ensures that conversation content is not saved.
【AiBase summary:】
🧠 The memory function records user preferences and habits to enhance personalized service experiences.
🔒 The temporary chat mode ensures privacy, and conversation content will not be saved or used for training.
💡 These two features represent dual breakthroughs in personalization and privacy protection for AI assistants.
8. University of Hong Kong and Partners Open-Source OpenCUA to Build Personalized Computer-Use Agents!
The University of Hong Kong, together with multiple institutions, has open-sourced the OpenCUA framework, which aims to help developers build personalized computer-use agents (CUAs) that boost user productivity. The framework ships with rich data and powerful tooling, showing its potential for intelligent assistant development.
【AiBase summary:】
🧠 OpenCUA framework provides a seamless annotation infrastructure for capturing human operations on computers.
📊 Integrates AgentNet dataset, covering over 200 applications and websites, supporting multiple operating systems.
🚀 Supports scalable workflows, converting demonstrations into "state-action" pairs, enhancing long-chain reasoning ability.
Details link: https://opencua.xlang.ai/
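As a rough illustration of what converting a demonstration into "state-action" pairs can look like, the sketch below turns a recorded sequence of steps into training examples, each pairing what the user saw with what they did next. The `Event` and `StateActionPair` schemas and the sample demo are hypothetical and are not taken from OpenCUA's code or the AgentNet data format.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Event:
    """One recorded step of a human demonstration (hypothetical schema)."""
    screenshot: str  # identifier of the screen observed before acting
    action: str      # what the user did, e.g. "click(start_menu)"


@dataclass
class StateActionPair:
    state: str                # the observation at this step
    history: Tuple[str, ...]  # actions taken earlier in the same demonstration
    action: str               # the action the human took in this state


def to_state_action_pairs(demo: List[Event]) -> List[StateActionPair]:
    """Emit one (state, history) -> action example per recorded step."""
    pairs: List[StateActionPair] = []
    history: List[str] = []
    for event in demo:
        pairs.append(StateActionPair(event.screenshot, tuple(history), event.action))
        history.append(event.action)
    return pairs


# A made-up three-step demonstration.
demo = [
    Event("desktop.png", "click(start_menu)"),
    Event("menu_open.png", "type('notepad')"),
    Event("search_results.png", "press('Enter')"),
]
pairs = to_state_action_pairs(demo)
for p in pairs:
    print(p.state, "->", p.action, "| prior actions:", len(p.history))
```

Carrying the action history along with each state is one simple way to give a model context for multi-step tasks; a real pipeline would likely also attach reasoning traces and richer observations.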
9. OpenAI May Introduce Ads in ChatGPT, Executive Says "Keeping Flexibility Is Important"
OpenAI is exploring ways to increase revenue, including introducing ads in ChatGPT. Executive Nick Turley said ads would need to be handled carefully to avoid hurting the user experience, but the company is still considering ad models for other products. Meanwhile, the subscription model still has great growth potential.
【AiBase summary:】
📌 OpenAI is considering ads in ChatGPT but says they must be handled carefully to protect the user experience.
💡 Executives believe the subscription model still has great growth potential and there are many untapped opportunities.
📈 OpenAI expects subscription revenue to reach $12.7 billion in 2024, but it will take until 2029 to achieve positive cash flow.
10. Google Releases Ultra-Small and Efficient Open Source AI Model Gemma 3 270M, Can Run on Smartphones
Google DeepMind has released Gemma 3 270M, an open-source AI model with 270 million parameters. Compact and energy-efficient, it can run offline on lightweight devices such as smartphones and the Raspberry Pi. It performs well on instruction-following tasks and can be fine-tuned quickly, making it suitable for enterprise development and creative scenarios.
【AiBase summary:】
🧠 Gemma 3 270M is an open-source AI model with 270 million parameters, suitable for offline operation on smartphones.
⚡ Performs well on instruction-following tasks and is highly energy-efficient: in Google's internal tests, the INT4-quantized model used only 0.75% of a Pixel 9 Pro's battery across 25 conversations.
📱 Supports rapid fine-tuning, suitable for enterprise development and creative applications, meeting diverse needs.
Details link: https://developers.googleblog.com/en/introducing-gemma-3-270m/