Welcome to the "AI Daily" section! This is your guide to exploring the world of artificial intelligence every day. Each day, we present you with the latest content in the AI field, focusing on developers, helping you understand technology trends and innovative AI product applications.

Fresh AI products click to learn more: https://top.aibase.com/

1. Alibaba Tongyi Open-Sources ThinkSound, the First Audio Generation Model Supporting Chain-of-Thought Reasoning

The Alibaba speech AI team has open-sourced ThinkSound, the world's first audio generation model that supports chain-of-thought reasoning. By introducing a thinking chain technique, this model breaks through the limitations of traditional video-to-audio technology, achieving high-fidelity, strong synchronization spatial audio generation. This technological advancement marks a leap from "image-to-audio dubbing" to "structured understanding of scenes" in AI audio.

image.png

【AiBase Highlights:】

🧠 ThinkSound is the first to combine multimodal large language models with a unified audio generation architecture, achieving precise audio synthesis.

📊 The research team built an AudioCoT dataset containing 2,531.8 hours of high-quality samples, enhancing the model's ability to handle complex instructions.

🚀 ThinkSound outperforms mainstream methods in multiple test sets. Code and pre-trained weights are now open-source, available for free to developers.

More details: https://github.com/FunAudioLLM/ThinkSound https://huggingface.co/spaces/FunAudioLLM/ThinkSound https://www.modelscope.cn/studios/iic/ThinkSound

2. Google Veo3 Makes Major Upgrades, Supporting Dynamic Video Generation from Static Images

Google announced a major upgrade to its AI video generation tool Veo3, allowing users to generate high-quality audio and video content by simply uploading a single static photo, demonstrating the huge potential of AI in the creative field. Veo3's core features include maintaining character consistency across multiple shots and offering rich camera movement functions, such as dolly-in shots. Additionally, users can choose different quality models, but they need to use corresponding credits.

image.png

【AiBase Highlights:】

🖼️ After the upgrade, Veo3 supports generating high-quality dynamic videos from a single static image.

🎥 Supports camera movement functions like dolly-in shots, enhancing the professionalism of videos.

🔊 Users can choose different quality models, but they need to use corresponding credits.

3. Hugging Face Launches New Small-Parameter Model SmolLM3: 128K Context, Dual-Mode Reasoning

Hugging Face released SmolLM3, a small open-source model with 3 billion parameters, outperforming Llama-3.2-3B and Qwen2.5-3B. The model supports multilingual processing and offers dual-mode reasoning, while also publicly releasing architectural details to promote research and optimization.

image.png

【AiBase Highlights:】

🧠 SmolLM3 has 3 billion parameters and outperforms similar open-source models, supporting multilingual processing.

⚙️ Provides both deep thinking and non-thinking reasoning modes, flexibly addressing different needs.

📊 Uses an advanced transformer decoder architecture and enhances capabilities through a three-stage hybrid training process.

More details: https://huggingface.co/HuggingFaceTB/SmolLM3-3B-Base

4. Alibaba Open-Sources WebSailor, Featuring Strong Reasoning and Retrieval Capabilities

Alibaba Tongyi open-sourced WebSailor, a web agent that performs well in the BrowseComp evaluation sets for Chinese and English tasks, surpassing closed-source models like DeepSeek R1 and Grok-3, showcasing strong reasoning and retrieval capabilities. Galaxy Securities pointed out that the AI Agent economy is fully launched and recommended paying attention to SAAS companies with leading positions. Listed companies such as Jiaodian Technology and Zhongke Jincai have already made progress in AI Agent technology, promoting the development of agent technology.

image.png

【AiBase Highlights:】

📌 Alibaba Tongyi open-sources WebSailor, showing strong reasoning and retrieval capabilities.

📈 Galaxy Securities pointed out that the AI Agent economy is fully launched, recommending attention to related SAAS companies.

💡 Companies such as Jiaodian Technology and Zhongke Jincai have clear advantages in agent technology applications.

More details: https://github.com/Alibaba-NLP/WebAgent

5. Moonvalley Releases Marey Realism v1.5: Native 1080P AI Video Model, Zero Copyright Risk Leading Industry Trends!

Moonvalley's Marey Realism v1.5 AI video generation model achieved comprehensive upgrades in image quality, creative freedom, and legal compliance. Its native 1080P video generation capability, training data based on authorized content, and accurate interpretation of complex prompts provide safer and more efficient tools for film production and advertising creativity.

image.png

【AiBase Highlights:】

🎥 Native 1080P video generation capability, providing a visual experience close to real filming.

🔒 Trained on 100% authorized data, completely avoiding copyright risks.

🔄 Supports text-to-video and image-to-video generation, enhancing creative flexibility.

6. Vidu Q1 Shock Upgrade: Reference-to-Video Support for Up to Seven Images, AI Video Generation Sets New Records

Vidu Q1's 'Reference-to-Video' feature allows users to upload up to seven reference images to generate 1080p videos with extremely high visual consistency. This technology ensures consistency of multi-image elements in the video through semantic fusion, solving issues of scene breaks or character distortion in traditional AI video generation, providing creators with powerful tools.

image.png

【AiBase Highlights:】

🎥 Supports up to seven reference images, enhancing video creation flexibility.

🔍 Semantic fusion technology ensures high consistency of multi-image elements in videos.

🔄 Multi-subject consistency technology achieves coherent visual experiences in complex scenes.

7. Apple Develops AI Customer Service Assistant Similar to ChatGPT, Enhancing User Support Experience

Apple is developing an AI-based 'Support Assistant' aimed at providing users with smarter and more efficient customer service. This feature has been found in the code of the Apple Support app, and in the future, it will allow users to receive AI-generated solutions before contacting customer service, improving service efficiency.

image.png

【AiBase Highlights:】

🍎 Apple is developing an AI-based support assistant to enhance customer service efficiency.

💬 Users can obtain AI-generated solutions before contacting customer service, reducing waiting time.

🔄 The support assistant may allow file uploads, enriching the interactive experience.

8. Feishu Launches Multiple AI Products, Building an Enterprise-Level "Doubao"

Feishu launched multiple AI products, including knowledge QA, AI meetings, Aily, Feishu Miaoda, aiming to accelerate the implementation of AI in enterprise applications. At the same time, Feishu also launched the industry's first AI application maturity model to help enterprises evaluate the actual effectiveness of AI products.

image.png

【AiBase Highlights:】

🚀 Feishu launches multiple AI products to help enterprises achieve intelligent operations.

📊 Launches an AI application maturity model to improve enterprises' ability to assess AI products.

📈 Feishu Multidimensional Tables achieve a dual leap in performance and AI capabilities, supporting large-scale data processing.

9. Microsoft, OpenAI, and Anthropic Jointly Launch Educator AI Training Center

The American Federation of Teachers (AFT) jointly established the National Artificial Intelligence Education Academy with Microsoft, OpenAI, and Anthropic, aiming to provide free AI tool training for teachers to help them better utilize artificial intelligence technology. This project received $23 million in funding support, driving technological changes in the education sector.

image.png

【AiBase Highlights:】

👩‍🏫 Teachers will master new technologies through AI training to ensure their leadership in education.

💰 Microsoft, OpenAI, and Anthropic provide $23 million in funding support for AI education projects.

📚 The AI Academy is committed to promoting educational democratization, ensuring that technology serves students and teachers.