Welcome to the "AI Daily" column! This is your guide to exploring the world of artificial intelligence every day. Each day, we present you with the latest content in the AI field, focusing on developers to help you understand technical trends and learn about innovative AI product applications.

Fresh AI products click to learn more:https://app.aibase.com/zh

1. Alibaba Open Sources Z-Image Image Model: Supports Bilingual Text Rendering

Alibaba's Tongyi Lab has open-sourced a new image generation model called Z-Image. With only a 6B parameter scale, it achieves efficient image generation and editing, with visual quality close to three times the parameter level of commercial models. Its lightweight architecture and high performance make it suitable for consumer-grade devices, and it excels in complex instruction understanding and bilingual rendering.

image.png

【AiBase Summary:】

🔥 Z-Image uses a single-stream DiT architecture, including Turbo, Base, and Edit variants, to meet different needs.

💡 Supports bilingual text rendering, solving the pain points of traditional AI models in text processing.

🚀 Memory usage as low as 16GB, can run smoothly on consumer-grade GPUs, improving image generation efficiency.

Details: https://tongyi-mai.github.io/Z-Image-homepage/

2. Quark AI Glasses Launch: Equipped with Dual Flagship Chips and Integrated with Alibaba Qwen

The launch of Quark AI Glasses marks the first time that Alibaba Qwen enters the physical world. Through hardware upgrades and innovative technology, it provides users with a more efficient and convenient AI experience.

image.png

【AiBase Summary:】

📱 Equipped with dual flagship chips, improving the response speed and performance of Qwen.

📷 Introduces mobile-level imaging capabilities, enhancing photo quality and stability in low-light environments.

🔋 Uses a dual battery interchangeable design, ensuring long-term online standby.

3. Opera Neon Browser Major Upgrade: 1-Minute Research + Gemini3 One-Click Switch + Google Docs Instant Writing

Opera Neon browser released a major update, adding a '1-Minute Deep Research' mode, integrating Gemini3Pro and Nano Banana Pro dual models, and supporting natural language creation and editing of Google Docs for the first time. This feature improves user efficiency between quick queries and comprehensive research, while providing an automated solution for document writing.

image.png

【AiBase Summary:】

✨ Added '1-Minute Deep Research' mode, improving efficiency in handling complex problems.

🔄 Supports switching between Gemini3Pro and Nano Banana Pro models, flexibly responding to multi-stage tasks.

📝 Integrates Google Docs intelligent agent, enabling natural language operation of documents and improving writing efficiency.

4. Tsinghua University Releases AI Application Guidelines: Prohibits Using AI-Generated Content as Academic Work

Tsinghua University officially released the "Tsinghua University Artificial Intelligence Education Application Guidelines," aiming to standardize the use of artificial intelligence on campus. The guidelines systematically propose global and tiered guidance norms for AI application, covering core scenarios in teaching and academic research.

image.png

【AiBase Summary:】

🧠 Tsinghua University releases AI education application guidelines to regulate AI use on campus.

📚 The guidelines emphasize strictly prohibiting the use of AI-generated content as academic work to ensure academic integrity.

🔍 The university encourages teachers and students to explore AI-assisted learning, but they must follow clear usage guidelines.

5. DeepMind Releases "Gemini 3 Pro System Instructions": Agent Task Success Rate Increased by 5%, Multi-step Workflow Reliability Engineering

DeepMind publicly released exclusive System Instructions for Gemini 3 Pro, significantly improving the performance of large models in multiple benchmark tests. The instructions emphasize logical reasoning, risk assessment, and persistence, marking a shift from 'black-box tuning' to 'engineering instructions' for large models.

image.png

【AiBase Summary:】

📌 The System Instructions for Gemini 3 Pro improved the success rate of Agent tasks by approximately 5%.

🔍 The instructions emphasize logical dependencies, risk assessment, and hypothesis exploration, enhancing model reliability.

🚀 DeepMind plans to package the instructions into configurable JSON Schema and open them to platforms like Vertex AI in Q1 2026.

6. Adobe Releases Project Graph: AI Tool to Reshape Creative Workflows

Adobe's Project Graph is a node-based visual editor designed to help artists and designers customize their creative workflows more efficiently. It connects AI models, tools, and effectors, improving the controllability and precision of creation and supports packaging complex workflows into shareable tools, thus enhancing team collaboration efficiency.

image.png

【AiBase Summary:】

🎨 Adobe launches Project Graph, aiming to reshape creative workflows in the AI era.

🛠️ The system uses a node editor, allowing users to customize creative workflows like building blocks.

📦 Users can package creative workflows into shareable tools, facilitating team collaboration and application.

Details: https://www.adobe.com/express/create/chart/bar

7. Trae SOLO China Edition Launched: Plan Mode + Sub Agent, Write Code First with a Battle Map, Long Conversations Are No Longer Confusing!

Trae SOLO China Edition launched five new capabilities, including Plan Mode, multi-task parallelism, Sub Agent, DiffView, and context compression, aiming to improve development efficiency and make AI programming smarter.

image.png

【AiBase Summary:】

🎯 Plan Mode: Describe requirements in natural language, and AI automatically breaks down steps and generates a list of file modifications.

ParallelGroup: Supports running multiple Tabs and Chats simultaneously without interference.

🔍 DiffView: Aggregates all code changes, highlights them, and supports one-click rollback.

8. Giant Network Releases Three Muli-Modal Models: Eliminate Video Distortion, Achieve "Real Songs Available" for Voice Conversion

Giant Network's AI Lab, in collaboration with Tsinghua University SATLab and Northwestern Polytechnical University, has released three audio-video multimodal generation technology achievements, including the music-driven video generation model YingVideo-MV, zero-shot voice conversion model YingMusic-SVC, and voice synthesis model YingMusic-Singer, showcasing the team's latest progress in audio-video multimodal generation and planning to open-source these technologies.

image.png

【AiBase Summary:】

🎥 The music-driven video generation model YingVideo-MV can generate high-quality music video clips from a piece of music and a person's image.

🎤 The zero-shot voice conversion model YingMusic-SVC achieves "real songs available" voice conversion capability, effectively suppressing interference and reducing the risk of broken notes.

🎵 The voice synthesis model YingMusic-Singer supports inputting any lyrics to generate natural singing, with zero-shot voice cloning functionality, enhancing creative flexibility.