AI Daily: DouBao Large Model 1.8, Seedance 1.5 Pro Released; Gemini 3 Flash Officially Launched; MiniMax Passes Hong Kong Stock Exchange Listing Hearing

站长之家

Published inAI News · 15 min read · Dec 18, 2025

199

Welcome to the "AI Daily" section! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers to help you understand technical trends and innovative AI product applications.

Fresh AI products Click to learn more:https://app.aibase.com/zh

1、Gemini 3 Flash Launches: Free, Fast, Intelligence Surpasses Pro, Google AI Fully Enters the "Zero Latency" Era

Google released its new lightweight model Gemini 3 Flash, which has a response speed three times that of its predecessor, nearly "zero latency," and surpassed the same-generation flagship Gemini 3 Pro in multiple high-difficulty benchmark tests, becoming the first "Flash model" in history to "overcome the elder brother."

AiBase Summary:
🧪 On the authoritative code repair list SWE-bench, Gemini 3 Flash scored 78%, slightly ahead of Gemini 3 Pro (76.2%).
🧠 In the doctor-level reasoning test GPQA Diamond, it achieved a high score of 90.4%.
⚡ In the extremely difficult comprehensive evaluation Humanity’s Last Exam, it achieved a score of 33.7%, significantly better than the previous flagship Gemini 2.5 Pro.

2、Volc Engine FORCE Conference Shows Off: Doubao Large Model 1.8 + Seedance 1.5 Pro Released, Daily Average 50 Trillion Tokens Top China's First

At the Volc Engine FORCE Conference, Doubao Large Model 1.8 and the video generation model Seedance 1.5 Pro were released, along with the "AI Cost-Saving Plan," aimed at lowering the cost barrier for enterprises using large models. Doubao Large Model 1.8 showed significant improvements in several key dimensions, while Seedance 1.5 Pro enhanced video generation quality and consistency. In addition, the daily average token usage of the Doubao Large Model has exceeded 50 trillion, firmly holding the top position in China and third globally, marking its transition from a technological product to large-scale industrial application.

AiBase Summary:
🧠 Doubao Large Model 1.8 achieved significant improvements in key dimensions such as reasoning, multilingual support, code generation, and tool invocation.
🎥 Seedance 1.5 Pro supports longer duration, higher frame rate controllable video content creation, providing industrial-level visual generation capabilities for short videos, advertisements, and games.
💰 The "AI Cost-Saving Plan" lowers the cost barrier for enterprises using large models through technologies such as model compression, inference optimization, and resource scheduling.

3、Apple Opens SHARP Model: Say Goodbye to Long Waiting, Turn 2D Photos into 3D Spaces in 1 Second

Apple recently open-sourced a new AI model called SHARP, which can transform an ordinary 2D photo into a 3D scene with real-world proportions, taking less than one second. The core technology of SHARP is the "3D Gaussian Splatting" technique, which mastered general spatial geometric rules through deep training. With just one quick scan, it can predict the positions of millions of "Gaussian balls" with lighting information. SHARP's image quality leads the industry's strongest models and supports realistic camera movement simulation. Currently, Apple has released the complete code and resources of SHARP on GitHub for global developers to download.

AiBase Summary:
⚡ Achieved a magnitude breakthrough in speed: SHARP model improved the 2D to 3D conversion speed by three orders of magnitude, achieving near real-time conversion experience in less than one second.
🌐 Leading 3D generation technology: Based on 3D Gaussian Splatting technology, the model predicts millions of 3D points with a single neural network forward pass, accurately restoring real-world proportions.
🔓 Comprehensive open-source ecosystem: Apple has open-sourced SHARP's code and resources on GitHub to accelerate innovation in spatial computing and 3D content fields for global developers.

4、Meta Releases SAM Audio: The World's First Multimodal Audio Model Supporting "Click to Separate Sounds", One-click Extraction of Guitar Sound, Voice or Dog Barks

Meta released SAM Audio, the world's first multimodal audio separation model that can extract target sounds such as guitar sounds, voice, or dog barks with a single click through text, visual, and time segment prompts. This technology replicates the way humans naturally perceive sound in AI systems for the first time, marking a revolutionary significance.

AiBase Summary:
🎧 Text Prompt: Extract corresponding sound sources through semantic descriptions.
👁️ Visual Prompt: Click on the sound-emitting object in the video to separate the audio.
⏱️ Time Segment Prompt: Mark time intervals to automatically process similar sounds.
More details: https://ai.meta.com/samaudio/ https://github.com/facebookresearch/sam-audio

5、MiniMax Passes Hong Kong Stock Exchange Listing Hearing, the First Domestic Large Model "Stock" May Be in Shanghai

MiniMax passed the Hong Kong Stock Exchange listing hearing, and is expected to become the first domestic large model company to list on the capital market, with its core assets being large language models and multimodal generation technology. This marks an increased recognition of the commercialization path of large models by the capital market and may open the door for subsequent AI company IPOs.

AiBase Summary:
🚀 MiniMax passed the Hong Kong Stock Exchange listing hearing, becoming the first domestic large model company to list on the stock market.
💼 Its core assets are large language models and multimodal generation technology, different from traditional computer vision companies.
📈 If successfully listed, it will validate the capital market's recognition of the commercialization path of large models and may open the door for subsequent AI company IPOs.

6、The Battle for the First Stock in Large Models: MiniMax and Zhipu AI Both Passed the Hong Kong Stock Exchange Hearing on the Same Day

China's AI large model sector has made a milestone progress, with MiniMax and Zhipu AI both passing the Hong Kong Stock Exchange hearing on the same day, planning to list on the Hong Kong stock exchange and compete for the title of "Global First Large Model Stock."

AiBase Summary:
🚀 MiniMax has passed the Hong Kong Stock Exchange hearing and plans to list on the stock exchange in January 2026.
💼 Zhipu AI also passed the hearing, sponsored by investment banks such as CICC.
💰 Both companies have received support from top-tier investment institutions, opening up a new capital track for AGI base models.

7、OpenAI Officially Announces: Developers Can Submit Applications to ChatGPT

OpenAI has opened up the ChatGPT application submission permission for global developers, marking that ChatGPT has advanced to an AI-native application platform. Developers can submit their works through the latest guide, and after approval, they will appear in the ChatGPT application directory, giving ChatGPT practical capabilities.

AiBase Summary:
🚀 Opening the ecological door: OpenAI opens application submission, allowing developers to integrate functions into ChatGPT for global users to discover.
🛒 Application directory launched: Users can search and browse selected AI applications through the tools menu or visit chatgpt.com/apps.
💰 Clear profitability prospects: Supports linking to external websites for trading physical goods and plans to explore digital commodity monetization models.

8、Qwen App Integrates with Amap: Alibaba AI Enters the Real World

The Qwen App integrates with Amap, marking its ability to understand and act in the physical world, capable of handling complex real-world scenario demands, and plans to further integrate into more core scenarios, building a powerful super entrance.

AiBase Summary:
🚀 Qwen App integrates with Amap, achieving a leap from answering questions to geographical space reasoning.
🧭 Qwen can generate visual decision cards, directly triggering navigation or ride-hailing services.
🛍️ Alibaba plans to make Qwen a super entrance that can call the real-world fulfillment network.

9、Microsoft Open Sources TRELLIS.2: Convert Images into High-Precision 3D Models in One Click

Microsoft open-sourced TRELLIS.2, an efficient image-to-3D model generation tool that can quickly generate high-quality 3D models and support multiple platforms. TRELLIS.2 performs well on NVIDIA H100 graphics cards, completing high-resolution model generation in an extremely short time. In addition, it provides PBR four-piece texture maps, making it very suitable for e-commerce scenarios.

AiBase Summary:
🌟 TRELLIS.2 is an image-to-3D model generation tool open-sourced by Microsoft, capable of quickly generating high-quality 3D models.
⏱️ This tool generates a 512³ resolution model in just 3 seconds on NVIDIA H100 graphics cards, with extremely high efficiency.
🛒 It comes with PBR four-piece texture maps, convenient for e-commerce users to quickly convert products into 3D displays.
More details: https://huggingface.co/microsoft/TRELLIS.2-4B

10、xAI Launches the Fastest Voice Agent API, Supporting Real-Time Chinese Search and Emotion Control

xAI's Grok voice agent API demonstrates excellent performance and highly competitive pricing in the real-time voice AI field. The model performed well in audio reasoning benchmark tests, with a response speed far exceeding competitors, and supports multi-language automatic detection, real-time web search, and emotion control functions, providing developers with powerful tools.

AiBase Summary:
🔥 Grok voice agent API is launched at $0.05 per minute, offering high cost-effectiveness.
🌐 Supports automatic detection and free switching of multiple languages including Chinese, meeting global user needs.
🧠 Deeply integrated with real-time web search and reasoning capabilities, ensuring responses keep up with the latest information.

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present the latest content in the AI field, focusing on developers to help you understand technical trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. WeChat announced that the Mini Program Growth Plan has officially integrated the Hy3preview model, enhancing the AI capabilities to improve development.

WeChat Announces Official Launch of Mini Program Growth Plan Integrated with Hy3 Preview

The 'Growth Plan' WeChat mini-program completed a model upgrade on May 15, fully integrating Tencent's Hy3preview model. The new version offers enhanced logical reasoning and contextual understanding, aiming to improve developers' intelligent development and operational experience within the WeChat ecosystem. An official upgrade guide has been released to support implementation.....

EverMind Invests 3 Million to Cultivate ReUnite: Leveraging AI Large Model Long-Term Memory Technology to Assist Global Family Reunions

EverMind, under Shanda Group, incubates the AI public welfare product 'ReUnite,' leveraging long-term memory technology to help reunite missing families globally. Originating from an open-source community competition and developed by an AI engineer in the chemical industry, it builds a digital bridge for reunions by analyzing physical features and childhood memories.....

Tencent Q1 Financial Report: Hy3preview Query Volume Continues to Lead in OpenRouter, Agent Releases Intensify

Tencent Holdings released its Q1 report on May 13, highlighting accelerated AI progress. The Hunyuan large model was rebuilt within three months, with the Hy3preview version showing significant improvements in context understanding, agent, and coding capabilities. According to OpenRouter, Hy3preview maintained top daily token usage and weekly token calls after its free trial ended, ranking first in the weekly chart for three consecutive weeks fro....

iOS27 Will Launch a Separate App for Siri with a Chatbot-Like Interface

Before Apple's WWDC 2026, journalist Mark Gurman revealed that Siri will return as a standalone application in iOS27, codenamed "Rave," marking the first time in 15 years. The new version of Siri is upgraded into a 24/7 intelligent agent, featuring a chat interface similar to ChatGPT, supporting conversation history, file uploads, and content prioritization, and is deeply integrated with Dynamic Island, significantly enhancing the user experience.

Google Releases AI Notebook Platform Googlebook: Gemini Model Redesigns Pointer Interaction and System Infrastructure

Google launches Googlebook, an AI-driven notebook platform shifting personal computing from traditional OS to a model-centric logic. It deeply integrates Gemini at the system level, embedding AI into the mouse cursor to analyze screen content in real-time and proactively respond, revolutionizing human-app interaction.....

Anthropic Raises Funding at a Valuation of $90 Billion Targeting $3 Billion

The AI company Anthropic is planning to raise $3 billion in funding, with a pre-money valuation of $90 billion, reflecting strong market confidence in its future prospects. Negotiations are progressing quickly, and the deal is expected to be completed as early as月底, but specific terms of the agreement are still under discussion and have not been finalized yet.

Google Launches Gemini Intelligence, Android Enters the AI Era!

Google unveiled the new features of Android 17 and the AI technology named Gemini Intelligence at its launch event on May 13th, marking the upgrade of Android from an operating system to a smart system. This AI will be rolled out in batches this summer, initially supporting the Samsung Galaxy S26 and Google Pixel 10 phones, with further expansion to devices such as smartwatches, car systems, smart glasses, and laptops.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

AI Daily: DouBao Large Model 1.8, Seedance 1.5 Pro Released; Gemini 3 Flash Officially Launched; MiniMax Passes Hong Kong Stock Exchange Listing Hearing

站长之家

This article is from AIbase Daily

AI News Recommendations

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

WeChat Announces Official Launch of Mini Program Growth Plan Integrated with Hy3 Preview

ChatGPT's Traffic Share Slumps Dramatically as Google Gemini Catches Up Rapidly

EverMind Invests 3 Million to Cultivate ReUnite: Leveraging AI Large Model Long-Term Memory Technology to Assist Global Family Reunions

Tencent Q1 Financial Report: Hy3preview Query Volume Continues to Lead in OpenRouter, Agent Releases Intensify

iOS27 Will Launch a Separate App for Siri with a Chatbot-Like Interface

Google Releases AI Notebook Platform Googlebook: Gemini Model Redesigns Pointer Interaction and System Infrastructure

Google Android 17 Officially Released Gemini AI Strongly Enters Laptops

Anthropic Raises Funding at a Valuation of $90 Billion Targeting $3 Billion

Google Launches Gemini Intelligence, Android Enters the AI Era!

AI News Recommendations

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

WeChat Announces Official Launch of Mini Program Growth Plan Integrated with Hy3 Preview

ChatGPT's Traffic Share Slumps Dramatically as Google Gemini Catches Up Rapidly

EverMind Invests 3 Million to Cultivate ReUnite: Leveraging AI Large Model Long-Term Memory Technology to Assist Global Family Reunions

Tencent Q1 Financial Report: Hy3preview Query Volume Continues to Lead in OpenRouter, Agent Releases Intensify

iOS27 Will Launch a Separate App for Siri with a Chatbot-Like Interface

Google Releases AI Notebook Platform Googlebook: Gemini Model Redesigns Pointer Interaction and System Infrastructure

Google Android 17 Officially Released Gemini AI Strongly Enters Laptops

Anthropic Raises Funding at a Valuation of $90 Billion Targeting $3 Billion

Google Launches Gemini Intelligence, Android Enters the AI Era!