ComfyAI announced that its self-developed O1 video large model was fully opened to the public at midnight today. The model adopts an MVL (Multimodal Vision Language) unified interaction architecture that accepts text, images, and video in a single input box, and introduces a Chain-of-Thought reasoning pathway for the first time. The company calls it "the world's first unified multimodal video large model."

Unlike the step-by-step workflows conventional in the industry, O1 can complete text-to-video, image-to-video, local editing, and shot extension in a single pass, without requiring users to switch interfaces. A product director at ComfyAI said the model uses multi-viewpoint subject construction technology to lock onto the features of people and objects, addressing the "feature drift" that occurs during camera transitions and preserving continuity in multi-subject scenes.

The O1 model is currently available for trial on ComfyApp and the official website, with freely adjustable clip durations of 3–10 seconds, targeting short-video creators, advertising teams, and individual users. The company said it will later open an API for third-party platforms to integrate. Industry analysts believe the launch of O1 may further lower the barrier to AI video production, but whether it can balance generation quality against cost efficiency remains to be tested by the market.
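
ComfyAI has not yet published API documentation, so the following is a minimal sketch of what a third-party text-to-video call might look like once the API opens. Every specific here, including the endpoint URL, the `o1-video` model identifier, the `duration_seconds` parameter, and the `video_url` response field, is a hypothetical assumption for illustration, not a documented interface; only the 3–10 second duration range comes from the announcement.

```python
# Hypothetical sketch of a third-party integration with the (not yet released)
# O1 API. Endpoint, parameter names, and response fields are all assumptions;
# ComfyAI has not published an API specification.
import requests

API_URL = "https://api.example-comfyai.com/v1/video/generate"  # hypothetical endpoint
API_KEY = "YOUR_API_KEY"  # placeholder credential


def generate_video(prompt: str, duration_seconds: int = 5) -> str:
    """Request a text-to-video generation and return a URL to the result.

    `duration_seconds` reflects the announced 3-10 second range.
    """
    if not 3 <= duration_seconds <= 10:
        raise ValueError("O1 reportedly supports durations of 3-10 seconds")

    response = requests.post(
        API_URL,
        headers={"Authorization": f"Bearer {API_KEY}"},
        json={
            "model": "o1-video",          # hypothetical model identifier
            "prompt": prompt,             # text input; images or video clips
                                          # could presumably be attached in the
                                          # same unified request
            "duration_seconds": duration_seconds,
        },
        timeout=120,
    )
    response.raise_for_status()
    return response.json()["video_url"]   # assumed response field


if __name__ == "__main__":
    url = generate_video("A cat leaps across rooftops at sunset", duration_seconds=6)
    print("Generated video:", url)
```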