Israeli tech company Lightricks recently announced the release of LTX-2, its latest audiovisual synthesis system. The system is notable for its computational efficiency: from a brief text description, it can directly generate high-definition video up to 20 seconds long with fully synchronized audio.
Unlike traditional visual synthesis methods, LTX-2 breaks with the conventional "video first, audio later" pipeline. The development team points out that decoupling audio from video in separate processing stages cannot reproduce the natural joint distribution of real environments. LTX-2 therefore adopts a dual-stream parallel computing architecture in which 19 billion parameters jointly model the visual and acoustic scene: 14 billion parameters are allocated to the video stream and 5 billion to the audio stream. This asymmetric split mirrors the difference in information density between visual and auditory signals in real life.
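
The architecture can be pictured as two transformer stacks of different widths that exchange information at every layer. The PyTorch sketch below is purely illustrative and is not Lightricks' actual implementation; all dimensions, class names, and the joint-attention wiring are assumptions chosen to show how an asymmetric dual-stream block with shared attention might look.

```python
import torch
import torch.nn as nn

class DualStreamBlock(nn.Module):
    """Toy dual-stream layer: video and audio tokens keep separate,
    asymmetrically sized feed-forward paths but mix in one joint
    attention pass, so sound events can line up with the frames
    they belong to."""

    def __init__(self, video_dim=2048, audio_dim=1024, joint_dim=1024, heads=16):
        super().__init__()
        # Wider feed-forward path for video, mirroring the larger share
        # of parameters the article attributes to the video stream.
        self.video_ff = nn.Sequential(
            nn.Linear(video_dim, 4 * video_dim), nn.GELU(),
            nn.Linear(4 * video_dim, video_dim),
        )
        self.audio_ff = nn.Sequential(
            nn.Linear(audio_dim, 4 * audio_dim), nn.GELU(),
            nn.Linear(4 * audio_dim, audio_dim),
        )
        # Project both modalities into a shared width so their tokens
        # can attend to each other in a single attention call.
        self.video_in = nn.Linear(video_dim, joint_dim)
        self.audio_in = nn.Linear(audio_dim, joint_dim)
        self.video_out = nn.Linear(joint_dim, video_dim)
        self.audio_out = nn.Linear(joint_dim, audio_dim)
        self.joint_attn = nn.MultiheadAttention(joint_dim, heads, batch_first=True)

    def forward(self, video_tokens, audio_tokens):
        # Joint self-attention over the concatenated token sequences.
        joint = torch.cat([self.video_in(video_tokens),
                           self.audio_in(audio_tokens)], dim=1)
        mixed, _ = self.joint_attn(joint, joint, joint)
        n_video = video_tokens.shape[1]
        video_tokens = video_tokens + self.video_out(mixed[:, :n_video])
        audio_tokens = audio_tokens + self.audio_out(mixed[:, n_video:])
        # Per-modality feed-forward with residual connections.
        return (video_tokens + self.video_ff(video_tokens),
                audio_tokens + self.audio_ff(audio_tokens))

# Toy usage: one sample with 128 video tokens and 64 audio tokens.
block = DualStreamBlock()
v, a = block(torch.randn(1, 128, 2048), torch.randn(1, 64, 1024))
print(v.shape, a.shape)  # torch.Size([1, 128, 2048]) torch.Size([1, 64, 1024])
```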

In practical performance testing, the system demonstrated striking synthesis speed. On mainstream enterprise-grade graphics cards, generating 720p audiovisual content takes only 1.22 seconds per step. Reported figures put its throughput at up to 18 times that of comparable products, and its 20-second generation limit also exceeds that of similar tools from Google and other major laboratories.
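
To put the per-step figure in context: end-to-end latency is the per-step time multiplied by the sampler's step count, which the announcement does not state. The snippet below uses a hypothetical 30-step sampler purely for illustration.

```python
# Back-of-the-envelope latency from the reported per-step time.
# The announcement does not state the sampler's step count, so the
# 30 steps below are a hypothetical value for illustration only.
SECONDS_PER_STEP = 1.22  # reported per-step time at 720p
STEPS = 30               # assumed diffusion sampler step count

total = SECONDS_PER_STEP * STEPS
print(f"~{total:.1f} s to generate a 20-second 720p clip "
      f"under these assumptions")  # ~36.6 s
```
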
To interpret complex language instructions accurately, the system integrates a multilingual text-parsing engine and introduces a "preprocessing buffer" mechanism, giving the model room to resolve the prompt's logic before the final synthesis is executed. Through a cross-modal association mechanism, the system can precisely match the moment an object collides on screen with the corresponding sound effect.
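
One common way to achieve this kind of audiovisual alignment is to stamp tokens from both modalities with embeddings of their absolute timestamps and let audio tokens attend to video tokens. The sketch below illustrates that general idea only; the frame rates, dimensions, and attention setup are assumptions, not details confirmed by Lightricks.

```python
import torch
import torch.nn as nn

def time_embedding(timestamps, dim=256):
    """Sinusoidal embedding of absolute time in seconds, so tokens from
    both modalities that occur at the same instant receive similar codes."""
    freqs = torch.exp(torch.linspace(0.0, -8.0, dim // 2))
    angles = timestamps[:, None] * freqs[None, :] * 2 * torch.pi
    return torch.cat([angles.sin(), angles.cos()], dim=-1)

# Hypothetical token grids: 24 fps video frames, 50 Hz audio latents.
video_t = torch.arange(0, 20, 1 / 24)  # 480 frame timestamps over 20 s
audio_t = torch.arange(0, 20, 1 / 50)  # 1000 audio timestamps over 20 s

dim = 256
video_tokens = torch.randn(1, len(video_t), dim) + time_embedding(video_t)
audio_tokens = torch.randn(1, len(audio_t), dim) + time_embedding(audio_t)

# Cross-attention from audio to video: each audio token can look up what
# is happening on screen at its own timestamp, e.g. the collision frame
# that should trigger an impact sound.
cross_attn = nn.MultiheadAttention(dim, num_heads=8, batch_first=True)
aligned_audio, weights = cross_attn(audio_tokens, video_tokens, video_tokens)
print(weights.shape)  # (1, 1000, 480): audio steps attending over frames
```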

Despite the system's technical lead, the development team acknowledges that it occasionally misattributes voices when handling rare dialects or multi-character dialogue, and that sequences longer than 20 seconds can still exhibit subtle drift in the timeline.
Zeev Farbman, founder of Lightricks, stated that the choice to open-source the system's code rather than keep it as a closed service was based on considerations of "technological control." He believes content creators should control the technology on their own hardware rather than outsource decision-making power to a few interest groups. The system's complete code and training framework have been released on an open platform and are deeply optimized for the latest consumer-grade high-performance graphics cards.