On March 23, Luma Labs released Uni-1, the company's first publicly available image generation model built on its Unified Intelligence architecture. The official website now offers free trial access, API pricing has been announced, and enterprise access channels are gradually rolling out.


Architecture Shift: From Diffusion Models to Autoregressive Generation

Uni-1 departs from the currently dominant diffusion-model approach and instead uses a decoder-only autoregressive Transformer, interleaving text tokens and image tokens into a single sequence so that reasoning and pixel generation happen in one unified pass.
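The interleaved-sequence idea can be sketched in a few lines. Everything below is an illustrative toy, not Luma's actual code: the `<boi>`/`<eoi>` markers, the token names, and the `generate` loop are invented to show how one decoder can emit text and image tokens from the same stream.

```python
# Toy sketch of a unified autoregressive sequence (hypothetical, for illustration).
BOI, EOI = "<boi>", "<eoi>"  # invented begin/end-of-image markers

def build_sequence(text_tokens, image_tokens):
    """Interleave a text prompt and image tokens into one sequence,
    the way a decoder-only model would consume them."""
    return list(text_tokens) + [BOI] + list(image_tokens) + [EOI]

def generate(prompt_tokens, next_token_fn, max_image_tokens=16):
    """Toy autoregressive loop: after the prompt, the SAME decoder
    emits image tokens one at a time until it produces EOI."""
    seq = list(prompt_tokens) + [BOI]
    for _ in range(max_image_tokens):
        tok = next_token_fn(seq)  # one model predicts both text and image tokens
        seq.append(tok)
        if tok == EOI:
            break
    return seq

# Dummy predictor standing in for the Transformer: emits two image tokens, then EOI.
patches = iter(["img_0", "img_1", EOI])
out = generate(["draw", "a", "bridge"], lambda seq: next(patches))
print(out)  # ['draw', 'a', 'bridge', '<boi>', 'img_0', 'img_1', '<eoi>']
```

Because there is no hand-off between a planning model and a separate renderer, the context built up during text reasoning stays available while image tokens are generated, which is the gap Jain describes eliminating.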

Luma CEO Amit Jain explained that traditional solutions usually first use a language model for planning and then hand it over to a diffusion model for generation, leading to information loss between the two stages. The design goal of Uni-1 is to eliminate this gap.

Jain previously worked at Apple and participated in Vision Pro engineering work.

Function: Reference Image Control and Cross-Style Generation

Uni-1 can generate images guided by one or more reference images, preserving the subject's identity, pose, and composition. In official tests of character consistency and portrait control, the multi-reference mode performed consistently.

Luma says the model supports 76 visual styles, spanning categories such as realistic photography, comics, and ukiyo-e.

In one demonstration, the prompt "Draw an infographic of the Golden Gate Bridge" led the model to automatically plan the layout, generate a structural diagram of the bridge, and annotate figures such as "1711 Meters," with its internal reasoning visible in real time.

Benchmarking: Leading in Spatial Reasoning and Reference Generation


Data published by Luma shows that Uni-1 scored 0.51 in the RISEBench reasoning benchmark, higher than Google Nano Banana 2's 0.50 and OpenAI GPT Image 1.5's 0.46; its spatial reasoning score was 0.58, and logical reasoning 0.32, about twice that of GPT Image.

On the ODinW-13 object detection benchmark, Uni-1 scored 46.2 mAP, close to Google Gemini 3 Pro's 46.3.

In human-preference Elo rankings, Uni-1 placed first in overall preference, style and editing, and reference generation, and second in text-to-image generation.

Pricing

The API is priced per token: $0.50 per million tokens for input text, $1.20 per million tokens for input images, $3.00 per million tokens for output text and chain of thought, and $45.45 per million tokens for output images.

Converted to per-image cost: text-to-image at 2048px runs approximately $0.0909, editing with a single reference image about $0.0933, and eight reference images about $0.1101.

VentureBeat reported that in enterprise scenarios with 2K resolution, Uni-1 costs 10% to 30% less than Google Nano Banana 2.

Background

Luma Labs previously focused on video generation products like Dream Machine (Ray3 series). On March 5, the company released the Luma Agents creative agent platform based on the Unified Intelligence architecture. Uni-1 is the first application of this architecture in a static image product.

Within hours of the release, related posts on the X platform received over 2.3 million views. Luma stated that subsequent video and audio versions will be launched, but the specific timing has not been disclosed.

Trial link: lumalabs.ai/uni-1