Apple has just unveiled a breakthrough in 3D reconstruction, a problem the AI field has long considered a "tough nut" to crack.
According to the latest report, Apple's AI research team has released a new model called LiTo (Surface Light Field Tokenization). Its core breakthrough: reconstructing a complete 3D object from a single ordinary 2D image, with detail that approaches physical realism.

For a long time, the biggest challenge in generating 3D models from a single image has been "lighting consistency": when you change the viewing angle, the surface reflections and highlights on the object often become distorted or unrealistic. LiTo tackles this problem with an innovative "latent space" representation. Instead of simply memorizing pixels, the model encodes, as mathematical vectors, the fundamental laws of how light interacts with surfaces.
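LiTo's actual architecture has not been published, but the idea of a latent code that produces consistent appearance from any viewpoint can be sketched in miniature. In this hypothetical toy, a "surface light field token" is a short vector of coefficients, and a fixed cosine basis over the view angle stands in for whatever learned decoder the real model uses:

```python
import math

def decode_radiance(z: list[float], view_angle: float) -> float:
    """Toy decoder: each latent coefficient weights one cosine harmonic
    of the view angle, so a single token yields a smooth, consistent
    appearance across every viewpoint (no per-view memorization)."""
    return sum(c * math.cos(k * view_angle) for k, c in enumerate(z))

token = [0.5, 0.3, 0.1]                        # latent code for one surface patch
front = decode_radiance(token, 0.0)            # appearance viewed head-on
side = decode_radiance(token, math.pi / 2)     # same token, different view
```

The point of the sketch is only the division of labor: the token stores *what the surface is like*, and the decoder turns that plus a view direction into an observed color, which is why reflections stay coherent as the camera moves.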
In simple terms, LiTo has strong "imagination": even from a single front-facing photo, it can accurately predict the specular highlights and Fresnel reflections on the back of the object under different lighting conditions. In official comparison tests, LiTo significantly surpassed the industry-leading TRELLIS model in multi-view lighting accuracy.
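To see why Fresnel reflections are such a stress test for view consistency, consider Schlick's approximation, a standard formula from computer graphics (not something taken from LiTo itself): reflectance depends strongly on the viewing angle, so any model that merely memorizes pixels from one view will get other views wrong.

```python
import math

def schlick_fresnel(cos_theta: float, f0: float) -> float:
    """Schlick's approximation of Fresnel reflectance.

    cos_theta: cosine of the angle between the view direction and the
               surface normal (1.0 = head-on, near 0.0 = grazing).
    f0:        base reflectance at normal incidence (~0.04 for common
               dielectrics like plastic or glass).
    """
    return f0 + (1.0 - f0) * (1.0 - cos_theta) ** 5

head_on = schlick_fresnel(1.0, 0.04)   # exactly f0 when viewed straight on
grazing = schlick_fresnel(0.1, 0.04)   # reflectance rises sharply near grazing
```

A surface that reflects only 4% of light head-on reflects roughly 60% at a grazing angle, which is exactly the kind of view-dependent effect a single-image 3D model must predict rather than copy.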

To train this "detail-obsessed" AI, researchers rendered thousands of 3D objects from 150 viewpoints under three lighting conditions. This almost obsessive attention to lighting detail clearly signals that Apple is laying the groundwork for its spatial-computing ecosystem.

Imagine: in the future, you take a photo with your iPhone, and LiTo instantly converts it into a lifelike 3D model that drops seamlessly into Vision Pro's virtual space. This frictionless path from 2D content to 3D assets could be the key to Apple securing a "late-mover advantage" in the AI race.