In the current field of artificial intelligence, Yann LeCun's JEPA (Joint Embedding Predictive Architecture) is reshaping how large language models (LLMs) are trained. Rather than merely criticizing existing LLMs, the Turing Award laureate is taking matters into his own hands to improve them. Traditional LLM training relies mainly on reconstruction and generation in the input space, such as predicting the next token, an approach whose limitations have already been demonstrated in the visual domain.
LeCun and his team believe that techniques proven in the computer vision (CV) field can be used to enhance the performance of language models. The core idea of JEPA is to learn about the world efficiently by predicting missing features in an abstract representation space rather than reconstructing the raw input. The Meta AI team has successfully applied JEPA to image and video processing, and now hopes to extend the concept to language models.
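To make the idea concrete, the minimal PyTorch sketch below shows what a JEPA-style training objective generally looks like: the loss is computed between predicted and actual embeddings of a target view, never against raw pixels or tokens. The module names (`encoder`, `target_encoder`, `predictor`) and the MSE distance are illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn.functional as F

def jepa_loss(context_view, target_view, encoder, target_encoder, predictor):
    """Sketch of a generic JEPA objective: predict the *embedding* of the
    target view from the context view, instead of reconstructing the input."""
    z_ctx = encoder(context_view)            # embed the visible / context view
    with torch.no_grad():
        z_tgt = target_encoder(target_view)  # embed the target view (no gradient)
    z_pred = predictor(z_ctx)                # predict the target embedding
    return F.mse_loss(z_pred, z_tgt)         # distance measured in embedding space
```

In practice the target encoder is often a slowly updated (e.g. exponential-moving-average) copy of the main encoder, which is why its output is detached here.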
To fill this gap, researchers Hai Huang, Yann LeCun, and Randall Balestriero jointly proposed LLM-JEPA. The new model treats text and code as different views of the same underlying concept and, for the first time, successfully brings JEPA's self-supervised learning architecture to LLMs. By combining JEPA's strength of learning in the embedding space with the standard generative objective, LLM-JEPA retains the strong generative capabilities of LLMs while gaining in both performance and robustness.
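A rough sketch of how such a combined objective could look in code is shown below. It assumes a Hugging Face-style causal language model, keeps the usual next-token loss, and adds a JEPA term that aligns the embeddings of the text view and the code view. The `embed_view` pooling, the cosine distance, and the `lam` weight are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def embed_view(model, tokens):
    """Illustrative pooling: take the last hidden state of the final token
    (assumes a Hugging Face-style model with output_hidden_states support)."""
    hidden = model(tokens, output_hidden_states=True).hidden_states[-1]
    return hidden[:, -1, :]

def llm_jepa_loss(model, text_tokens, code_tokens, lam=1.0):
    """Sketch of a combined LLM-JEPA-style objective: generative loss plus
    an embedding-space alignment term between the two views."""
    # 1) Standard autoregressive next-token loss preserves generative ability.
    out = model(code_tokens[:, :-1])
    gen_loss = F.cross_entropy(
        out.logits.reshape(-1, out.logits.size(-1)),
        code_tokens[:, 1:].reshape(-1),
    )

    # 2) JEPA term: bring the text-view and code-view embeddings together.
    z_text = embed_view(model, text_tokens)
    z_code = embed_view(model, code_tokens)
    jepa_term = 1 - F.cosine_similarity(z_text, z_code, dim=-1).mean()

    return gen_loss + lam * jepa_term
```

The key design choice is that the extra supervision happens in embedding space, so it can regularize the representations without interfering with the token-level generative training.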
Experiments show that LLM-JEPA performs well across multiple mainstream models (such as Llama3, OpenELM, and Gemma2) and diverse datasets (such as GSM8K and Spider), significantly outperforming the traditional LLM training objective. It also shows strong robustness against overfitting, pointing to a new direction for the future development of language models.
Although the current research focuses mainly on the fine-tuning stage, preliminary pre-training results already show great potential. The team plans to further explore applying LLM-JEPA during pre-training in future work, in the hope of delivering further gains in language model performance.