AIBase

AI News

Redefining Multimodal AI! Zhiyuan Releases the Native Multimodal World Model Emu3

The Beijing Academy of Artificial Intelligence (BAAI, also known as Zhiyuan) has released Emu3, a native multimodal world model. Emu3 is built on next-token prediction and does not rely on diffusion models or compositional architectures to understand and generate text, images, and video. It outperforms well-known open-source models such as SDXL, LLaVA, and OpenSora on image generation, video generation, and vision-language understanding tasks.

12.3k 5 days ago
Models

OPensora

Compumacy

Open-Sora is an open-source, efficient video generation project dedicated to making advanced video generation technology accessible to everyone.

Multimodal, Safetensors
© 2025 AIBase