AIBase

AI News

Tsinghua University and Tencent Jointly Launch Fully Open Source Multi-Modal Architecture Oryx Supporting Ultra-Long Video Input

A multi-modal large language model named Oryx is changing expectations of how well AI can perceive the visual world. Developed jointly by researchers from Tsinghua University, Tencent, and Nanyang Technological University, the system has been described as the 'Transformers' of visual processing. Oryx, short for Oryx Multi-Modal Large Language Models, is designed for spatio-temporal understanding of images, videos, and 3D scenes.

12.6k 6 days ago

Models

Oryx 1.5 7B

THUdyh

Oryx-1.5-7B is a 7B-parameter model built on the Qwen2.5 language model. It supports a 32K-token context window and specializes in efficiently processing visual inputs of arbitrary spatial size and duration.
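
To give a rough sense of what a 32K-token window means for long video, the sketch below estimates how many frames fit once some tokens are reserved for the text prompt. It is a back-of-the-envelope illustration only: the function name `max_frames`, the 512-token text reserve, and the per-frame token counts are all hypothetical assumptions, not figures published for Oryx, and "32K" is read here as 32,768 tokens.

```python
def max_frames(context_tokens: int, text_tokens: int, tokens_per_frame: int) -> int:
    """Return how many frames fit in the context after reserving text tokens."""
    if tokens_per_frame <= 0:
        raise ValueError("tokens_per_frame must be positive")
    remaining = context_tokens - text_tokens
    # If the text alone fills the window, no frames fit.
    return max(remaining, 0) // tokens_per_frame


# Assumption: "32K" interpreted as 32 * 1024 tokens.
CONTEXT_TOKENS = 32 * 1024
# Assumption: hypothetical budget reserved for the prompt and the answer.
TEXT_TOKENS = 512

# Hypothetical per-frame visual token counts, from light to heavy compression.
for tokens_per_frame in (256, 64, 16):
    frames = max_frames(CONTEXT_TOKENS, TEXT_TOKENS, tokens_per_frame)
    print(f"{tokens_per_frame} tokens/frame -> {frames} frames")
```

Under these assumed numbers, cutting the tokens spent per frame by a factor of 16 raises the number of frames that fit by the same factor, which is why the number of visual tokens per frame matters so much for long video input.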

Multimodal · Safetensors · Multiple Languages
133
7
© 2025 AIBase