AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

Tsinghua Tang Jie & Zhipu AI's CogVLM-17B: A Domestic Multimodal Model Challenging GPT-4V

The CogVLM-17B, developed through a collaboration between Tsinghua University and Zhipu AI, is a domestic multimodal model with outstanding performance. CogVLM-17B can not only recognize objects in images but also distinguish between fully visible and partially visible objects. The model employs a unique deep fusion method, achieving deep alignment of image features and text features through four key components. CogVLM-17B outperforms Google's models in various fields and is aptly referred to as the '14-sided warrior', showcasing its multimodal capabilities.

9.8k 12-14
Tsinghua Tang Jie & Zhipu AI's CogVLM-17B: A Domestic Multimodal Model Challenging GPT-4V
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map