The CogVLM-17B multi-modal model, developed jointly by Tsinghua University and Zhipu AI, has achieved state-of-the-art performance on multiple benchmarks. Rather than the shallow alignment used by many earlier visual-language models, CogVLM achieves deep fusion of visual and language features through a trainable visual expert module, improving performance and supporting capabilities such as object detection (visual grounding) and text recognition (OCR). The article also notes the rapid development of competing multi-modal models, signaling intense competition in the multi-modal AI field, and positions CogVLM-17B as a challenger to GPT-4V's leading position.
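For readers who want to try the model, below is a minimal inference sketch in Python following the usage pattern documented on the Hugging Face model card for THUDM/cogvlm-chat-hf. The helper build_conversation_input_ids is custom code shipped with the checkpoint (loaded via trust_remote_code), so exact names and signatures may vary between releases; the image path and prompt are placeholders.

```python
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, LlamaTokenizer

# CogVLM's chat checkpoint reuses the Vicuna tokenizer, per the model card.
tokenizer = LlamaTokenizer.from_pretrained("lmsys/vicuna-7b-v1.5")

# trust_remote_code pulls in the model's custom modeling code,
# including the visual expert module described in the paper.
model = AutoModelForCausalLM.from_pretrained(
    "THUDM/cogvlm-chat-hf",
    torch_dtype=torch.bfloat16,
    low_cpu_mem_usage=True,
    trust_remote_code=True,
).to("cuda").eval()

query = "Describe this image."          # placeholder prompt
image = Image.open("example.jpg").convert("RGB")  # placeholder image path

# Custom helper provided by the remote code: packs text + image
# into the token/feature layout the model expects.
inputs = model.build_conversation_input_ids(
    tokenizer, query=query, history=[], images=[image]
)
inputs = {
    "input_ids": inputs["input_ids"].unsqueeze(0).to("cuda"),
    "token_type_ids": inputs["token_type_ids"].unsqueeze(0).to("cuda"),
    "attention_mask": inputs["attention_mask"].unsqueeze(0).to("cuda"),
    "images": [[inputs["images"][0].to("cuda").to(torch.bfloat16)]],
}

with torch.no_grad():
    outputs = model.generate(**inputs, max_length=2048, do_sample=False)
    # Strip the prompt tokens, keep only the generated answer.
    outputs = outputs[:, inputs["input_ids"].shape[1]:]
    print(tokenizer.decode(outputs[0]))
```

Grounding-style queries (e.g., asking for an object's bounding box) follow the same pattern but are served by the separate grounding checkpoint the authors released alongside the chat model.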