Awesome-LLM-Eval
PublicAwesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
All-in-One GEO Brand Insights Platform
Quickly check how your brand is perceived and presented in AI-powered search results.
Detect brand's visibility on AI platforms
Quickly evaluate the citation of promotion articles on AI platforms
Discover Popular AI-MCP Services - Find Your Perfect Match Instantly
Easy MCP Client Integration - Access Powerful AI Capabilities
Master MCP Usage - From Beginner to Expert
Top MCP Service Performance Rankings - Find Your Best Choice
Publish & Promote Your MCP Services
Multi-Dimensional Large Model Comparison - Find Your Perfect Match
Calculate AI Model Costs Accurately - Optimize Your Budget
Multi-Model Real-Time Evaluation & Quick Output Comparison
Free PC Hardware Test for DeepSeek & Llama
Enter Your Large Model Computing Requirements for Instant GPU, Memory & Server Configuration Recommendations
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.