AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

Tongyi Qianwen Joins ModelScope Community to Open Source P-MMEval Testing Set: Evaluating Multilingual Capabilities of Models

Alibaba DAMO Academy, in collaboration with the ModelScope community, recently announced the open sourcing of a new multilingual benchmark testing set, P-MMEval, aimed at comprehensively evaluating the multilingual capabilities of Large Language Models (LLMs) and conducting comparative analysis of cross-language transfer abilities. This testing set covers efficient datasets for both basic and specialized capabilities, ensuring consistency in multilingual coverage across all selected datasets, and provides parallel samples across multiple languages, supporting up to 10 languages from 8 different language families, including English, Chinese, and Arabic.

12.2k 3 days ago
Tongyi Qianwen Joins ModelScope Community to Open Source P-MMEval Testing Set: Evaluating Multilingual Capabilities of Models

AI Products

View More
P-MMEval

P-MMEval

A multilingual multi-task benchmark for evaluating large language models (LLMs).

Research tools
6k
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map