arabic-llm-evaluation-framework
PublicA toolkit for evaluating and scoring LLMs in educational conversational applications. Includes GUI scoring tool, performance analysis, inter-rater agreement, and visualization capabilities.
Discover Popular AI-MCP Services - Find Your Perfect Match Instantly
Easy MCP Client Integration - Access Powerful AI Capabilities
Master MCP Usage - From Beginner to Expert
Top MCP Service Performance Rankings - Find Your Best Choice
Publish & Promote Your MCP Services
A toolkit for evaluating and scoring LLMs in educational conversational applications. Includes GUI scoring tool, performance analysis, inter-rater agreement, and visualization capabilities.