Minimal-GRPO
PublicImplementation of Group Relative Policy Optimization (GRPO) and Evolutionary Strategy (ES) to fine-tune Open Language Models (like LlaMa-3.2, Qwen2.5) for Tasks with verifiable rewards.
Discover Popular AI-MCP Services - Find Your Perfect Match Instantly
Easy MCP Client Integration - Access Powerful AI Capabilities
Master MCP Usage - From Beginner to Expert
Top MCP Service Performance Rankings - Find Your Best Choice
Publish & Promote Your MCP Services
Implementation of Group Relative Policy Optimization (GRPO) and Evolutionary Strategy (ES) to fine-tune Open Language Models (like LlaMa-3.2, Qwen2.5) for Tasks with verifiable rewards.