Minimal-GRPO

Public

Implementation of Group Relative Policy Optimization (GRPO) and Evolutionary Strategy (ES) to fine-tune Open Language Models (like LlaMa-3.2, Qwen2.5) for Tasks with verifiable rewards.

evolution-strategies finetuning-llms grpo huggingface llm pytorch

Creat：2025-03-06T01:55:06

Update：2025-11-03T13:05:51

Stars

Stars Increase

Related projects

Generative Ai For Beginners

Hot

21 Lessons, Get Started Building with Generative AI ? https://microsoft.github.io/generative-ai-for-beginners/

102828

1年前

+172today

Awesome Llm Apps

Hot

llms

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

82076

1年前

+398today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

49117

1年前

+115today

Made With ML

data-engineering

Learn how to design, develop, deploy and iterate on production-grade ML applications.

44677

1年前

+43today

WeChatMsg

chatgpt

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

40201

1年前

+12today

Mindsdb

agi

AI's query engine - Platform for building AI that can learn and answer questions over large scale federated data.

37471

1年前

+25today

Tabby

Self-hosted AI coding assistant

32551

1年前

+18today

Chroma

Hot

document-retrieval

the AI-native open-source embedding database

24816

1年前

+81today

Kotaemon

chatbot

An open-source RAG-based tool for chatting with your documents.

24731

1年前

+27today

Gpt Researcher

agent

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

24437

1年前

+44today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator