发现与 Gigpo 相关的最受欢迎的开源项目和工具,了解最新的开发趋势和创新。
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"