Tencent and RUC Gaoqiang Jointly Launch Open-Source Planning Evaluation Framework PlanningBench
Tencent Hunyuan team, along with Renmin University of China and other institutions, has open-sourced PlanningBench, a framework for evaluating and training large language models' planning abilities. It systematically abstracts tasks, constraints, and difficulty levels, covering over 30 planning task types, and supports data generation and validation to assess models' practical planning capabilities.....