Prompt-OIRL
Public code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
Topics: inverse-reinforcement-learning, irl, large-language-models, llm, offline-irl, offline-rl, prompt-engineering, rlaif, rlhf
Created: 2023-09-11T01:06:59
Updated: 2025-03-17T20:12:40
https://arxiv.org/pdf/2309.06553.pdf
Stars: 41
Stars Increase: 0