LLM-RLHF
PublicThis repository contains some of the most influential papers of on the RLHF technique of fine-tuning LLMs.
finetuning-large-language-modelslargelanguagemodelllm-trainingmachine-learningreinforcement-learning
Creat:2024-01-21T03:10:55
Update:2024-01-21T20:05:21
0
Stars
0
Stars Increase