PaLM-rlhf-pytorch
PublicImplementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
artificial-intelligenceattention-mechanismsdeep-learninghuman-feedbackreinforcement-learningtransformers
Creat:2022-12-10T01:53:46
Update:2025-03-25T19:38:43
7.9K
Stars
0
Stars Increase