AIbase
Product LibraryTool NavigationMCP

ChatGLM-LoRA-RLHF-PyTorch

Public

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

Creat2023-04-18T14:03:53
Update2025-02-09T17:14:49
136
Stars
0
Stars Increase