AIbase

Vicuna-LoRA-RLHF-PyTorch

Public

A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. It implements RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT, but with Vicuna.

Created: 2023-04-22T13:09:55
Updated: 2025-03-03T17:00:35
Stars: 217
Stars increase: 1