AIbase
プロダクトライブラリツールナビゲーションMCP

Alpaca-LoRA-RLHF-PyTorch

Public

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

作成時間2023-04-18T14:03:08
更新時間2024-12-24T11:41:43
58
Stars
0
Stars Increase

関連プロジェクト