notus
PublicNotus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Creat:2023-11-16T18:01:23
Update:2025-03-20T04:58:35
168
Stars
0
Stars Increase
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach