llm-summarization
PublicLoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset
document-summarizationfine-tuningllama3llmreinforcement-learningretrieval-augmented-generationsupervised-learningtldr
Creat:2024-03-19T23:27:02
Update:2025-03-12T04:28:41
12
Stars
0
Stars Increase