Travel-Agent-based-on-Qwen2-RLHF
PublicA travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain
Creat:2024-04-19T15:07:03
Update:2025-03-26T14:47:57
22
Stars
0
Stars Increase