OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Created: 2024-10-15T13:53:45
Updated: 2025-03-27T08:59:26
Stars: 578
Stars increase: 2