filtered-dpo

Public

Filtered Direct Preference Optimization (fDPO) improves the alignment of language models with human preferences by discarding training samples whose quality is lower than that of the text generated by the model being trained.
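The filtering idea described above can be illustrated with a minimal sketch. It is not the project's implementation: the names `PreferencePair`, `filter_preference_pairs`, `generate`, and `score` are hypothetical, and the sketch assumes that quality is measured by some scoring function (for example a reward model) and that the comparison is made against the preferred ("chosen") response of each pair.

```python
from typing import Callable, List, NamedTuple


class PreferencePair(NamedTuple):
    """One preference sample (hypothetical layout): a prompt with a
    preferred and a dispreferred response."""
    prompt: str
    chosen: str
    rejected: str


def filter_preference_pairs(
    pairs: List[PreferencePair],
    generate: Callable[[str], str],
    score: Callable[[str, str], float],
) -> List[PreferencePair]:
    """Keep only pairs whose chosen response scores at least as high as
    the response the current model generates for the same prompt.

    `generate` stands in for the model being trained and `score` for a
    quality estimate such as a reward model; both are assumptions.
    """
    kept = []
    for pair in pairs:
        model_response = generate(pair.prompt)
        chosen_score = score(pair.prompt, pair.chosen)
        model_score = score(pair.prompt, model_response)
        # Discard the sample when the model already produces better text.
        if chosen_score >= model_score:
            kept.append(pair)
    return kept
```

In a training loop, a filter of this kind would presumably be re-applied periodically (for instance once per epoch), since the model's own outputs improve as training proceeds and more dataset samples may fall below them.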

Created: 2024-04-15T14:03:47
Updated: 2025-01-22T12:57:52
https://arxiv.org/abs/2404.13846
Stars: 16
Stars Increase: 0

Related projects