RewardModelingBeyondBradleyTerry

Public

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

inverse-reinforcement-learning large-language-models largelanguagemodels llm-aligment llmalignment reward reward-modeling reward-models rlhf

Creat：2024-09-19T05:11:40

Update：2025-03-25T21:26:07

https://sites.google.com/view/rewardmodels

Stars

Stars Increase

Related projects

Tensorflow

deep-learning

An Open Source Machine Learning Framework for Everyone

192719

2年前

+38today

Stable Diffusion Webui

Hot

Stable Diffusion web UI

158833

2年前

+73today

Transformers

Hot

bert

? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

153615

3年前

+136today

30 Seconds Of Code

astro

Coding articles to level up your development skills

125990

1年前

+35today

Rust

Hot

compiler

Empowering everyone to build reliable and efficient software.

108357

1年前

+91today

TypeScript

Hot

javascript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

107029

1年前

+53today

Generative Ai For Beginners

Hot

21 Lessons, Get Started Building with Generative AI ? https://microsoft.github.io/generative-ai-for-beginners/

102828

1年前

+172today

Pytorch

Hot

autograd

Tensors and Dynamic neural networks in Python with strong GPU acceleration

95701

1年前

+88today

Django

apps

The Web framework for perfectionists with deadlines.

86093

1年前

+42today

Opencv

c-plus-plus

Open Source Computer Vision Library

85202

8年前

+42today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator