DPO-ST

Public

[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

chain-of-thought dpo math-word-problem

Created: 2024-06-04T23:37:20

Updated: 2025-02-28T09:50:26

https://arxiv.org/abs/2407.18248


Related projects

LeetCode Go

acm-icpc

Solutions to LeetCode in Go, 100% test coverage, runtime beats 100%

33768

1 year ago

+5 today

PDFMathTranslate

Hot

chinese

PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout, supporting services such as Google/DeepL/Ollama/OpenAI, available via CLI/GUI/Docker/Zotero

30396

1 year ago

+64 today

YAPI

ansi

A Collection of useful Methods in Java

27739

3 years ago

-1 today

Etherpad Lite

collaboration

Etherpad: A modern really-real-time collaborative document editor.

17966

1 year ago

+10 today

Awesome Multimodal Large Language Models

chain-of-thought

Latest Advances on Multimodal Large Language Models

16892

1 year ago

+44 today

Numpy Ml

attention

Machine learning, in numpy

16210

1 year ago

+1 today

LaTeX OCR

dataset

pix2tex: Using a ViT to convert images of equations into LaTeX code.

16012

1 year ago

+12 today

Chinese Word Vectors

chinese

100+ Chinese Word Vectors: over a hundred pre-trained Chinese word vectors

12140

1 year ago

+5 today

LLMSurvey

chain-of-thought

The official GitHub page for the survey paper "A Survey of Large Language Models".

11998

1 year ago

+8 today

Univer

appscript

Univer is a full-stack framework for creating and editing spreadsheets, documents, and slides on both web and server.

11819

1 year ago

+20 today
