ChainForge-R1-SuperCoT

Public

A multi-stage pipeline that enhances Qwen2.5 language models with DeepSeek Reasoner's chain-of-thought capabilities. Implements the DeepSeek-R1 methodology through cold-start SFT, reasoning-oriented RL, rejection sampling, and optional model distillation.

ai cold-start-sft deepseek deepseek-r1 qwen r1 reasoning training

Creat：2025-01-25T03:13:53

Update：2025-02-24T17:02:19

Stars

Stars Increase

Related projects

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

178997

8个月前

+17today

Stable Diffusion Webui

Stable Diffusion web UI

157233

1年前

+15today

N8n

Hot

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

148421

4年前

+365today

Langchain

Hot

Elixir implementation of a LangChain style framework that lets Elixir projects integrate with and leverage LLMs.

117179

1年前

+90today

Dify

Hot

agent

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

116370

6个月前

+83today