Logic-RL-Lite

Public

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

deepseek deepseek-r1 fine-tuning gpt-o1 llm post-training reasoning-language-models reasoning-models reinforcement-learning

Creat：2025-02-28T23:22:29

Update：2025-03-26T22:50:05

Stars

Stars Increase

Related projects

Lobe Chat

Hot

? Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.

68780

1年前

+205today

Vllm

Hot

amd

A high-throughput and memory-efficient inference and serving engine for LLMs

64909

2年前

+236today

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

63700

1年前

+147today

Anything Llm

Hot

agent-framework-javascript

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

51978

1年前

+113today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

49117

1年前

+115today

Llama_index

Hot

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

45704

1年前

+65today

JeecgBoot

activiti

?「AI 低代码平台」前后端分离 SpringBoot 2.x/3.x，SpringCloud，Ant Design&Vue3，Mybatis，Shiro！强大的代码生成器让前后端代码一键生成，无需写任何代码! 引领AI低代码开发模式 AI生成->OnlineCoding->代码生成->手工MERGE，帮助Java项目解决80%重复工作，让开发更关注业务，提高开发效率、节省成本，同时又不失灵活性

44617

1年前

+38today

Pake

Hot

chatgpt

?? Turn any webpage into a desktop app with Rust. ?? 利用 Rust 轻松构建轻量级多端桌面应用

43758

1年前

+73today

Chatgpt On Wechat

Hot

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

40008

1年前

+71today

Siyuan

Hot

anki

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

39629

1年前

+116today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator