Logic-RL-Lite

Public

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

Created: 2025-02-28T23:22:29
Updated: 2025-03-26T22:50:05
Stars: 48
Stars increase: 0
