AIBase
AI News

360 ZhiNao Team Successfully Replicates DeepSeek Reinforcement Learning Results, Releases Open-Source Model Light-R1-14B-DS

The 360 ZhiNao team recently announced that it has replicated DeepSeek's reinforcement learning results and officially released the open-source reasoning model Light-R1-14B-DS. The model outperforms DeepSeek-R1-Distill-Llama-70B and DeepSeek-R1-Distill-Qwen-32B and is described as the first 14B-parameter model to show gains from reinforcement learning. Its mathematical reasoning improves significantly, surpassing most 32B-parameter models.

10.2k 5 days ago

AI Products

Light-R1-14B-DS

An open-source 14B-parameter mathematical model, trained using reinforcement learning, with excellent performance.

AI model
12.5k

Models

Light R1 14B DS GGUF

qihoo360

Light-R1-14B-DS is a 14B-parameter quantized large language model supporting text generation tasks, designed for efficient inference in resource-constrained environments.

Natural Language Processing, GGUF
2.8k
9

Light R1 14B DS

qihoo360

Light-R1-14B-DS is a 14B-parameter math model trained with reinforcement learning, reported as state of the art for its size and excelling on the AIME24/25 and GPQA benchmarks.

Natural Language Processing, Transformers
2.9k
33
Empowering the future, your artificial intelligence solution think tank
© 2026 AIBase