Best Light-R1-14B-DS AI Tools & Models - Premium Light-R1-14B-DS News

AI News

360 ZhiNao Team Successfully Replicates DeepSeek Reinforcement Learning Results, Releases Open-Source Model Light-R1-14B-DS

Recently, the 360 ZhiNao team announced that it had replicated DeepSeek's reinforcement learning results and officially released the open-source reasoning model Light-R1-14B-DS. According to the team, the model outperforms DeepSeek-R1-Distill-Llama-70B and DeepSeek-R1-Distill-Qwen-32B, making it the first 14B-parameter model in the industry to show gains from reinforcement learning. It significantly improves mathematical reasoning, outperforming most 32B-parameter models.

10.2k 5 days ago

AI Products

Light-R1-14B-DS

An open-source 14B-parameter math reasoning model trained with reinforcement learning.

AI model

12.5k

Models

Light R1 14B DS GGUF

qihoo360

Light-R1-14B-DS is a 14B-parameter quantized large language model supporting text generation tasks, designed for efficient inference in resource-constrained environments.

Natural Language Processing GGUF

2.8k

Light R1 14B DS

qihoo360

Light-R1-14B-DS is a 14B-parameter state-of-the-art math model trained with reinforcement learning, excelling on the AIME24/25 and GPQA benchmarks.

Natural Language Processing

Transformers

2.9k
