DeepSeek-R1 is an inference model trained through large-scale reinforcement learning. It performs excellently in mathematics, code, and reasoning tasks, and can demonstrate powerful reasoning abilities without supervised fine-tuning, including self-verification, reflection, and generating long thought chains, etc.
Natural Language Processing
TransformersEnglish