Comprehensive performance. DeepSeek-R1 applies large-scale reinforcement learning in the post-training stage, which greatly improves the model's reasoning ability while requiring very little labeled data. On mathematics, code, and natural language reasoning tasks, its performance is comparable to the official release of OpenAI o1.
Available for