DeepSeek: DeepSeek R1 Zero (free)

deepseek/deepseek-r1-zero:free

About DeepSeek: DeepSeek R1 Zero (free)

DeepSeek-R1-Zero is a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step. It's 671B parameters in size, with 37B active in an inference pass.

It demonstrates remarkable performance on reasoning. With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors.

DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. See DeepSeek R1 for the SFT model.

Specifications

Context Length

163,840

Tokenizer

Other

Pricing

Prompt

0.000

Completion

0.000

Image

0

Request

0

Last updated: 4/11/2025