Update README.md
README.md CHANGED
@@ -49,7 +49,7 @@ library_name: transformers
 
 ## 1. Introduction
 
-We introduce
+We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1.
 DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning.
 With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors.
 However, DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. To address these issues and further enhance reasoning performance,