stojchet committed on
Commit
45af223
1 Parent(s): 59bced1

End of training

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -17,13 +17,13 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1596
-- Eval/rewards/chosen: 2.1581
-- Eval/logps/chosen: -121.0155
-- Eval/rewards/rejected: -14.2694
-- Eval/logps/rejected: -323.7450
-- Eval/rewards/margins: 16.4275
-- Eval/kl: 3.1515
+- Loss: 0.1316
+- Eval/rewards/chosen: 2.3534
+- Eval/logps/chosen: -119.0631
+- Eval/rewards/rejected: -14.8466
+- Eval/logps/rejected: -329.5165
+- Eval/rewards/margins: 17.1999
+- Eval/kl: 2.9675
 
 ## Model description
 
@@ -59,7 +59,7 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | |
 |:-------------:|:------:|:----:|:---------------:|:------:|
-| 0.0966 | 1.7058 | 100 | 0.1596 | 3.1515 |
+| 0.0963 | 1.7058 | 100 | 0.1316 | 2.9675 |
 
 
 ### Framework versions