weqweasdas commited on
Commit
6ac4b2e
1 Parent(s): 6c417bc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -87,7 +87,7 @@ We train the model for one epoch with a learning rate of 1e-5, batch size 256, c
87
 
88
  We collect the existing preference datasets and use them as a benchmark to evaluate the resulting reawrd model.
89
 
90
-
91
 
92
 
93
 
 
87
 
88
  We collect the existing preference datasets and use them as a benchmark to evaluate the resulting reawrd model.
89
 
90
+ Note that for MT-Bench dataset (lmsys/mt_bench_human_judgments), we delete the samples with tie as the comparison results. The Alpaca data is from [Here](https://huggingface.co/datasets/tatsu-lab/alpaca_eval/tree/main).
91
 
92
 
93