epoch = 3.0
eval_avg_sts = 0.852688164945469
eval_sickr_spearman = 0.8305016923227837
eval_stsb_spearman = 0.8748746375681541
------ test ------
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| STS12 | STS13 | STS14 | STS15 | STS16 | STSBenchmark | SICKRelatedness |  Avg. |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| 77.30 | 86.62 | 81.69 | 86.02 | 84.17 |    85.79     |      81.62      | 83.32 |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
+------+------+------+------+------+------+------+------+
|  MR  |  CR  | SUBJ | MPQA | SST2 | TREC | MRPC | Avg. |
+------+------+------+------+------+------+------+------+
| 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
+------+------+------+------+------+------+------+------+
------ test ------
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| STS12 | STS13 | STS14 | STS15 | STS16 | STSBenchmark | SICKRelatedness |  Avg. |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| 77.30 | 86.62 | 81.69 | 86.02 | 84.17 |    85.79     |      81.62      | 83.32 |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
+-------+-------+-------+-------+-------+-------+-------+-------+
|   MR  |   CR  |  SUBJ |  MPQA |  SST2 |  TREC |  MRPC |  Avg. |
+-------+-------+-------+-------+-------+-------+-------+-------+
| 86.91 | 91.81 | 93.81 | 90.89 | 91.54 | 90.60 | 72.87 | 88.35 |
+-------+-------+-------+-------+-------+-------+-------+-------+
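As a sanity check on the log above, the `Avg.` columns and `eval_avg_sts` are consistent with plain unweighted means of the per-task scores. The sketch below assumes that is how they were computed. The values are copied from the log, and the variable names are illustrative rather than taken from the evaluation script.

```python
from statistics import mean

# Per-task test scores copied from the tables in the log (assumed to be percentages).
sts_scores = [77.30, 86.62, 81.69, 86.02, 84.17, 85.79, 81.62]
transfer_scores = [86.91, 91.81, 93.81, 90.89, 91.54, 90.60, 72.87]

# Dev-set Spearman correlations copied from the eval_* lines.
eval_sickr_spearman = 0.8305016923227837
eval_stsb_spearman = 0.8748746375681541

# Assumption: each reported average is an unweighted mean over its tasks.
sts_avg = round(mean(sts_scores), 2)
transfer_avg = round(mean(transfer_scores), 2)
eval_avg_sts = (eval_sickr_spearman + eval_stsb_spearman) / 2

print(f"STS Avg.      = {sts_avg:.2f}")
print(f"Transfer Avg. = {transfer_avg:.2f}")
print(f"eval_avg_sts  = {eval_avg_sts}")
```

Under this assumption the recomputed values agree with the logged `83.32`, `88.35`, and `eval_avg_sts`, up to floating-point rounding in the last digit.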