JeremiahZ autoevaluator HF staff committed on
Commit
c2d9067
1 Parent(s): 3fd6c7c

Add evaluation results on the mnli_matched config and validation split of glue (#1)

- Add evaluation results on the mnli_matched config and validation split of glue (d7a2ae125a388343bf14b92bcfb9f4d69a90e7db)


Co-authored-by: Evaluation Bot <[email protected]>

Files changed (1)
  1. README.md +53 -0
README.md CHANGED
@@ -22,6 +22,59 @@ model-index:
     - name: Accuracy
       type: accuracy
       value: 0.8500813669650122
+  - task:
+      type: natural-language-inference
+      name: Natural Language Inference
+    dataset:
+      name: glue
+      type: glue
+      config: mnli_matched
+      split: validation
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.8467651553744269
+      verified: true
+    - name: Precision Macro
+      type: precision
+      value: 0.8460148987014974
+      verified: true
+    - name: Precision Micro
+      type: precision
+      value: 0.8467651553744269
+      verified: true
+    - name: Precision Weighted
+      type: precision
+      value: 0.8475656756385261
+      verified: true
+    - name: Recall Macro
+      type: recall
+      value: 0.8463172075485045
+      verified: true
+    - name: Recall Micro
+      type: recall
+      value: 0.8467651553744269
+      verified: true
+    - name: Recall Weighted
+      type: recall
+      value: 0.8467651553744269
+      verified: true
+    - name: F1 Macro
+      type: f1
+      value: 0.8459654597797398
+      verified: true
+    - name: F1 Micro
+      type: f1
+      value: 0.8467651553744269
+      verified: true
+    - name: F1 Weighted
+      type: f1
+      value: 0.8469586362613581
+      verified: true
+    - name: loss
+      type: loss
+      value: 0.42515239119529724
+      verified: true
 ---

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
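
Note on the added values: Precision Micro, Recall Micro, F1 Micro, and Recall Weighted all equal the Accuracy value (0.8467651553744269). For single-label multiclass classification this is expected, since every misclassified example counts as exactly one false positive and one false negative. The sketch below illustrates that relationship in plain Python. It is not the evaluation bot's code; the function name `classification_metrics` and the toy labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Accuracy plus micro/macro/weighted precision, recall, and F1.

    Assumes single-label classification: one true and one predicted
    label per example. Hypothetical helper, for illustration only.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    def ratio(num, den):
        # Return 0.0 instead of raising when a class has no predictions/support.
        return num / den if den else 0.0

    n = len(y_true)
    support = Counter(y_true)  # number of true examples per class
    precision = {c: ratio(tp[c], tp[c] + fp[c]) for c in labels}
    recall = {c: ratio(tp[c], tp[c] + fn[c]) for c in labels}
    f1 = {
        c: ratio(2 * precision[c] * recall[c], precision[c] + recall[c])
        for c in labels
    }

    def macro(per_class):
        # Unweighted mean over classes.
        return sum(per_class.values()) / len(labels)

    def weighted(per_class):
        # Mean over classes, weighted by each class's support.
        return sum(per_class[c] * support[c] for c in labels) / n

    total_tp = sum(tp.values())
    micro_p = ratio(total_tp, total_tp + sum(fp.values()))
    micro_r = ratio(total_tp, total_tp + sum(fn.values()))
    return {
        "accuracy": total_tp / n,
        "precision_micro": micro_p,
        "recall_micro": micro_r,
        "f1_micro": ratio(2 * micro_p * micro_r, micro_p + micro_r),
        "precision_macro": macro(precision),
        "recall_macro": macro(recall),
        "f1_macro": macro(f1),
        "precision_weighted": weighted(precision),
        "recall_weighted": weighted(recall),
        "f1_weighted": weighted(f1),
    }


# Toy three-class example (made-up labels, not MNLI data).
y_true = [0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0, 2]
for name, value in classification_metrics(y_true, y_pred).items():
    print(f"{name}: {value:.4f}")
```

The macro and weighted precision and F1 scores generally differ from accuracy because they average per-class scores, which matches the small spread seen in the values above.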