JeremiahZ autoevaluator HF staff commited on
Commit
749a735
1 Parent(s): 6687715

Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator (#2)

Browse files

- Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator (d69b7275aaf0b5eb9a6de54b06d3ecb52f0ace33)


Co-authored-by: Evaluation Bot <[email protected]>

Files changed (1) hide show
  1. README.md +23 -17
README.md CHANGED
@@ -13,19 +13,19 @@ model-index:
13
  - name: roberta-base-qqp
14
  results:
15
  - task:
16
- name: Text Classification
17
  type: text-classification
 
18
  dataset:
19
  name: GLUE QQP
20
  type: glue
21
  args: qqp
22
  metrics:
23
- - name: Accuracy
24
- type: accuracy
25
  value: 0.9152609448429384
26
- - name: F1
27
- type: f1
28
  value: 0.8867138416771377
 
29
  - task:
30
  type: natural-language-inference
31
  name: Natural Language Inference
@@ -35,30 +35,36 @@ model-index:
35
  config: qqp
36
  split: validation
37
  metrics:
38
- - name: Accuracy
39
- type: accuracy
40
  value: 0.9153104130596093
 
41
  verified: true
42
- - name: Precision
43
- type: precision
44
  value: 0.8732009117551286
 
45
  verified: true
46
- - name: Recall
47
- type: recall
48
  value: 0.9007725898555593
 
49
  verified: true
50
- - name: AUC
51
- type: auc
52
  value: 0.9685235648551861
 
53
  verified: true
54
- - name: F1
55
- type: f1
56
  value: 0.8867724867724867
 
57
  verified: true
58
- - name: loss
59
- type: loss
60
  value: 0.4435121417045593
 
61
  verified: true
 
62
  ---
63
 
64
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
13
  - name: roberta-base-qqp
14
  results:
15
  - task:
 
16
  type: text-classification
17
+ name: Text Classification
18
  dataset:
19
  name: GLUE QQP
20
  type: glue
21
  args: qqp
22
  metrics:
23
+ - type: accuracy
 
24
  value: 0.9152609448429384
25
+ name: Accuracy
26
+ - type: f1
27
  value: 0.8867138416771377
28
+ name: F1
29
  - task:
30
  type: natural-language-inference
31
  name: Natural Language Inference
 
35
  config: qqp
36
  split: validation
37
  metrics:
38
+ - type: accuracy
 
39
  value: 0.9153104130596093
40
+ name: Accuracy
41
  verified: true
42
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMTBmYmQ4MjhlZDBkOWM4YzNiNTE3MDNhMDVlMDNhNmU4YjBiZjNmMDlhOGU2ZmZjMzAwODczNDA0NzkwMDJkMyIsInZlcnNpb24iOjF9.Xpv1jn9glM7lbsQNQvtCnQuueHeGLD0xzEaquc3HfB1p_zFvDRe38mv_B1aHt-YxR16AhfpIbENOM1sPTaAJDA
43
+ - type: precision
44
  value: 0.8732009117551286
45
+ name: Precision
46
  verified: true
47
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTYyYWEwOWE0YjI1NWJiNWMwNTMxOTc4OTFmYWI4MTJmMDRkMmEwYWRhMDAzYzVmNDA3Y2YzMzJkMDIzYzNjYyIsInZlcnNpb24iOjF9.O0KMG-s8zO6-tAat0HZRL6MN1ZaZQ_Ng3a_-qC5FndZefHktoJDSD9hiuZFTmlY6Vn1UkDlvG1XnnAi1Gv6pBg
48
+ - type: recall
49
  value: 0.9007725898555593
50
+ name: Recall
51
  verified: true
52
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjQ3MzQyM2FlZTc1Y2VjOGY4MGEwMzY2MGM0YjQwNzIwNmVjMmRlNmExYWFlZjU3ZTIyZmJkMGRiZmJkMGZhMCIsInZlcnNpb24iOjF9.eYT8-djtIVkGrr8rhjqE2arUYgXQY0so9o8F4dXkLQt1fNEVa9kxTicapp4h1yTfU2jPpH778J_nvMCzwqixDw
53
+ - type: auc
54
  value: 0.9685235648551861
55
+ name: AUC
56
  verified: true
57
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWRiZGYyYjE5MDFmNWQzN2ZkNGRkMTA4ZTEwNzYxMTg1NTNlY2VjODM0ZDY0NzA1NjQ3MGE2ZWNmY2MxYmNkMyIsInZlcnNpb24iOjF9.aQOO1uk3UON5hgbuMkKK93Yt1aRH4TpBad-KDwjj0_IM9Y11_-itRf6vZuWCkr0gZmyZ-4b0PA4v_dvf88y8Aw
58
+ - type: f1
59
  value: 0.8867724867724867
60
+ name: F1
61
  verified: true
62
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYTgxODBkMjZmMjdlZjAyNjA3Njk5OTA4NWExOWQ4NzMwNDlhODNlYTQ1NWZhM2JmNjhjOTA4ZjQxY2QwYTk4ZiIsInZlcnNpb24iOjF9.AjkBwMnuDZVnIXs6EE_ooluFrJSavg58EmUt5Oux2feFP7SvUaWbnetkHIyzBIKb5MEyxuPkSxXU3A6Di-t6CA
63
+ - type: loss
64
  value: 0.4435121417045593
65
+ name: loss
66
  verified: true
67
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNzU1ZDg0MGMyOGE0ZWM1Y2MwZjk0ZTAzNjc1MjBlZTUxNDIyNGZmN2EyZTUyZGM3N2E4NmQwOGUyNDBkOTVjNiIsInZlcnNpb24iOjF9.66LOnSclusAZY9uELpElvbcTuUVEJ95oXnspi9BHHw0tgwv38uUeq0cfojuQ_VsNN0UykiT0NooJdWaixpK4BA
68
  ---
69
 
70
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You