Text Classification
Transformers
PyTorch
English
deberta-v2
Inference Endpoints
ikrysinska commited on
Commit
73ed339
1 Parent(s): 0ca1b82

Update README.md

Browse files

Add evaluation metrics

Files changed (1) hide show
  1. README.md +16 -42
README.md CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: text-classification
12
  ### Model Description
13
 
14
  <!-- Provide a longer summary of what this model is. -->
15
- This is a DeBERTa-v3-base-tasksource-nli model with an adapter trained on [More Information Needed, which contains X pairs of a tweet and a conspiracy theory along with class labels: support, denies, neutral. The model was finetuned for text classification to predict whether a tweet supports a given conspiracy theory or not. The model was trained on tweets related to six common COVID-19 conspiracy theories.
16
 
17
  1. **Vaccines are unsafe.** The coronavirus vaccine is either unsafe or part of a larger plot to control people or reduce the population.
18
 
@@ -60,7 +60,6 @@ This model is suitable for English only.
60
 
61
  ## Bias, Risks, and Limitations
62
 
63
-
64
  <!-- This section is meant to convey both technical and sociotechnical limitations. -->
65
 
66
  [More Information Needed]
@@ -89,59 +88,35 @@ Use the code below to get started with the model.
89
 
90
  The adapter was trained for 5 epochs with a batch size of 16.
91
 
92
- #### Preprocessing [optional]
93
 
94
  The training data was cleaned before the training. All URLs, Twitter user mentions, and non-ASCII characters were removed.
95
 
96
  ## Evaluation
97
 
98
- The model was evaluated on a sample of the tweets collected during the COVID-19 pandemic. All the tweets were rated against each of the six theories by five annotators. Using sliding scales, they rated each tweets' endorsement likelihood for the respective conspiracy theory from 0% to 100%. The consensus among raters was substantial for every conspiracy theory (see table below).
99
-
100
- ### Testing Data, Factors & Metrics
101
 
102
- #### Testing Data
103
 
104
- <!-- This should link to a Data Card if possible. -->
105
-
106
- [More Information Needed]
 
 
 
 
 
107
 
108
- #### Factors
109
-
110
- The evaluation dataset
111
-
112
- [More Information Needed]
113
 
114
- #### Metrics
115
-
116
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
117
-
118
- [More Information Needed]
119
-
120
- ### Results
121
-
122
- [More Information Needed]
123
-
124
- #### Summary
125
-
126
-
127
-
128
- ## Model Examination [optional]
129
-
130
- <!-- Relevant interpretability work for the model goes here -->
131
-
132
- [More Information Needed]
133
 
134
  ## Environmental Impact
135
 
136
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
137
-
138
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
139
 
140
  - **Hardware Type:** GPU Tesla V100
141
  - **Hours used:** 40
142
  - **Cloud Provider:** Google Cloud Platform
143
  - **Compute Region:** us-east1
144
- - **Carbon Emitted:** 4.44 kg CO2 ([equivalent to: 17.9 km driven by an average ICE car, 2.22 kgs of coal burned, 0.07 tree seedlings sequesting carbon for 10 years](https://www.epa.gov/energy/greenhouse-gases-equivalencies-calculator-calculations-and-references)
145
 
146
 
147
  ## Citation [optional]
@@ -162,16 +137,15 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
162
 
163
  [More Information Needed]
164
 
165
- ## More Information [optional]
166
-
167
- [More Information Needed]
168
 
169
- ## Model Card Authors [optional]
170
 
171
  @ikrysinska, @wtomi
172
 
173
  ## Model Card Contact
174
 
175
 
176
 
177
 
12
  ### Model Description
13
 
14
  <!-- Provide a longer summary of what this model is. -->
15
+ This is a DeBERTa-v3-base-tasksource-nli model with an adapter trained on [More Information Needed], which contains X pairs of a tweet and a conspiracy theory along with class labels: support, deny, neutral. The model was finetuned for text classification to predict whether a tweet supports a given conspiracy theory or not. The model was trained on tweets related to six common COVID-19 conspiracy theories.
16
 
17
  1. **Vaccines are unsafe.** The coronavirus vaccine is either unsafe or part of a larger plot to control people or reduce the population.
18
 
 
60
 
61
  ## Bias, Risks, and Limitations
62
 
 
63
  <!-- This section is meant to convey both technical and sociotechnical limitations. -->
64
 
65
  [More Information Needed]
 
88
 
89
  The adapter was trained for 5 epochs with a batch size of 16.
90
 
91
+ #### Preprocessing
92
 
93
  The training data was cleaned before the training. All URLs, Twitter user mentions, and non-ASCII characters were removed.
94
 
95
  ## Evaluation
96
 
97
+ The model was evaluated on a sample of the tweets collected during the COVID-19 pandemic. All the tweets were rated against each of the six theories by five annotators. Using sliding scales, they rated each tweets' endorsement likelihood for the respective conspiracy theory from 0% to 100%. The consensus among raters was substantial for every conspiracy theory. Comparisons with human evaluations revealed substantial correlations. The model significantly surpasses the performance of the pre-trained model without the finetuned adapter (see table below).
 
 
98
 
 
99
 
100
+ | Conspiracy Theory | Correlations between human raters | Correlation between human ratings and model without adapter | Correlation between human ratings and model with finetuned adapter |
101
+ |---|---|---|---|
102
+ | **Vaccines are unsafe.** | 0.78 | 0.29 | 0.57 |
103
+ | **Governments and politicians spread misinformation.** | 0.58 | 0.32 | 0.72 |
104
+ | **The Chinese intentionally spread the virus.** | 0.62 | 0.53 | 0.64 |
105
+ | **Deliberate strategy to create economic instability or benefit large corporations.** | 0.56 | 0.33 | 0.54 |
106
+ | **Public was intentionally misled about the true nature of the virus and prevention.** | 0.66 | 0.37 | 0.68 |
107
+ | **Human made and bioweapon.** | 0.67 | 0.15 | .78 |
108
 
 
 
 
 
 
109
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
110
 
111
  ## Environmental Impact
112
 
113
+ Carbon emissions are estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
 
 
114
 
115
  - **Hardware Type:** GPU Tesla V100
116
  - **Hours used:** 40
117
  - **Cloud Provider:** Google Cloud Platform
118
  - **Compute Region:** us-east1
119
+ - **Carbon Emitted:** 4.44 kg CO2 eq ([equivalent to: 17.9 km driven by an average ICE car, 2.22 kgs of coal burned, 0.07 tree seedlings sequesting carbon for 10 years](https://www.epa.gov/energy/greenhouse-gases-equivalencies-calculator-calculations-and-references)
120
 
121
 
122
  ## Citation [optional]
 
137
 
138
  [More Information Needed]
139
 
 
 
 
140
 
141
+ ## Model Card Authors
142
 
143
  @ikrysinska, @wtomi
144
 
145
  ## Model Card Contact
146
 
147
148
+
149
150
+
151