Text Classification
Transformers
PyTorch
English
deberta-v2
Inference Endpoints
ikrysinska commited on
Commit
3693df1
1 Parent(s): 0783058

Update README.md

Browse files

Add misuse and limitations and training details

Files changed (1) hide show
  1. README.md +14 -30
README.md CHANGED
@@ -3,6 +3,7 @@ language: en
3
  license: cc-by-4.0
4
  datasets:
5
  - multi_nli
 
6
  library_name: transformers
7
  pipeline_tag: text-classification
8
  ---
@@ -40,42 +41,27 @@ This model is suitable for English only.
40
 
41
  - **Paper:** [More Information Needed]
42
 
43
- - ## Uses
 
44
 
45
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
46
-
47
- [More Information Needed]
48
-
49
- ### Downstream Use [optional]
50
-
51
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
52
-
53
- [More Information Needed]
54
 
55
  ### Out-of-Scope Use
56
 
57
- - spreading/generating tweets that support a given conspiracy theory
58
- - amplifying echo chambers of social subnetworks susceptible to believe in conspiracy theories
59
-
60
-
61
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
62
-
63
- [More Information Needed]
64
 
65
  ## Bias, Risks, and Limitations
66
 
67
- - results are distorted for the conspiracy theories out of the training dataset
68
- - unintentional stifling of legitimate public discourse (elimination of discussion that resembles conspiracy theories from social subnetworks)
69
- - bias: text style, economic status...
70
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
71
-
72
- [More Information Needed]
73
 
74
- ### Recommendations
75
 
76
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
77
-
78
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
79
 
80
  ## How to Get Started with the Model
81
 
@@ -87,9 +73,7 @@ Use the code below to get started with the model.
87
 
88
  ### Training Data
89
 
90
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
91
-
92
- [More Information Needed]
93
 
94
  ### Training Procedure
95
 
 
3
  license: cc-by-4.0
4
  datasets:
5
  - multi_nli
6
+ - webimmunization/COVID-19-conspiracy-theories-tweets
7
  library_name: transformers
8
  pipeline_tag: text-classification
9
  ---
 
41
 
42
  - **Paper:** [More Information Needed]
43
 
44
+ ## Uses
45
+ The model was trained to classify a pair of short texts: tweet and conspiracy theory. The model returns a float number which represents the likelihood that the tweet supports a given conspiracy theory.
46
 
 
 
 
 
 
 
 
 
 
47
 
48
  ### Out-of-Scope Use
49
 
50
+ **Spreading/Generating Tweets that support conspiracy theories:**
51
+ This model is explicitly designed for the purpose of classifying and understanding tweets related to COVID-19 conspiracy theories, particularly to determine whether a tweet supports or denies a specific conspiracy theory. It is not intended for, and should not be used to generate or propagate tweets that endorse or support any conspiracy theory. Any use of the model for such purposes is considered unethical and goes against the intended use case.
52
+ **Amplifying echo chambers of social subnetworks susceptible to conspiracy theories:**
53
+ While the model can help identify tweets that are related to conspiracy theories, it is important to note that it should not be used to target or amplify echo chambers or social subnetworks that are susceptible to believing in conspiracy theories. Ethical use of this model involves promoting responsible and unbiased information dissemination and discourages actions that may contribute to the spread of misinformation or polarization. Users should be cautious about using this model in ways that may further divide communities or promote harmful narratives.
 
 
 
54
 
55
  ## Bias, Risks, and Limitations
56
 
57
+ **Results may be distorted for conspiracy theories out of the training dataset:**
58
+ This model has been specifically fine-tuned to classify tweets related to a predefined set of COVID-19 conspiracy theories. As a result, its performance may not be as reliable when applied to conspiracy theories or topics that were not included in the training data. Users should exercise caution and consider the potential for distorted results when applying this model to subjects beyond its training scope. The model may not perform well in categorizing or understanding content that falls outside the designated conspiracy theories.
59
+ **Unintentional stifling of legitimate public discourse:**
60
+ The model's primary purpose is to identify tweets related to COVID-19 conspiracy theories, and it is not intended to stifle legitimate public discourse or eliminate discussions that merely resemble conspiracy theories. There is a risk that using this model inappropriately may lead to the suppression of valid conversations and the removal of content that is not explicitly conspiratorial but might be flagged due to similarities in language or topic. Users should be aware of this limitation and use the model judiciously, ensuring that it does not impede the free exchange of ideas and discussions.
61
+ **Bias in decision making:**
62
+ Like many machine learning models, this model may exhibit bias in its decision-making process. Factors such as the text style which may represent the socio-economical status of the individuals may inadvertently affect the model's classifications. The model's outputs may not always be entirely free from bias and to use its predictions as supplementary information rather than definitive judgments.
63
 
 
64
 
 
 
 
65
 
66
  ## How to Get Started with the Model
67
 
 
73
 
74
  ### Training Data
75
 
76
+ The model was finetuned with [webimmunization/COVID-19-CT-tweets-classification and ](https://huggingface.co/webimmunization/COVID-19-CT-tweets-classification) and [mnli](https://huggingface.co/datasets/multi_nli) datasets.
 
 
77
 
78
  ### Training Procedure
79