Commit 758ba71 by dnozza · 1 Parent(s): 6c6d3e6

Update README.md

Files changed (1)
  1. README.md +17 -1
README.md CHANGED
@@ -33,12 +33,28 @@ HATE-ITA is a binary hate speech classification model for Italian social media t
 
  ## Abstract
 
- Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly. While Natural Language Processing has been successfully used for the purpose, many of the research efforts are directed toward the English language. This choice severely limits the classification power in non-English languages. In this paper, we test several learning frameworks for identifying hate speech in Italian text. We release HATE-ITA, a set of multi-language models trained on a large set of English data and available Italian datasets. HATE-ITA performs better than mono-lingual models and seems to adapt well also on language-specific slurs. We believe our findings will encourage research in other mid-to-low resource communities and provide a valuable benchmarking tool for the Italian community.
+ Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly. While Natural Language Processing has been successfully used for the purpose, many of the research efforts are directed toward the English language. This choice severely limits the classification power in non-English languages. In this paper, we test several learning frameworks for identifying hate speech in Italian text. We release **HATE-ITA, a set of multi-language models trained on a large set of English data and available Italian datasets**. HATE-ITA performs better than mono-lingual models and seems to adapt well also on language-specific slurs. We believe our findings will encourage research in other mid-to-low resource communities and provide a valuable benchmarking tool for the Italian community.
 
  ## Model
 
  This model is the fine-tuned version of the [XLM-RoBERTa-large](https://huggingface.co/xlm-roberta-large) model.
 
+ | Model | Download |
+ | ------ | -------------------------|
+ | `hate-ita` | [Link](https://huggingface.co/MilaNLProc/hate-ita) |
+ | `hate-ita-xlm-r-base` | [Link](https://huggingface.co/MilaNLProc/hate-ita-xlm-r-base) |
+ | `hate-ita-xlm-r-large` | [Link](https://huggingface.co/MilaNLProc/hate-ita-xlm-r-large) |
+
+
+
+ ## Usage
+
+ ```python
+ from transformers import pipeline
+ classifier = pipeline("text-classification",model='MilaNLProc/hate-ita',top_k=2)
+ prediction = classifier("ti odio")
+ print(prediction)
+ ```
 
  ## Citation
  Please use the following BibTeX entry if you use this model in your project:
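
Note on the Usage snippet added in this commit: it prints the raw `prediction` object. The sketch below shows one way to reduce such a result to a single label. It assumes the pipeline returns `{"label", "score"}` dicts, either as a flat list or wrapped in one extra outer list (the nesting can differ between `transformers` versions). The helper name `top_label`, the label strings, and the scores are hypothetical stand-ins, not output from the model.

```python
from typing import Any, Tuple


def top_label(prediction: Any) -> Tuple[str, float]:
    """Return the (label, score) pair with the highest score.

    Accepts a flat list of {"label", "score"} dicts, or that same list
    wrapped in one extra outer list.
    """
    # Unwrap one level of nesting if the first element is itself a list.
    if prediction and isinstance(prediction[0], list):
        prediction = prediction[0]
    best = max(prediction, key=lambda item: item["score"])
    return best["label"], float(best["score"])


# Hand-written stand-in for the classifier output. The label names and
# scores are illustrative assumptions, not real model predictions.
example = [[{"label": "hate", "score": 0.9}, {"label": "non-hate", "score": 0.1}]]
print(top_label(example))
```

With the real pipeline, `top_label(classifier("ti odio"))` would be the corresponding call, provided the output has one of the two shapes assumed above.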