Update README.md
#3
by
Willcherry444
- opened
- README.md +6 -56
- config.json +6 -6
README.md
CHANGED
@@ -1,16 +1,14 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
widget:
|
4 |
-
- text: Covid cases are increasing fast!
|
5 |
-
datasets:
|
6 |
-
- tweet_eval
|
7 |
---
|
8 |
|
9 |
|
10 |
-
# Twitter-roBERTa-base for Sentiment Analysis - UPDATED (
|
11 |
|
12 |
-
This is a
|
13 |
-
The original
|
14 |
|
15 |
- Reference Paper: [TimeLMs paper](https://arxiv.org/abs/2202.03829).
|
16 |
- Git Repo: [TimeLMs official repository](https://github.com/cardiffnlp/timelms).
|
@@ -20,8 +18,6 @@ The original Twitter-based RoBERTa model can be found [here](https://huggingface
|
|
20 |
1 -> Neutral;
|
21 |
2 -> Positive
|
22 |
|
23 |
-
This sentiment analysis model has been integrated into [TweetNLP](https://github.com/cardiffnlp/tweetnlp). You can access the demo [here](https://tweetnlp.org).
|
24 |
-
|
25 |
## Example Pipeline
|
26 |
```python
|
27 |
from transformers import pipeline
|
@@ -83,50 +79,4 @@ Output:
|
|
83 |
1) Negative 0.7236
|
84 |
2) Neutral 0.2287
|
85 |
3) Positive 0.0477
|
86 |
-
```
|
87 |
-
|
88 |
-
|
89 |
-
### References
|
90 |
-
```
|
91 |
-
@inproceedings{camacho-collados-etal-2022-tweetnlp,
|
92 |
-
title = "{T}weet{NLP}: Cutting-Edge Natural Language Processing for Social Media",
|
93 |
-
author = "Camacho-collados, Jose and
|
94 |
-
Rezaee, Kiamehr and
|
95 |
-
Riahi, Talayeh and
|
96 |
-
Ushio, Asahi and
|
97 |
-
Loureiro, Daniel and
|
98 |
-
Antypas, Dimosthenis and
|
99 |
-
Boisson, Joanne and
|
100 |
-
Espinosa Anke, Luis and
|
101 |
-
Liu, Fangyu and
|
102 |
-
Mart{\'\i}nez C{\'a}mara, Eugenio" and others,
|
103 |
-
booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations",
|
104 |
-
month = dec,
|
105 |
-
year = "2022",
|
106 |
-
address = "Abu Dhabi, UAE",
|
107 |
-
publisher = "Association for Computational Linguistics",
|
108 |
-
url = "https://aclanthology.org/2022.emnlp-demos.5",
|
109 |
-
pages = "38--49"
|
110 |
-
}
|
111 |
-
|
112 |
-
```
|
113 |
-
|
114 |
-
```
|
115 |
-
@inproceedings{loureiro-etal-2022-timelms,
|
116 |
-
title = "{T}ime{LM}s: Diachronic Language Models from {T}witter",
|
117 |
-
author = "Loureiro, Daniel and
|
118 |
-
Barbieri, Francesco and
|
119 |
-
Neves, Leonardo and
|
120 |
-
Espinosa Anke, Luis and
|
121 |
-
Camacho-collados, Jose",
|
122 |
-
booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations",
|
123 |
-
month = may,
|
124 |
-
year = "2022",
|
125 |
-
address = "Dublin, Ireland",
|
126 |
-
publisher = "Association for Computational Linguistics",
|
127 |
-
url = "https://aclanthology.org/2022.acl-demo.25",
|
128 |
-
doi = "10.18653/v1/2022.acl-demo.25",
|
129 |
-
pages = "251--260"
|
130 |
-
}
|
131 |
-
|
132 |
-
```
|
|
|
1 |
---
|
2 |
+
language: english
|
3 |
widget:
|
4 |
+
- text: "Covid cases are increasing fast!"
|
|
|
|
|
5 |
---
|
6 |
|
7 |
|
8 |
+
# Twitter-roBERTa-base for Sentiment Analysis - UPDATED (2021)
|
9 |
|
10 |
+
This is a roBERTa-base model trained on ~124M tweets from January 2018 to December 2021 (see [here](https://huggingface.co/cardiffnlp/twitter-roberta-base-2021-124m)), and finetuned for sentiment analysis with the TweetEval benchmark.
|
11 |
+
The original roBERTa-base model can be found [here](https://huggingface.co/cardiffnlp/twitter-roberta-base-2021-124m) and the original reference paper is [TweetEval](https://github.com/cardiffnlp/tweeteval). This model is suitable for English.
|
12 |
|
13 |
- Reference Paper: [TimeLMs paper](https://arxiv.org/abs/2202.03829).
|
14 |
- Git Repo: [TimeLMs official repository](https://github.com/cardiffnlp/timelms).
|
|
|
18 |
1 -> Neutral;
|
19 |
2 -> Positive
|
20 |
|
|
|
|
|
21 |
## Example Pipeline
|
22 |
```python
|
23 |
from transformers import pipeline
|
|
|
79 |
1) Negative 0.7236
|
80 |
2) Neutral 0.2287
|
81 |
3) Positive 0.0477
|
82 |
+
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
config.json
CHANGED
@@ -12,16 +12,16 @@
|
|
12 |
"hidden_dropout_prob": 0.1,
|
13 |
"hidden_size": 768,
|
14 |
"id2label": {
|
15 |
-
"0": "
|
16 |
-
"1": "
|
17 |
-
"2": "
|
18 |
},
|
19 |
"initializer_range": 0.02,
|
20 |
"intermediate_size": 3072,
|
21 |
"label2id": {
|
22 |
-
"
|
23 |
-
"
|
24 |
-
"
|
25 |
},
|
26 |
"layer_norm_eps": 1e-05,
|
27 |
"max_position_embeddings": 514,
|
|
|
12 |
"hidden_dropout_prob": 0.1,
|
13 |
"hidden_size": 768,
|
14 |
"id2label": {
|
15 |
+
"0": "Negative",
|
16 |
+
"1": "Neutral",
|
17 |
+
"2": "Positive"
|
18 |
},
|
19 |
"initializer_range": 0.02,
|
20 |
"intermediate_size": 3072,
|
21 |
"label2id": {
|
22 |
+
"Negative": 0,
|
23 |
+
"Neutral": 1,
|
24 |
+
"Positive": 2
|
25 |
},
|
26 |
"layer_norm_eps": 1e-05,
|
27 |
"max_position_embeddings": 514,
|