Pclanglais
committed on
Update README.md
README.md
CHANGED
@@ -19,6 +19,7 @@ should probably proofread and complete it, then remove this comment. -->

 Pleias-Topic-Detection is a fine-tuned version of t5-small, trained on a set of 70,000 documents and associated topics from Common Corpus. Although t5-small was reportedly trained only on English, the model shows an unexpected capacity for multilingual annotation. The final corpus includes a significant amount of text in French, Spanish, Italian, Dutch and German, and the model has been shown to work to some extent in all of these languages.

+Given that Pleias-Topic-Detection is a relatively lightweight model (70 million parameters), it can be used for classification at scale on a large corpus.

 ### Training hyperparameters

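
The README line added in this commit mentions classification at scale on a large corpus. A minimal sketch of what that could look like is given below, assuming the corpus is an iterable of strings and the model is wrapped in a callable that maps a batch of texts to a batch of topic labels. The names `batched`, `annotate_corpus`, `predict_topics` and `placeholder_predict` are hypothetical and do not come from the README. The placeholder predictor only stands in for the model so that the sketch runs without downloading anything.

```python
from typing import Callable, Iterable, Iterator, List, Tuple


def batched(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly shorter, batch.
        yield batch


def annotate_corpus(
    documents: Iterable[str],
    predict_topics: Callable[[List[str]], List[str]],
    batch_size: int = 32,
) -> Iterator[Tuple[str, str]]:
    """Stream (document, topic) pairs without loading the whole corpus in memory.

    `predict_topics` is an assumption: any callable that takes a list of
    texts and returns one topic string per text, in the same order.
    """
    for batch in batched(documents, batch_size):
        topics = predict_topics(batch)
        if len(topics) != len(batch):
            raise ValueError("predict_topics must return one topic per document")
        yield from zip(batch, topics)


def placeholder_predict(batch: List[str]) -> List[str]:
    """Hypothetical stand-in for the model: uses the first word as the 'topic'."""
    return [text.split()[0].lower() if text.split() else "" for text in batch]
```

In practice `predict_topics` would wrap the fine-tuned model's generation call. Because `annotate_corpus` is a generator, results can be written out as they are produced, and `batch_size` can be tuned to the available memory.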