sakelariev
commited on
Commit
•
0441395
1
Parent(s):
d3cc9ca
update model card
Browse files
README.md
CHANGED
@@ -1,3 +1,21 @@
|
|
1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: mit
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
tags:
|
3 |
+
- spacy
|
4 |
+
- floret
|
5 |
+
- token-classification
|
6 |
+
language:
|
7 |
+
- bg
|
8 |
license: mit
|
9 |
---
|
10 |
+
Bulgarian word vectors for a Bulgarian Spacy model.
|
11 |
+
|
12 |
+
The model is trained on the Oscar 21.09 corpus and Bulgarian Wikipedia pages using floret with the following hyperparameters: `floret cbow -dim 300 -mode floret -bucket 200000 -minn 4 -maxn 5 -minCount 20 -neg 10 -hashCount 2 -lr 0.05 -thread 8`
|
13 |
+
|
14 |
+
| Feature | Description |
|
15 |
+
| --- | --- |
|
16 |
+
| **Name** | `bg_floret_vectors_lg` |
|
17 |
+
| **Version** | `1.0` |
|
18 |
+
| **Vectors** | 200000 keys (300 dimensions) |
|
19 |
+
| **Sources** | OSCAR Corpus 21.09 (Julien Abadji, Pedro Ortiz Suarez), Wikipedia (bgwiki-latest-pages-articles from June 11th) |
|
20 |
+
| **License** | `MIT` |
|
21 |
+
| **Author** | Ivaylo Sakelariev |
|