sakelariev commited on
Commit
0441395
1 Parent(s): d3cc9ca

update model card

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -1,3 +1,21 @@
1
  ---
 
 
 
 
 
 
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ tags:
3
+ - spacy
4
+ - floret
5
+ - token-classification
6
+ language:
7
+ - bg
8
  license: mit
9
  ---
10
+ Bulgarian word vectors for a Bulgarian Spacy model.
11
+
12
+ The model is trained on the Oscar 21.09 corpus and Bulgarian Wikipedia pages using floret with the following hyperparameters: `floret cbow -dim 300 -mode floret -bucket 200000 -minn 4 -maxn 5 -minCount 20 -neg 10 -hashCount 2 -lr 0.05 -thread 8`
13
+
14
+ | Feature | Description |
15
+ | --- | --- |
16
+ | **Name** | `bg_floret_vectors_lg` |
17
+ | **Version** | `1.0` |
18
+ | **Vectors** | 200000 keys (300 dimensions) |
19
+ | **Sources** | OSCAR Corpus 21.09 (Julien Abadji, Pedro Ortiz Suarez), Wikipedia (bgwiki-latest-pages-articles from June 11th) |
20
+ | **License** | `MIT` |
21
+ | **Author** | Ivaylo Sakelariev |