oroszgy commited on
Commit
dda2292
1 Parent(s): afc6096

Update spacy pipeline to 3.2.3

Browse files
README.md CHANGED
@@ -14,73 +14,73 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8500891266
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8384317862
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.844220216
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.969328676
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
- value: 0.9660749318
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
- value: 0.9291798258
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
- value: 0.9654578509
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
- value: 0.8102173426
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
- value: 0.7343303646
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
- value: 0.9766407119
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_lg` |
79
- | **Version** | `3.2.1b1` |
80
  | **spaCy** | `>=3.2.4,<3.3.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lemmatizer`, `parser`, `ner` |
83
- | **Vectors** | 1140008 keys, 1140008 unique vectors (300 dimensions) |
84
  | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[hunNERwiki](http://hlt.sztaki.hu/resources/hunnerwiki.html) (Eszter Simon, Dávid Márk Nemeskey (HLT Group, Budapest University of Technology and Economics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[Webcorpuswiki word2vec model](https://github.com/oroszgy/hunlp-resources/releases/tag/webcorpuswiki_word2vec_v0.1) (György Orosz) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
@@ -108,18 +108,18 @@ Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morpholog
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
- | `SENTS_P` | 97.56 |
112
- | `SENTS_R` | 97.77 |
113
- | `SENTS_F` | 97.66 |
114
- | `TAG_ACC` | 96.93 |
115
- | `POS_ACC` | 96.61 |
116
- | `MORPH_ACC` | 92.92 |
117
- | `MORPH_MICRO_P` | 96.82 |
118
- | `MORPH_MICRO_R` | 95.61 |
119
- | `MORPH_MICRO_F` | 96.21 |
120
- | `LEMMA_ACC` | 96.55 |
121
- | `DEP_UAS` | 81.02 |
122
- | `DEP_LAS` | 73.43 |
123
- | `ENTS_P` | 85.01 |
124
- | `ENTS_R` | 83.84 |
125
- | `ENTS_F` | 84.42 |
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8669194655
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8440576653
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8553358275
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9645437581
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9641609646
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
+ value: 0.9311895875
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9638312123
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
+ value: 0.8157554644
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
+ value: 0.7455189077
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
+ value: 0.9776785714
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_lg` |
79
+ | **Version** | `3.2.3` |
80
  | **spaCy** | `>=3.2.4,<3.3.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lemmatizer`, `parser`, `ner` |
83
+ | **Vectors** | 0 keys, 200000 unique vectors (300 dimensions) |
84
  | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[hunNERwiki](http://hlt.sztaki.hu/resources/hunnerwiki.html) (Eszter Simon, Dávid Márk Nemeskey (HLT Group, Budapest University of Technology and Economics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[Webcorpuswiki word2vec model](https://github.com/oroszgy/hunlp-resources/releases/tag/webcorpuswiki_word2vec_v0.1) (György Orosz) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
 
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
+ | `SENTS_P` | 0.00 |
112
+ | `SENTS_R` | 0.00 |
113
+ | `SENTS_F` | 0.00 |
114
+ | `TAG_ACC` | 96.17 |
115
+ | `POS_ACC` | 96.21 |
116
+ | `MORPH_ACC` | 92.98 |
117
+ | `MORPH_MICRO_P` | 96.80 |
118
+ | `MORPH_MICRO_R` | 95.85 |
119
+ | `MORPH_MICRO_F` | 96.32 |
120
+ | `LEMMA_ACC` | 96.01 |
121
+ | `DEP_UAS` | 75.78 |
122
+ | `DEP_LAS` | 68.82 |
123
+ | `ENTS_P` | 85.95 |
124
+ | `ENTS_R` | 85.53 |
125
+ | `ENTS_F` | 85.74 |
config.cfg CHANGED
@@ -1,7 +1,7 @@
1
  [paths]
2
- parser_model = "models/hu_core_news_lg-parser-3.2.1b1/model-best"
3
- lemmy_model = "models/lemmy-3.2.1b1.bin"
4
- ner_model = "models/hu_core_news_lg-ner_merged-3.2.1b1/model-best"
5
  train = null
6
  dev = null
7
  vectors = null
@@ -59,10 +59,10 @@ use_upper = true
59
  nO = null
60
 
61
  [components.ner.model.tok2vec]
62
- @architectures = "spacy.Tok2Vec.v1"
63
 
64
  [components.ner.model.tok2vec.embed]
65
- @architectures = "spacy.MultiHashEmbed.v1"
66
  width = 300
67
  attrs = ["LOWER","PREFIX","SUFFIX","SHAPE"]
68
  rows = [5000,2500,2500,2500]
@@ -130,10 +130,10 @@ upstream = "*"
130
  factory = "tok2vec"
131
 
132
  [components.tok2vec.model]
133
- @architectures = "spacy.Tok2Vec.v1"
134
 
135
  [components.tok2vec.model.embed]
136
- @architectures = "spacy.MultiHashEmbed.v1"
137
  width = 300
138
  attrs = ["LOWER","PREFIX","SUFFIX","SHAPE"]
139
  rows = [5000,2500,2500,2500]
 
1
  [paths]
2
+ parser_model = "models/hu_core_news_lg-parser-3.2.3/model-best"
3
+ lemmy_model = "models/lemmy-3.2.3.bin"
4
+ ner_model = "models/hu_core_news_lg-ner-3.2.3/model-best"
5
  train = null
6
  dev = null
7
  vectors = null
 
59
  nO = null
60
 
61
  [components.ner.model.tok2vec]
62
+ @architectures = "spacy.Tok2Vec.v2"
63
 
64
  [components.ner.model.tok2vec.embed]
65
+ @architectures = "spacy.MultiHashEmbed.v2"
66
  width = 300
67
  attrs = ["LOWER","PREFIX","SUFFIX","SHAPE"]
68
  rows = [5000,2500,2500,2500]
 
130
  factory = "tok2vec"
131
 
132
  [components.tok2vec.model]
133
+ @architectures = "spacy.Tok2Vec.v2"
134
 
135
  [components.tok2vec.model.embed]
136
+ @architectures = "spacy.MultiHashEmbed.v2"
137
  width = 300
138
  attrs = ["LOWER","PREFIX","SUFFIX","SHAPE"]
139
  rows = [5000,2500,2500,2500]
hu_core_news_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ebfc080595d1da2061f10af9b8727480859466641719a316ec6fea20c669c3f
3
- size 1417819569
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:459eb3ac9d28ffbcc116cb696829d2e592178ef88376588e71ce3d6e43331f7d
3
+ size 343229002
meta.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "lang":"hu",
3
  "name":"core_news_lg",
4
- "version":"3.2.1b1",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"[email protected]",
@@ -11,8 +11,8 @@
11
  "spacy_git_version":"b50fe5ec6",
12
  "vectors":{
13
  "width":300,
14
- "vectors":1140008,
15
- "keys":1140008,
16
  "name":"hu_core_news_lg.vectors"
17
  },
18
  "labels":{
@@ -1270,90 +1270,90 @@
1270
  "token_p":0.998565417,
1271
  "token_r":0.9993300153,
1272
  "token_f":0.9989475698,
1273
- "sents_p":0.9755555556,
1274
- "sents_r":0.9777282851,
1275
- "sents_f":0.9766407119,
1276
- "tag_acc":0.969328676,
1277
- "pos_acc":0.9660749318,
1278
- "morph_acc":0.9291798258,
1279
- "morph_micro_p":0.9681897302,
1280
- "morph_micro_r":0.9561237645,
1281
- "morph_micro_f":0.9621189189,
1282
  "morph_per_feat":{
1283
  "Definite":{
1284
- "p":0.9680703378,
1285
- "r":0.9762015866,
1286
- "f":0.9721189591
1287
  },
1288
  "PronType":{
1289
- "p":0.9768976898,
1290
- "r":0.9801324503,
1291
- "f":0.9785123967
1292
  },
1293
  "Case":{
1294
- "p":0.9713204541,
1295
- "r":0.9636435487,
1296
- "f":0.9674667725
1297
  },
1298
  "Degree":{
1299
- "p":0.9282511211,
1300
- "r":0.8610648918,
1301
- "f":0.8933966336
1302
  },
1303
  "Number":{
1304
- "p":0.979925776,
1305
- "r":0.9735210323,
1306
- "f":0.9767129046
1307
  },
1308
  "Mood":{
1309
- "p":0.935840708,
1310
- "r":0.9379157428,
1311
- "f":0.9368770764
1312
  },
1313
  "Person":{
1314
- "p":0.9611248966,
1315
- "r":0.9555921053,
1316
- "f":0.9583505155
1317
  },
1318
  "Tense":{
1319
- "p":0.9734513274,
1320
  "r":0.9723756906,
1321
- "f":0.9729132117
1322
  },
1323
  "VerbForm":{
1324
- "p":0.9663582843,
1325
- "r":0.9214113873,
1326
- "f":0.9433497537
1327
  },
1328
  "Voice":{
1329
- "p":0.9704081633,
1330
- "r":0.972392638,
1331
- "f":0.9713993871
1332
  },
1333
  "Number[psor]":{
1334
- "p":0.9623493976,
1335
- "r":0.9102564103,
1336
- "f":0.9355783309
1337
  },
1338
  "Person[psor]":{
1339
- "p":0.9623493976,
1340
- "r":0.9115549215,
1341
- "f":0.9362637363
1342
  },
1343
  "NumType":{
1344
- "p":0.9343065693,
1345
- "r":0.9365853659,
1346
- "f":0.9354445798
1347
  },
1348
  "Poss":{
1349
- "p":0.6,
1350
  "r":1.0,
1351
- "f":0.75
1352
  },
1353
  "Reflex":{
1354
  "p":1.0,
1355
- "r":0.875,
1356
- "f":0.9333333333
1357
  },
1358
  "Aspect":{
1359
  "p":0.0,
@@ -1366,114 +1366,114 @@
1366
  "f":0.0
1367
  }
1368
  },
1369
- "lemma_acc":0.9654578509,
1370
- "dep_uas":0.8102173426,
1371
- "dep_las":0.7343303646,
1372
  "dep_las_per_type":{
1373
  "det":{
1374
- "p":0.865248227,
1375
- "r":0.8742038217,
1376
- "f":0.8697029703
1377
  },
1378
  "amod:att":{
1379
- "p":0.8596491228,
1380
- "r":0.8413736713,
1381
- "f":0.8504132231
1382
  },
1383
  "nsubj":{
1384
- "p":0.6852941176,
1385
  "r":0.728125,
1386
- "f":0.7060606061
1387
  },
1388
  "advmod:mode":{
1389
- "p":0.6179487179,
1390
- "r":0.5906862745,
1391
- "f":0.6040100251
1392
  },
1393
  "nmod:att":{
1394
- "p":0.7483552632,
1395
- "r":0.7711864407,
1396
- "f":0.7595993322
1397
  },
1398
  "obl":{
1399
- "p":0.7714016933,
1400
- "r":0.7380738074,
1401
- "f":0.7543698252
1402
  },
1403
  "obj":{
1404
- "p":0.8793103448,
1405
- "r":0.802247191,
1406
- "f":0.839012926
1407
  },
1408
  "root":{
1409
- "p":0.78,
1410
  "r":0.7817371938,
1411
- "f":0.7808676307
1412
  },
1413
  "cc":{
1414
- "p":0.646,
1415
- "r":0.68,
1416
- "f":0.6625641026
1417
  },
1418
  "conj":{
1419
- "p":0.4508547009,
1420
- "r":0.4395833333,
1421
- "f":0.4451476793
1422
  },
1423
  "advmod":{
1424
- "p":0.79,
1425
- "r":0.8315789474,
1426
- "f":0.8102564103
1427
  },
1428
  "flat:name":{
1429
- "p":0.780876494,
1430
- "r":0.9158878505,
1431
- "f":0.8430107527
1432
  },
1433
  "appos":{
1434
- "p":0.2482269504,
1435
- "r":0.3723404255,
1436
- "f":0.2978723404
1437
  },
1438
  "advcl":{
1439
- "p":0.2773722628,
1440
- "r":0.387755102,
1441
- "f":0.3234042553
1442
  },
1443
  "advmod:tlocy":{
1444
- "p":0.7081545064,
1445
- "r":0.7173913043,
1446
- "f":0.7127429806
1447
  },
1448
  "ccomp:obj":{
1449
- "p":0.2608695652,
1450
- "r":0.3636363636,
1451
- "f":0.3037974684
1452
  },
1453
  "mark":{
1454
- "p":0.7810650888,
1455
- "r":0.835443038,
1456
- "f":0.8073394495
1457
  },
1458
  "compound:preverb":{
1459
- "p":0.9099099099,
1460
- "r":0.9266055046,
1461
- "f":0.9181818182
1462
  },
1463
  "advmod:locy":{
1464
- "p":0.72,
1465
- "r":0.5625,
1466
- "f":0.6315789474
1467
  },
1468
  "cop":{
1469
- "p":0.7857142857,
1470
- "r":0.5365853659,
1471
- "f":0.6376811594
1472
  },
1473
  "nmod:obl":{
1474
- "p":0.243902439,
1475
- "r":0.25,
1476
- "f":0.2469135802
1477
  },
1478
  "advmod:to":{
1479
  "p":0.0,
@@ -1481,69 +1481,69 @@
1481
  "f":0.0
1482
  },
1483
  "obj:lvc":{
1484
- "p":1.0,
1485
- "r":0.0833333333,
1486
- "f":0.1538461538
1487
  },
1488
  "ccomp:obl":{
1489
- "p":0.5384615385,
1490
- "r":0.21875,
1491
- "f":0.3111111111
1492
  },
1493
  "iobj":{
1494
- "p":0.2222222222,
1495
- "r":0.2666666667,
1496
- "f":0.2424242424
1497
  },
1498
  "case":{
1499
- "p":0.9479166667,
1500
- "r":0.9285714286,
1501
- "f":0.9381443299
1502
  },
1503
  "csubj":{
1504
- "p":0.4444444444,
1505
- "r":0.3243243243,
1506
- "f":0.375
1507
  },
1508
  "parataxis":{
1509
- "p":0.0,
1510
- "r":0.0,
1511
- "f":0.0
1512
  },
1513
  "xcomp":{
1514
- "p":0.8648648649,
1515
- "r":0.8648648649,
1516
- "f":0.8648648649
1517
  },
1518
  "nummod":{
1519
- "p":0.5,
1520
- "r":0.5483870968,
1521
- "f":0.5230769231
1522
- },
1523
- "acl":{
1524
- "p":0.4090909091,
1525
- "r":0.25,
1526
- "f":0.3103448276
1527
  },
1528
- "dep":{
1529
  "p":0.0,
1530
  "r":0.0,
1531
  "f":0.0
1532
  },
 
 
 
 
 
1533
  "advmod:tto":{
1534
- "p":0.5,
1535
- "r":0.4,
1536
- "f":0.4444444444
1537
  },
1538
  "nmod":{
1539
- "p":0.4,
1540
- "r":0.1818181818,
1541
- "f":0.25
1542
  },
1543
  "aux":{
1544
- "p":0.8888888889,
1545
- "r":0.6666666667,
1546
- "f":0.7619047619
1547
  },
1548
  "advmod:tfrom":{
1549
  "p":0.0,
@@ -1556,9 +1556,9 @@
1556
  "f":0.0
1557
  },
1558
  "compound":{
1559
- "p":0.9512195122,
1560
- "r":0.975,
1561
- "f":0.962962963
1562
  },
1563
  "obl:lvc":{
1564
  "p":0.0,
@@ -1570,11 +1570,6 @@
1570
  "r":0.0,
1571
  "f":0.0
1572
  },
1573
- "ccomp:pred":{
1574
- "p":0.0,
1575
- "r":0.0,
1576
- "f":0.0
1577
- },
1578
  "nsubj:lvc":{
1579
  "p":0.0,
1580
  "r":0.0,
@@ -1585,43 +1580,48 @@
1585
  "r":0.1666666667,
1586
  "f":0.2857142857
1587
  },
1588
- "ccomp":{
1589
  "p":0.0,
1590
  "r":0.0,
1591
  "f":0.0
1592
  },
1593
  "advmod:que":{
1594
  "p":1.0,
1595
- "r":0.75,
1596
- "f":0.8571428571
 
 
 
 
 
1597
  }
1598
  },
1599
- "ents_p":0.8500891266,
1600
- "ents_r":0.8384317862,
1601
- "ents_f":0.844220216,
1602
  "ents_per_type":{
1603
  "ORG":{
1604
- "p":0.8754545455,
1605
- "r":0.892906815,
1606
- "f":0.8840945605
1607
  },
1608
  "PER":{
1609
- "p":0.8946406821,
1610
- "r":0.8775388292,
1611
- "f":0.8860072376
1612
  },
1613
  "LOC":{
1614
- "p":0.8645454545,
1615
- "r":0.8255208333,
1616
- "f":0.8445825933
1617
  },
1618
  "MISC":{
1619
- "p":0.6332335329,
1620
- "r":0.6,
1621
- "f":0.6161689731
1622
  }
1623
  },
1624
- "speed":1078.2164666366
1625
  },
1626
  "sources":[
1627
  {
 
1
  {
2
  "lang":"hu",
3
  "name":"core_news_lg",
4
+ "version":"3.2.3",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"[email protected]",
 
11
  "spacy_git_version":"b50fe5ec6",
12
  "vectors":{
13
  "width":300,
14
+ "vectors":200000,
15
+ "keys":0,
16
  "name":"hu_core_news_lg.vectors"
17
  },
18
  "labels":{
 
1270
  "token_p":0.998565417,
1271
  "token_r":0.9993300153,
1272
  "token_f":0.9989475698,
1273
+ "sents_p":0.9798657718,
1274
+ "sents_r":0.9755011136,
1275
+ "sents_f":0.9776785714,
1276
+ "tag_acc":0.9645437581,
1277
+ "pos_acc":0.9641609646,
1278
+ "morph_acc":0.9311895875,
1279
+ "morph_micro_p":0.9655590694,
1280
+ "morph_micro_r":0.9577997422,
1281
+ "morph_micro_f":0.9616637542,
1282
  "morph_per_feat":{
1283
  "Definite":{
1284
+ "p":0.9580674567,
1285
+ "r":0.9808679421,
1286
+ "f":0.9693336408
1287
  },
1288
  "PronType":{
1289
+ "p":0.9713498623,
1290
+ "r":0.9729580574,
1291
+ "f":0.9721532947
1292
  },
1293
  "Case":{
1294
+ "p":0.9741638294,
1295
+ "r":0.9610748864,
1296
+ "f":0.9675750945
1297
  },
1298
  "Degree":{
1299
+ "p":0.9223907226,
1300
+ "r":0.8602329451,
1301
+ "f":0.8902281533
1302
  },
1303
  "Number":{
1304
+ "p":0.9846153846,
1305
+ "r":0.9760348584,
1306
+ "f":0.9803063457
1307
  },
1308
  "Mood":{
1309
+ "p":0.9292709467,
1310
+ "r":0.9467849224,
1311
+ "f":0.9379461834
1312
  },
1313
  "Person":{
1314
+ "p":0.953526971,
1315
+ "r":0.9449013158,
1316
+ "f":0.9491945477
1317
  },
1318
  "Tense":{
1319
+ "p":0.957562568,
1320
  "r":0.9723756906,
1321
+ "f":0.9649122807
1322
  },
1323
  "VerbForm":{
1324
+ "p":0.9448499594,
1325
+ "r":0.9342421812,
1326
+ "f":0.939516129
1327
  },
1328
  "Voice":{
1329
+ "p":0.9547738693,
1330
+ "r":0.9713701431,
1331
+ "f":0.9630005068
1332
  },
1333
  "Number[psor]":{
1334
+ "p":0.9675994109,
1335
+ "r":0.9358974359,
1336
+ "f":0.9514844316
1337
  },
1338
  "Person[psor]":{
1339
+ "p":0.9690721649,
1340
+ "r":0.9386590585,
1341
+ "f":0.9536231884
1342
  },
1343
  "NumType":{
1344
+ "p":0.9334975369,
1345
+ "r":0.9243902439,
1346
+ "f":0.9289215686
1347
  },
1348
  "Poss":{
1349
+ "p":0.75,
1350
  "r":1.0,
1351
+ "f":0.8571428571
1352
  },
1353
  "Reflex":{
1354
  "p":1.0,
1355
+ "r":0.75,
1356
+ "f":0.8571428571
1357
  },
1358
  "Aspect":{
1359
  "p":0.0,
 
1366
  "f":0.0
1367
  }
1368
  },
1369
+ "lemma_acc":0.9638312123,
1370
+ "dep_uas":0.8157554644,
1371
+ "dep_las":0.7455189077,
1372
  "dep_las_per_type":{
1373
  "det":{
1374
+ "p":0.8627906977,
1375
+ "r":0.8861464968,
1376
+ "f":0.8743126473
1377
  },
1378
  "amod:att":{
1379
+ "p":0.8473029046,
1380
+ "r":0.8348323794,
1381
+ "f":0.8410214168
1382
  },
1383
  "nsubj":{
1384
+ "p":0.7456,
1385
  "r":0.728125,
1386
+ "f":0.7367588933
1387
  },
1388
  "advmod:mode":{
1389
+ "p":0.5935162095,
1390
+ "r":0.5833333333,
1391
+ "f":0.5883807169
1392
  },
1393
  "nmod:att":{
1394
+ "p":0.75,
1395
+ "r":0.7728813559,
1396
+ "f":0.7612687813
1397
  },
1398
  "obl":{
1399
+ "p":0.7294117647,
1400
+ "r":0.7812781278,
1401
+ "f":0.754454585
1402
  },
1403
  "obj":{
1404
+ "p":0.8410596026,
1405
+ "r":0.8561797753,
1406
+ "f":0.8485523385
1407
  },
1408
  "root":{
1409
+ "p":0.7852348993,
1410
  "r":0.7817371938,
1411
+ "f":0.7834821429
1412
  },
1413
  "cc":{
1414
+ "p":0.7077922078,
1415
+ "r":0.6884210526,
1416
+ "f":0.6979722519
1417
  },
1418
  "conj":{
1419
+ "p":0.4870259481,
1420
+ "r":0.5083333333,
1421
+ "f":0.49745158
1422
  },
1423
  "advmod":{
1424
+ "p":0.7920792079,
1425
+ "r":0.8421052632,
1426
+ "f":0.8163265306
1427
  },
1428
  "flat:name":{
1429
+ "p":0.8962264151,
1430
+ "r":0.8878504673,
1431
+ "f":0.8920187793
1432
  },
1433
  "appos":{
1434
+ "p":0.4333333333,
1435
+ "r":0.2765957447,
1436
+ "f":0.3376623377
1437
  },
1438
  "advcl":{
1439
+ "p":0.3648648649,
1440
+ "r":0.2755102041,
1441
+ "f":0.3139534884
1442
  },
1443
  "advmod:tlocy":{
1444
+ "p":0.7450980392,
1445
+ "r":0.6608695652,
1446
+ "f":0.7004608295
1447
  },
1448
  "ccomp:obj":{
1449
+ "p":0.2037037037,
1450
+ "r":0.3333333333,
1451
+ "f":0.2528735632
1452
  },
1453
  "mark":{
1454
+ "p":0.8136645963,
1455
+ "r":0.8291139241,
1456
+ "f":0.8213166144
1457
  },
1458
  "compound:preverb":{
1459
+ "p":0.9326923077,
1460
+ "r":0.8899082569,
1461
+ "f":0.9107981221
1462
  },
1463
  "advmod:locy":{
1464
+ "p":0.9333333333,
1465
+ "r":0.4375,
1466
+ "f":0.5957446809
1467
  },
1468
  "cop":{
1469
+ "p":0.8666666667,
1470
+ "r":0.6341463415,
1471
+ "f":0.7323943662
1472
  },
1473
  "nmod:obl":{
1474
+ "p":0.25,
1475
+ "r":0.05,
1476
+ "f":0.0833333333
1477
  },
1478
  "advmod:to":{
1479
  "p":0.0,
 
1481
  "f":0.0
1482
  },
1483
  "obj:lvc":{
1484
+ "p":0.0,
1485
+ "r":0.0,
1486
+ "f":0.0
1487
  },
1488
  "ccomp:obl":{
1489
+ "p":0.4166666667,
1490
+ "r":0.3125,
1491
+ "f":0.3571428571
1492
  },
1493
  "iobj":{
1494
+ "p":0.25,
1495
+ "r":0.2,
1496
+ "f":0.2222222222
1497
  },
1498
  "case":{
1499
+ "p":0.915,
1500
+ "r":0.9336734694,
1501
+ "f":0.9242424242
1502
  },
1503
  "csubj":{
1504
+ "p":0.5238095238,
1505
+ "r":0.2972972973,
1506
+ "f":0.3793103448
1507
  },
1508
  "parataxis":{
1509
+ "p":0.0909090909,
1510
+ "r":0.0273972603,
1511
+ "f":0.0421052632
1512
  },
1513
  "xcomp":{
1514
+ "p":0.8904109589,
1515
+ "r":0.8783783784,
1516
+ "f":0.8843537415
1517
  },
1518
  "nummod":{
1519
+ "p":0.515625,
1520
+ "r":0.7096774194,
1521
+ "f":0.5972850679
 
 
 
 
 
1522
  },
1523
+ "ccomp":{
1524
  "p":0.0,
1525
  "r":0.0,
1526
  "f":0.0
1527
  },
1528
+ "acl":{
1529
+ "p":0.3125,
1530
+ "r":0.2777777778,
1531
+ "f":0.2941176471
1532
+ },
1533
  "advmod:tto":{
1534
+ "p":1.0,
1535
+ "r":0.2,
1536
+ "f":0.3333333333
1537
  },
1538
  "nmod":{
1539
+ "p":0.0,
1540
+ "r":0.0,
1541
+ "f":0.0
1542
  },
1543
  "aux":{
1544
+ "p":0.8333333333,
1545
+ "r":0.8333333333,
1546
+ "f":0.8333333333
1547
  },
1548
  "advmod:tfrom":{
1549
  "p":0.0,
 
1556
  "f":0.0
1557
  },
1558
  "compound":{
1559
+ "p":0.9047619048,
1560
+ "r":0.95,
1561
+ "f":0.9268292683
1562
  },
1563
  "obl:lvc":{
1564
  "p":0.0,
 
1570
  "r":0.0,
1571
  "f":0.0
1572
  },
 
 
 
 
 
1573
  "nsubj:lvc":{
1574
  "p":0.0,
1575
  "r":0.0,
 
1580
  "r":0.1666666667,
1581
  "f":0.2857142857
1582
  },
1583
+ "dep":{
1584
  "p":0.0,
1585
  "r":0.0,
1586
  "f":0.0
1587
  },
1588
  "advmod:que":{
1589
  "p":1.0,
1590
+ "r":0.5,
1591
+ "f":0.6666666667
1592
+ },
1593
+ "ccomp:pred":{
1594
+ "p":0.0,
1595
+ "r":0.0,
1596
+ "f":0.0
1597
  }
1598
  },
1599
+ "ents_p":0.8669194655,
1600
+ "ents_r":0.8440576653,
1601
+ "ents_f":0.8553358275,
1602
  "ents_per_type":{
1603
  "ORG":{
1604
+ "p":0.8834161771,
1605
+ "r":0.906351414,
1606
+ "f":0.8947368421
1607
  },
1608
  "PER":{
1609
+ "p":0.9008674102,
1610
+ "r":0.8685782557,
1611
+ "f":0.8844282238
1612
  },
1613
  "LOC":{
1614
+ "p":0.8708520179,
1615
+ "r":0.8428819444,
1616
+ "f":0.8566387296
1617
  },
1618
  "MISC":{
1619
+ "p":0.7063758389,
1620
+ "r":0.5971631206,
1621
+ "f":0.6471944658
1622
  }
1623
  },
1624
+ "speed":894.0900391347
1625
  },
1626
  "sources":[
1627
  {
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:472afd37d21214b6eb57d995ee14cb16ad6d28ee088e51f09fcbb029299fb1ae
3
  size 1383846
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e9e4211d09f80110ffe45463c9d72358a913fc2bd9f1765db7792f14c60627b
3
  size 1383846
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bcb9c4767becd8af8b2d6964d7e47743566ded371e33f6b4d8f709229565d2ae
3
- size 56989356
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4ef8b733133cabae75a368301a774b601b074b79981e405af6e5fffded26e2e
3
+ size 56989063
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c1ae6c71bccee15a0ecb7c4970b2aea5dbf3a3b684af076c60f531a80e6e2558
3
  size 26010735
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce27f85ef0da69d646299ed5f09b8f2e48e21443d786f22b78166c1864dbca86
3
  size 26010735
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cd3ac677ce30e7b8b7d3a071f61fabf4e55d12b66f1a16911517f3fab3eca899
3
  size 2845
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61a76c1078f7ee221b9b741fcfdac2d61b0e6a48b3002892c417b85d0ad8241b
3
  size 2845
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:97c1483a1a58789653846504abd5dc03401f765b466b890ec97da4cad6150da6
3
  size 20905
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e8a09e671890eb057e862963417d8276101768d5e0278a634de7b240769231b
3
  size 20905
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c99df6ba6c05cd2f739ee00f1cd9f2e2ffa929361391b4f626f5d5ce2ce0bc02
3
- size 56806592
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ec825bb232bfcf6c2ace443f8422ca22805d57aea26e1eb299cdea3ec537ec9
3
+ size 56806299
vocab/key2row CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9098bf22d0ef2a0b4f27c222ad591c39028b01ec86cd4b75b1824f7ad30c5dfc
3
- size 15828427
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a422e3379328d9a3b0bc3fe201d44621d8162bdf11c49d762138c545f1ca519a
3
- size 30752589
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bca5adfb4991079eff285df8aba96364adb2788b542bda5a91127d16ed748fad
3
+ size 6355378
vocab/vectors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:319277b84318aa3476e8eb618f177dc01893e3e24179c66607e7e22e13ad4057
3
- size 1368009728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0d88c58b99f50d2c4be7a6fa3712da8d0c7bc4d3c6749d718a09059288ef917d
3
+ size 240000128
vocab/vectors.cfg CHANGED
@@ -1,3 +1,9 @@
1
  {
2
- "mode":"default"
 
 
 
 
 
 
3
  }
 
1
  {
2
+ "mode":"floret",
3
+ "minn":4,
4
+ "maxn":6,
5
+ "hash_count":2,
6
+ "hash_seed":2166136261,
7
+ "bow":"<",
8
+ "eow":">"
9
  }