osanseviero committed on
Commit
5516623
·
1 Parent(s): 1b5232e

Update spaCy pipeline

LICENSES_SOURCES CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
 
 
 
- # UD Romanian RRT v2.5
+ # UD Romanian RRT v2.8
 
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
  * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - token-classification
5
  language:
6
  - ro
7
- license: CC-BY-SA-4.0
8
  model-index:
9
  - name: ro_core_news_md
10
  results:
@@ -14,47 +14,47 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7524828113
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7568190549
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7546447041
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9730009254
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
- value: 0.9598393574
38
  - name: SENTER Recall
39
  type: recall
40
- value: 0.9534574468
41
  - name: SENTER F Score
42
  type: f_score
43
- value: 0.9566377585
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8902760351
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
- value: 0.8902760351
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_md
60
 
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_md` |
66
- | **Version** | `3.1.0` |
67
- | **spaCy** | `>=3.1.0,<3.2.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
71
- | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.5](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
76
 
77
  <details>
78
 
79
- <summary>View label scheme (534 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
- | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-p-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrln`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, 
`Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps2ms-s`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, 
`Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp1s`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp-sr`, `Yr` |
84
- | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:agent`, `nmod:pmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
- | `TAG_ACC` | 97.30 |
96
- | `POS_ACC` | 96.43 |
97
- | `MORPH_ACC` | 97.43 |
98
- | `LEMMA_ACC` | 81.87 |
99
- | `DEP_UAS` | 89.03 |
100
- | `DEP_LAS` | 84.43 |
101
- | `ENTS_P` | 75.25 |
102
- | `ENTS_R` | 75.68 |
103
- | `ENTS_F` | 75.46 |
104
- | `SENTS_P` | 95.98 |
105
- | `SENTS_R` | 95.35 |
106
- | `SENTS_F` | 95.66 |
 
4
  - token-classification
5
  language:
6
  - ro
7
+ license: cc-by-sa-4.0
8
  model-index:
9
  - name: ro_core_news_md
10
  results:
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7485865058
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.7629658087
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7557077626
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
+ value: 0.9619726156
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
+ value: 0.9626168224
38
  - name: SENTER Recall
39
  type: recall
40
+ value: 0.9587765957
41
  - name: SENTER F Score
42
  type: f_score
43
+ value: 0.9606928714
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
+ value: 0.8893350063
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
+ value: 0.8893350063
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_md
60
 
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_md` |
66
+ | **Version** | `3.2.0` |
67
+ | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
71
+ | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
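
The **spaCy** row above pins this package to `>=3.2.0,<3.3.0`. As a minimal sketch of what that range means, the following standard-library check compares version tuples. The helper names `parse_version` and `in_supported_range` are hypothetical (not part of spaCy), and the parser assumes plain `X.Y.Z` release strings with no pre-release or dev suffix.

```python
def parse_version(text):
    """Turn 'X.Y.Z' into a tuple of ints so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def in_supported_range(version, lower="3.2.0", upper="3.3.0"):
    """Return True when lower <= version < upper (bounds taken from the README row)."""
    return parse_version(lower) <= parse_version(version) < parse_version(upper)


if __name__ == "__main__":
    # Hypothetical candidate versions, chosen only to exercise both bounds.
    for candidate in ("3.1.0", "3.2.0", "3.2.4", "3.3.0"):
        print(candidate, in_supported_range(candidate))
```

For real installs, a dedicated specifier parser such as `packaging.specifiers.SpecifierSet` is likely the safer choice, since it also handles pre-release suffixes.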
74
 
 
76
 
77
  <details>
78
 
79
+ <summary>View label scheme (541 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
+ | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, 
`Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, 
`RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
84
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
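
The label-scheme rows are long enough that changes are hard to spot by eye. A minimal sketch, assuming the `parser` rows of the old (3.1.0) and new (3.2.0) README are pasted in as plain strings, that reports which labels were dropped and which were introduced. The function name `label_changes` is hypothetical.

```python
# Parser labels copied from the old and new README rows in this diff.
OLD_PARSER_ROW = """
ROOT, acl, advcl, advcl:tcl, advmod, advmod:tmod, amod, appos, aux, aux:pass,
case, cc, cc:preconj, ccomp, ccomp:pmod, compound, conj, cop, csubj, csubj:pass,
dep, det, expl, expl:impers, expl:pass, expl:poss, expl:pv, fixed, flat, goeswith,
iobj, mark, nmod, nmod:agent, nmod:pmod, nmod:tmod, nsubj, nsubj:pass, nummod,
obj, obl, orphan, parataxis, punct, vocative, xcomp
"""
NEW_PARSER_ROW = """
ROOT, acl, advcl, advcl:tcl, advmod, advmod:tmod, amod, appos, aux, aux:pass,
case, cc, cc:preconj, ccomp, ccomp:pmod, compound, conj, cop, csubj, csubj:pass,
dep, det, expl, expl:impers, expl:pass, expl:poss, expl:pv, fixed, flat, goeswith,
iobj, mark, nmod, nmod:tmod, nsubj, nsubj:pass, nummod, obj, obl, obl:agent,
obl:pmod, orphan, parataxis, punct, vocative, xcomp
"""


def parse_labels(row):
    """Split a comma-separated label row into a list of clean label strings."""
    return [label.strip() for label in row.split(",") if label.strip()]


def label_changes(old, new):
    """Return (removed, added) label sets between two label lists."""
    old_set, new_set = set(old), set(new)
    return old_set - new_set, new_set - old_set


OLD_PARSER = parse_labels(OLD_PARSER_ROW)
NEW_PARSER = parse_labels(NEW_PARSER_ROW)
removed, added = label_changes(OLD_PARSER, NEW_PARSER)

if __name__ == "__main__":
    print("removed:", sorted(removed))
    print("added:  ", sorted(added))
```

The same helper can be pointed at the `tagger` rows to list the tag changes behind the move from 534 to 541 labels.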
87
 
 
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
+ | `TOKEN_P` | 99.67 |
96
+ | `TOKEN_R` | 99.57 |
97
+ | `TOKEN_F` | 99.59 |
98
+ | `TAG_ACC` | 96.20 |
99
+ | `SENTS_P` | 96.26 |
100
+ | `SENTS_R` | 95.88 |
101
+ | `SENTS_F` | 96.07 |
102
+ | `DEP_UAS` | 88.93 |
103
+ | `DEP_LAS` | 83.88 |
104
+ | `POS_ACC` | 93.82 |
105
+ | `MORPH_ACC` | 94.69 |
106
+ | `MORPH_MICRO_P` | 98.71 |
107
+ | `MORPH_MICRO_R` | 95.58 |
108
+ | `MORPH_MICRO_F` | 96.84 |
109
+ | `LEMMA_ACC` | 81.83 |
110
+ | `ENTS_P` | 74.86 |
111
+ | `ENTS_R` | 76.30 |
112
+ | `ENTS_F` | 75.57 |
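
The F scores in this table can be cross-checked against the precision and recall values in the YAML metadata. A minimal sketch, assuming the reported F score is the harmonic mean of precision and recall (the usual F1 definition); the function name `f_score` is hypothetical and the inputs are the new NER and SENTER values from this diff.

```python
def f_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F) taken from the updated README metadata.
REPORTED = {
    "NER": (0.7485865058, 0.7629658087, 0.7557077626),
    "SENTER": (0.9626168224, 0.9587765957, 0.9606928714),
}

if __name__ == "__main__":
    for name, (p, r, reported_f) in REPORTED.items():
        computed = f_score(p, r)
        # Print both so any mismatch beyond rounding is easy to see.
        print(f"{name}: computed={computed:.10f} reported={reported_f:.10f}")
```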
accuracy.json CHANGED
@@ -1,185 +1,64 @@
1
  {
2
  "token_acc": 0.9990029326,
3
- "tag_acc": 0.9730009254,
4
- "pos_acc": 0.9642915465,
5
- "morph_acc": 0.9742780152,
6
- "lemma_acc": 0.8186589263,
7
- "dep_uas": 0.8902760351,
8
- "dep_las": 0.8442910916,
9
- "ents_p": 0.7524828113,
10
- "ents_r": 0.7568190549,
11
- "ents_f": 0.7546447041,
12
- "sents_p": 0.9598393574,
13
- "sents_r": 0.9534574468,
14
- "sents_f": 0.9566377585,
15
- "speed": 8493.8160932984,
16
- "morph_per_feat": {
17
- "AdpType": {
18
- "p": 0.997492687,
19
- "r": 0.9933416563,
20
- "f": 0.995412844
21
- },
22
- "Case": {
23
- "p": 0.9877617623,
24
- "r": 0.9809588116,
25
- "f": 0.9843485331
26
- },
27
- "Variant": {
28
- "p": 0.9846153846,
29
- "r": 0.9241877256,
30
- "f": 0.9534450652
31
- },
32
- "Gender": {
33
- "p": 0.9798818233,
34
- "r": 0.9754901961,
35
- "f": 0.977681078
36
- },
37
- "Number": {
38
- "p": 0.9811536265,
39
- "r": 0.9754712696,
40
- "f": 0.9783041968
41
- },
42
- "PronType": {
43
- "p": 0.9943589744,
44
- "r": 0.9892857143,
45
- "f": 0.9918158568
46
- },
47
- "Definite": {
48
- "p": 0.9773605743,
49
- "r": 0.9711046086,
50
- "f": 0.9742225484
51
- },
52
- "Degree": {
53
- "p": 0.9527845036,
54
- "r": 0.9369047619,
55
- "f": 0.9447779112
56
- },
57
- "Polarity": {
58
- "p": 0.9884318766,
59
- "r": 0.9846350832,
60
- "f": 0.9865298268
61
- },
62
- "Mood": {
63
- "p": 0.9760869565,
64
- "r": 0.9621428571,
65
- "f": 0.9690647482
66
- },
67
- "Person": {
68
- "p": 0.9822419534,
69
- "r": 0.9696859021,
70
- "f": 0.9759235435
71
- },
72
- "Tense": {
73
- "p": 0.9691497366,
74
- "r": 0.9491525424,
75
- "f": 0.9590469099
76
- },
77
- "VerbForm": {
78
- "p": 0.9661582459,
79
- "r": 0.9579395085,
80
- "f": 0.9620313242
81
- },
82
- "NumForm": {
83
- "p": 0.9926650367,
84
- "r": 0.9902439024,
85
- "f": 0.9914529915
86
- },
87
- "NumType": {
88
- "p": 0.9951807229,
89
- "r": 0.9904076739,
90
- "f": 0.9927884615
91
- },
92
- "PartType": {
93
- "p": 0.9473684211,
94
- "r": 0.9,
95
- "f": 0.9230769231
96
- },
97
- "Strength": {
98
- "p": 0.9931623932,
99
- "r": 0.9781144781,
100
- "f": 0.9855810008
101
- },
102
- "Reflex": {
103
- "p": 0.9969135802,
104
- "r": 0.990797546,
105
- "f": 0.9938461538
106
- },
107
- "Poss": {
108
- "p": 0.9826989619,
109
- "r": 0.993006993,
110
- "f": 0.987826087
111
- },
112
- "Position": {
113
- "p": 0.9791666667,
114
- "r": 0.9724137931,
115
- "f": 0.9757785467
116
- },
117
- "Number[psor]": {
118
- "p": 0.9436619718,
119
- "r": 0.9710144928,
120
- "f": 0.9571428571
121
- },
122
- "Abbr": {
123
- "p": 0.9625,
124
- "r": 0.9058823529,
125
- "f": 0.9333333333
126
- },
127
- "Foreign": {
128
- "p": 0.0,
129
- "r": 0.0,
130
- "f": 0.0
131
- }
132
- },
133
  "dep_las_per_type": {
134
  "case": {
135
- "p": 0.9279192696,
136
- "r": 0.9410331384,
137
- "f": 0.934430196
138
  },
139
  "det": {
140
- "p": 0.9426751592,
141
- "r": 0.9736842105,
142
- "f": 0.9579288026
143
  },
144
  "nmod:tmod": {
145
- "p": 0.3333333333,
146
- "r": 0.023255814,
147
- "f": 0.0434782609
148
  },
149
  "amod": {
150
- "p": 0.8767985612,
151
- "r": 0.8847549909,
152
- "f": 0.8807588076
153
  },
154
  "cc": {
155
- "p": 0.8734693878,
156
- "r": 0.8953974895,
157
- "f": 0.8842975207
158
  },
159
  "conj": {
160
- "p": 0.6020864382,
161
- "r": 0.6102719033,
162
- "f": 0.6061515379
163
  },
164
  "nmod": {
165
- "p": 0.7827130852,
166
- "r": 0.8242730721,
167
- "f": 0.802955665
168
  },
169
  "mark": {
170
- "p": 0.8881578947,
171
- "r": 0.9101123596,
172
- "f": 0.8990011099
173
  },
174
  "fixed": {
175
- "p": 0.8504273504,
176
- "r": 0.6945898778,
177
- "f": 0.7646493756
178
  },
179
  "nsubj": {
180
- "p": 0.8195386703,
181
- "r": 0.7674714104,
182
- "f": 0.7926509186
183
  },
184
  "advcl:tcl": {
185
  "p": 0.0,
@@ -187,84 +66,84 @@
187
  "f": 0.0
188
  },
189
  "obj": {
190
- "p": 0.7511811024,
191
- "r": 0.8139931741,
192
- "f": 0.7813267813
193
  },
194
  "nummod": {
195
- "p": 0.9028213166,
196
- "r": 0.8861538462,
197
- "f": 0.8944099379
198
  },
199
  "flat": {
200
- "p": 0.765625,
201
- "r": 0.7,
202
- "f": 0.7313432836
203
  },
204
  "obl": {
205
- "p": 0.6436548223,
206
- "r": 0.7196367764,
207
- "f": 0.679528403
208
  },
209
- "nmod:pmod": {
210
- "p": 0.5454545455,
211
- "r": 0.1384615385,
212
- "f": 0.2208588957
213
  },
214
  "acl": {
215
- "p": 0.6765498652,
216
- "r": 0.7150997151,
217
- "f": 0.6952908587
218
  },
219
  "advmod": {
220
- "p": 0.7627785059,
221
- "r": 0.75,
222
- "f": 0.7563352827
223
  },
224
  "expl:pv": {
225
- "p": 0.7788944724,
226
- "r": 0.8288770053,
227
- "f": 0.8031088083
228
  },
229
  "root": {
230
- "p": 0.9196787149,
231
- "r": 0.9135638298,
232
- "f": 0.916611074
233
  },
234
  "advcl": {
235
- "p": 0.5634920635,
236
- "r": 0.5772357724,
237
- "f": 0.5702811245
238
  },
239
  "iobj": {
240
- "p": 0.7578125,
241
- "r": 0.6554054054,
242
- "f": 0.7028985507
243
  },
244
  "ccomp": {
245
- "p": 0.7272727273,
246
- "r": 0.808988764,
247
- "f": 0.7659574468
248
  },
249
  "goeswith": {
250
- "p": 0.7,
251
- "r": 0.5833333333,
252
- "f": 0.6363636364
253
  },
254
  "parataxis": {
255
- "p": 0.7553191489,
256
- "r": 0.5419847328,
257
- "f": 0.6311111111
258
  },
259
  "expl:poss": {
260
- "p": 0.6666666667,
261
- "r": 0.6976744186,
262
- "f": 0.6818181818
263
  },
264
  "cop": {
265
- "p": 0.7607361963,
266
- "r": 0.7654320988,
267
- "f": 0.7630769231
268
  },
269
  "cc:preconj": {
270
  "p": 0.0,
@@ -272,54 +151,49 @@
272
  "f": 0.0
273
  },
274
  "aux": {
275
- "p": 0.9772079772,
276
  "r": 0.9122340426,
277
- "f": 0.9436038514
278
  },
279
  "expl": {
280
- "p": 0.5365853659,
281
- "r": 0.511627907,
282
- "f": 0.5238095238
283
  },
284
  "appos": {
285
- "p": 0.5060240964,
286
- "r": 0.4158415842,
287
- "f": 0.4565217391
288
  },
289
  "xcomp": {
290
- "p": 0.5737704918,
291
- "r": 0.4268292683,
292
- "f": 0.4895104895
293
  },
294
- "dep": {
295
- "p": 0.0,
296
- "r": 0.0,
297
- "f": 0.0
298
  },
299
  "csubj": {
300
- "p": 0.7966101695,
301
- "r": 0.746031746,
302
- "f": 0.7704918033
303
  },
304
- "nmod:agent": {
305
- "p": 0.75,
306
- "r": 0.7846153846,
307
- "f": 0.7669172932
308
  },
309
  "aux:pass": {
310
- "p": 0.75,
311
- "r": 0.9,
312
- "f": 0.8181818182
313
- },
314
- "nsubj:pass": {
315
- "p": 0.6060606061,
316
- "r": 0.6711409396,
317
- "f": 0.6369426752
318
  },
319
- "ccomp:pmod": {
320
- "p": 0.375,
321
- "r": 0.2,
322
- "f": 0.2608695652
323
  },
324
  "advmod:tmod": {
325
  "p": 0.0,
@@ -331,10 +205,15 @@
331
  "r": 0.6666666667,
332
  "f": 0.5714285714
333
  },
334
  "expl:pass": {
335
- "p": 0.6966292135,
336
- "r": 0.6813186813,
337
- "f": 0.6888888889
338
  },
339
  "orphan": {
340
  "p": 0.0,
@@ -347,9 +226,9 @@
347
  "f": 0.1666666667
348
  },
349
  "csubj:pass": {
350
- "p": 0.5,
351
- "r": 0.3333333333,
352
- "f": 0.4
353
  },
354
  "vocative": {
355
  "p": 0.0,
@@ -362,86 +241,213 @@
362
  "f": 0.0
363
  }
364
  },
365
- "ents_per_type": {
366
- "DATETIME": {
367
- "p": 0.762541806,
368
- "r": 0.7944250871,
369
- "f": 0.7781569966
370
  },
371
- "ORGANIZATION": {
372
- "p": 0.6898734177,
373
- "r": 0.6942675159,
374
- "f": 0.6920634921
375
  },
376
- "FACILITY": {
377
- "p": 0.536,
378
- "r": 0.5114503817,
379
- "f": 0.5234375
380
  },
381
- "NUMERIC_VALUE": {
382
- "p": 0.9253112033,
383
- "r": 0.9449152542,
384
- "f": 0.9350104822
385
  },
386
- "ORDINAL": {
387
- "p": 0.8653846154,
388
- "r": 0.8181818182,
389
- "f": 0.8411214953
390
  },
391
- "EVENT": {
392
- "p": 0.6785714286,
393
- "r": 0.5135135135,
394
- "f": 0.5846153846
395
  },
396
- "GPE": {
397
- "p": 0.8545454545,
398
- "r": 0.8643678161,
399
- "f": 0.8594285714
400
  },
401
- "PERSON": {
402
- "p": 0.7031746032,
403
- "r": 0.7432885906,
404
- "f": 0.722675367
405
  },
406
- "NAT_REL_POL": {
407
- "p": 0.9300699301,
408
- "r": 0.8866666667,
409
- "f": 0.9078498294
410
  },
411
- "MONEY": {
412
- "p": 0.9230769231,
413
- "r": 0.8275862069,
414
- "f": 0.8727272727
415
  },
416
  "PRODUCT": {
417
- "p": 0.536,
418
- "r": 0.4890510949,
419
- "f": 0.5114503817
420
  },
421
  "LOC": {
422
- "p": 0.4868421053,
423
- "r": 0.4868421053,
424
- "f": 0.4868421053
425
  },
426
  "WORK_OF_ART": {
427
- "p": 0.3157894737,
428
- "r": 0.3157894737,
429
- "f": 0.3157894737
430
  },
431
  "QUANTITY": {
432
- "p": 0.8571428571,
433
- "r": 0.9230769231,
434
- "f": 0.8888888889
435
  },
436
  "LANGUAGE": {
437
- "p": 0.5714285714,
438
- "r": 1.0,
439
- "f": 0.7272727273
440
  },
441
  "PERIOD": {
442
- "p": 0.8648648649,
443
- "r": 0.7619047619,
444
- "f": 0.8101265823
445
  }
446
- }
 
447
  }
 
1
  {
2
  "token_acc": 0.9990029326,
3
+ "token_p": 0.9967350492,
4
+ "token_r": 0.9957244934,
5
+ "token_f": 0.9959492157,
6
+ "tag_acc": 0.9619726156,
7
+ "sents_p": 0.9626168224,
8
+ "sents_r": 0.9587765957,
9
+ "sents_f": 0.9606928714,
10
+ "dep_uas": 0.8893350063,
11
+ "dep_las": 0.8388068128,
12
  "dep_las_per_type": {
13
  "case": {
14
+ "p": 0.9337493999,
15
+ "r": 0.9492435334,
16
+ "f": 0.9414327202
17
  },
18
  "det": {
19
+ "p": 0.9484425349,
20
+ "r": 0.966083151,
21
+ "f": 0.9571815718
22
  },
23
  "nmod:tmod": {
24
+ "p": 0.6666666667,
25
+ "r": 0.0930232558,
26
+ "f": 0.1632653061
27
  },
28
  "amod": {
29
+ "p": 0.8737690242,
30
+ "r": 0.8864668483,
31
+ "f": 0.8800721371
32
  },
33
  "cc": {
34
+ "p": 0.877016129,
35
+ "r": 0.910041841,
36
+ "f": 0.8932238193
37
  },
38
  "conj": {
39
+ "p": 0.5879699248,
40
+ "r": 0.5915279879,
41
+ "f": 0.5897435897
42
  },
43
  "nmod": {
44
+ "p": 0.7885679164,
45
+ "r": 0.8099747475,
46
+ "f": 0.7991279975
47
  },
48
  "mark": {
49
+ "p": 0.9161147903,
50
+ "r": 0.9222222222,
51
+ "f": 0.919158361
52
  },
53
  "fixed": {
54
+ "p": 0.8559322034,
55
+ "r": 0.7163120567,
56
+ "f": 0.7799227799
57
  },
58
  "nsubj": {
59
+ "p": 0.8134920635,
60
+ "r": 0.7824427481,
61
+ "f": 0.7976653696
62
  },
63
  "advcl:tcl": {
64
  "p": 0.0,
 
66
  "f": 0.0
67
  },
68
  "obj": {
69
+ "p": 0.7793880837,
70
+ "r": 0.8273504274,
71
+ "f": 0.8026533997
72
  },
73
  "nummod": {
74
+ "p": 0.8892405063,
75
+ "r": 0.8619631902,
76
+ "f": 0.8753894081
77
  },
78
  "flat": {
79
+ "p": 0.7441860465,
80
+ "r": 0.6857142857,
81
+ "f": 0.7137546468
82
  },
83
  "obl": {
84
+ "p": 0.6402378593,
85
+ "r": 0.731596829,
86
+ "f": 0.6828752643
87
  },
88
+ "obl:pmod": {
89
+ "p": 0.4375,
90
+ "r": 0.1615384615,
91
+ "f": 0.2359550562
92
  },
93
  "acl": {
94
+ "p": 0.7222222222,
95
+ "r": 0.7303370787,
96
+ "f": 0.7262569832
97
  },
98
  "advmod": {
99
+ "p": 0.8060686016,
100
+ "r": 0.7823303457,
101
+ "f": 0.7940220923
102
  },
103
  "expl:pv": {
104
+ "p": 0.7777777778,
105
+ "r": 0.8191489362,
106
+ "f": 0.7979274611
107
  },
108
  "root": {
109
+ "p": 0.9103078983,
110
+ "r": 0.9042553191,
111
+ "f": 0.9072715143
112
  },
113
  "advcl": {
114
+ "p": 0.5579710145,
115
+ "r": 0.6260162602,
116
+ "f": 0.5900383142
117
  },
118
  "iobj": {
119
+ "p": 0.7966101695,
120
+ "r": 0.6394557823,
121
+ "f": 0.7094339623
122
  },
123
  "ccomp": {
124
+ "p": 0.6995073892,
125
+ "r": 0.802259887,
126
+ "f": 0.7473684211
127
  },
128
  "goeswith": {
129
+ "p": 0.25,
130
+ "r": 0.1428571429,
131
+ "f": 0.1818181818
132
  },
133
  "parataxis": {
134
+ "p": 0.8494623656,
135
+ "r": 0.6030534351,
136
+ "f": 0.7053571429
137
  },
138
  "expl:poss": {
139
+ "p": 0.6086956522,
140
+ "r": 0.6511627907,
141
+ "f": 0.6292134831
142
  },
143
  "cop": {
144
+ "p": 0.75,
145
+ "r": 0.773006135,
146
+ "f": 0.7613293051
147
  },
148
  "cc:preconj": {
149
  "p": 0.0,
 
151
  "f": 0.0
152
  },
153
  "aux": {
154
+ "p": 0.9661971831,
155
  "r": 0.9122340426,
156
+ "f": 0.9384404925
157
  },
158
  "expl": {
159
+ "p": 0.5714285714,
160
+ "r": 0.4761904762,
161
+ "f": 0.5194805195
162
  },
163
  "appos": {
164
+ "p": 0.4691358025,
165
+ "r": 0.3762376238,
166
+ "f": 0.4175824176
167
  },
168
  "xcomp": {
169
+ "p": 0.5538461538,
170
+ "r": 0.4337349398,
171
+ "f": 0.4864864865
172
  },
173
+ "nsubj:pass": {
174
+ "p": 0.5878787879,
175
+ "r": 0.6381578947,
176
+ "f": 0.6119873817
177
  },
178
  "csubj": {
179
+ "p": 0.8448275862,
180
+ "r": 0.7777777778,
181
+ "f": 0.8099173554
182
  },
183
+ "obl:agent": {
184
+ "p": 0.7538461538,
185
+ "r": 0.7538461538,
186
+ "f": 0.7538461538
187
  },
188
  "aux:pass": {
189
+ "p": 0.7428571429,
190
+ "r": 0.8666666667,
191
+ "f": 0.8
 
 
 
 
 
192
  },
193
+ "dep": {
194
+ "p": 0.0,
195
+ "r": 0.0,
196
+ "f": 0.0
197
  },
198
  "advmod:tmod": {
199
  "p": 0.0,
 
205
  "r": 0.6666666667,
206
  "f": 0.5714285714
207
  },
208
+ "ccomp:pmod": {
209
+ "p": 0.5,
210
+ "r": 0.1875,
211
+ "f": 0.2727272727
212
+ },
213
  "expl:pass": {
214
+ "p": 0.6808510638,
215
+ "r": 0.7032967033,
216
+ "f": 0.6918918919
217
  },
218
  "orphan": {
219
  "p": 0.0,
 
226
  "f": 0.1666666667
227
  },
228
  "csubj:pass": {
229
+ "p": 0.6666666667,
230
+ "r": 0.6666666667,
231
+ "f": 0.6666666667
232
  },
233
  "vocative": {
234
  "p": 0.0,
 
241
  "f": 0.0
242
  }
243
  },
244
+ "pos_acc": 0.9381923087,
245
+ "morph_acc": 0.9469023954,
246
+ "morph_micro_p": 0.9870716332,
247
+ "morph_micro_r": 0.9558096483,
248
+ "morph_micro_f": 0.9683797083,
249
+ "morph_per_feat": {
250
+ "AdpType": {
251
+ "p": 0.9954051796,
252
+ "r": 0.9941593659,
253
+ "f": 0.9947818827
254
  },
255
+ "Case": {
256
+ "p": 0.9873727088,
257
+ "r": 0.9820391627,
258
+ "f": 0.9846987136
259
  },
260
+ "Variant": {
261
+ "p": 0.976744186,
262
+ "r": 0.9130434783,
263
+ "f": 0.9438202247
264
  },
265
+ "Gender": {
266
+ "p": 0.9821478774,
267
+ "r": 0.9776129845,
268
+ "f": 0.9798751841
269
  },
270
+ "Number": {
271
+ "p": 0.9810964083,
272
+ "r": 0.9438508752,
273
+ "f": 0.9621133125
274
  },
275
+ "PronType": {
276
+ "p": 0.9902862986,
277
+ "r": 0.9872579001,
278
+ "f": 0.9887697805
279
  },
280
+ "Definite": {
281
+ "p": 0.9788447388,
282
+ "r": 0.9734723747,
283
+ "f": 0.9761511649
284
  },
285
+ "Degree": {
286
+ "p": 0.9568913175,
287
+ "r": 0.9347568209,
288
+ "f": 0.9456945695
289
  },
290
+ "Polarity": {
291
+ "p": 0.9884318766,
292
+ "r": 0.9858974359,
293
+ "f": 0.9871630295
294
  },
295
+ "Mood": {
296
+ "p": 0.9740072202,
297
+ "r": 0.9677187948,
298
+ "f": 0.9708528248
299
+ },
300
+ "Person": {
301
+ "p": 0.9764359352,
302
+ "r": 0.9696526508,
303
+ "f": 0.9730324711
304
+ },
305
+ "Tense": {
306
+ "p": 0.9707207207,
307
+ "r": 0.9563609467,
308
+ "f": 0.9634873323
309
+ },
310
+ "VerbForm": {
311
+ "p": 0.9714013346,
312
+ "r": 0.9622285175,
313
+ "f": 0.9667931689
314
+ },
315
+ "NumForm": {
316
+ "p": 0.9758064516,
317
+ "r": 0.2929782082,
318
+ "f": 0.4506517691
319
+ },
320
+ "NumType": {
321
+ "p": 0.9846153846,
322
+ "r": 0.3054892601,
323
+ "f": 0.4663023679
324
+ },
325
+ "PartType": {
326
+ "p": 0.9473684211,
327
+ "r": 0.9230769231,
328
+ "f": 0.9350649351
329
+ },
330
+ "Strength": {
331
+ "p": 0.9914675768,
332
+ "r": 0.97319933,
333
+ "f": 0.9822485207
334
+ },
335
+ "Reflex": {
336
+ "p": 0.9938461538,
337
+ "r": 0.9877675841,
338
+ "f": 0.990797546
339
+ },
340
+ "Poss": {
341
+ "p": 0.986013986,
342
+ "r": 0.986013986,
343
+ "f": 0.986013986
344
+ },
345
+ "Position": {
346
+ "p": 0.986013986,
347
+ "r": 0.9724137931,
348
+ "f": 0.9791666667
349
+ },
350
+ "Number[psor]": {
351
+ "p": 0.9420289855,
352
+ "r": 0.9558823529,
353
+ "f": 0.9489051095
354
+ },
355
+ "Foreign": {
356
+ "p": 0.0,
357
+ "r": 0.0,
358
+ "f": 0.0
359
+ },
360
+ "Abbr": {
361
+ "p": 0.9620253165,
362
+ "r": 0.9156626506,
363
+ "f": 0.9382716049
364
+ }
365
+ },
366
+ "lemma_acc": 0.8183070924,
367
+ "ents_p": 0.7485865058,
368
+ "ents_r": 0.7629658087,
369
+ "ents_f": 0.7557077626,
370
+ "ents_per_type": {
371
+ "DATETIME": {
372
+ "p": 0.0,
373
+ "r": 0.0,
374
+ "f": 0.0
375
+ },
376
+ "PERSON": {
377
+ "p": 0.0,
378
+ "r": 0.0,
379
+ "f": 0.0
380
  },
381
  "PRODUCT": {
382
+ "p": 0.0,
383
+ "r": 0.0,
384
+ "f": 0.0
385
  },
386
  "LOC": {
387
+ "p": 0.0,
388
+ "r": 0.0,
389
+ "f": 0.0
390
+ },
391
+ "GPE": {
392
+ "p": 0.0,
393
+ "r": 0.0,
394
+ "f": 0.0
395
+ },
396
+ "ORDINAL": {
397
+ "p": 0.0,
398
+ "r": 0.0,
399
+ "f": 0.0
400
+ },
401
+ "NUMERIC_VALUE": {
402
+ "p": 0.0,
403
+ "r": 0.0,
404
+ "f": 0.0
405
+ },
406
+ "ORGANIZATION": {
407
+ "p": 0.0,
408
+ "r": 0.0,
409
+ "f": 0.0
410
+ },
411
+ "NAT_REL_POL": {
412
+ "p": 0.0,
413
+ "r": 0.0,
414
+ "f": 0.0
415
  },
416
  "WORK_OF_ART": {
417
+ "p": 0.0,
418
+ "r": 0.0,
419
+ "f": 0.0
420
+ },
421
+ "EVENT": {
422
+ "p": 0.0,
423
+ "r": 0.0,
424
+ "f": 0.0
425
+ },
426
+ "FACILITY": {
427
+ "p": 0.0,
428
+ "r": 0.0,
429
+ "f": 0.0
430
  },
431
  "QUANTITY": {
432
+ "p": 0.0,
433
+ "r": 0.0,
434
+ "f": 0.0
435
+ },
436
+ "MONEY": {
437
+ "p": 0.0,
438
+ "r": 0.0,
439
+ "f": 0.0
440
  },
441
  "LANGUAGE": {
442
+ "p": 0.0,
443
+ "r": 0.0,
444
+ "f": 0.0
445
  },
446
  "PERIOD": {
447
+ "p": 0.0,
448
+ "r": 0.0,
449
+ "f": 0.0
450
  }
451
+ },
452
+ "speed": 8391.5537539766
453
  }
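The per-label `p`, `r`, and `f` values in the scores above appear to follow the usual F1 definition, where `f` is the harmonic mean of precision and recall. The following is a minimal sketch that re-derives `f` for a handful of triples copied from this diff. The helper name `f_score` and the tolerance are assumptions made for illustration and are not part of the pipeline.

```python
def f_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if p + r == 0.0:
        return 0.0
    return 2 * p * r / (p + r)


# (p, r, f) triples copied from the per-label scores in this diff.
reported = {
    "flat": (0.7441860465, 0.6857142857, 0.7137546468),
    "obl:pmod": (0.4375, 0.1615384615, 0.2359550562),
    "aux:pass": (0.7428571429, 0.8666666667, 0.8),
    "NumForm": (0.9758064516, 0.2929782082, 0.4506517691),
    "dep": (0.0, 0.0, 0.0),
}

# Assumed tolerance: the reported values are rounded to 10 decimal places.
TOLERANCE = 1e-6

# Collect any label whose reported f differs from the recomputed value.
mismatches = {
    label: (f, f_score(p, r))
    for label, (p, r, f) in reported.items()
    if abs(f_score(p, r) - f) > TOLERANCE
}

print(mismatches or "all reported F-scores match 2pr / (p + r)")
```

A check of this kind can help spot rows where `f` does not follow from `p` and `r`. It cannot explain overall scores that are inconsistent with their per-type breakdown.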
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -1,10 +1,8 @@
1
  [paths]
2
- train = "corpus/ro-dep-mixed/train.spacy"
3
- dev = "corpus/ro-dep-mixed/dev.spacy"
4
- vectors = "corpus/ro_vectors"
5
- raw = null
6
  init_tok2vec = null
7
- vocab_data = null
8
 
9
  [system]
10
  gpu_allocator = null
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
24
 
25
  [components.attribute_ruler]
26
  factory = "attribute_ruler"
 
27
  validate = false
28
 
29
  [components.lemmatizer]
@@ -31,11 +30,13 @@ factory = "lemmatizer"
31
  mode = "lookup"
32
  model = null
33
  overwrite = false
 
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
 
39
  update_with_oracle_cut_size = 100
40
 
41
  [components.ner.model]
@@ -53,8 +54,8 @@ nO = null
53
  [components.ner.model.tok2vec.embed]
54
  @architectures = "spacy.MultiHashEmbed.v2"
55
  width = 96
56
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
57
- rows = [5000,2500,2500,2500]
58
  include_static_vectors = true
59
 
60
  [components.ner.model.tok2vec.encode]
@@ -69,6 +70,7 @@ factory = "parser"
69
  learn_tokens = false
70
  min_action_freq = 30
71
  moves = null
 
72
  update_with_oracle_cut_size = 100
73
 
74
  [components.parser.model]
@@ -87,6 +89,8 @@ upstream = "tok2vec"
87
 
88
  [components.senter]
89
  factory = "senter"
 
 
90
 
91
  [components.senter.model]
92
  @architectures = "spacy.Tagger.v1"
@@ -98,8 +102,8 @@ nO = null
98
  [components.senter.model.tok2vec.embed]
99
  @architectures = "spacy.MultiHashEmbed.v2"
100
  width = 16
101
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
102
- rows = [1000,500,500,500]
103
  include_static_vectors = true
104
 
105
  [components.senter.model.tok2vec.encode]
@@ -111,6 +115,8 @@ maxout_pieces = 2
111
 
112
  [components.tagger]
113
  factory = "tagger"
 
 
114
 
115
  [components.tagger.model]
116
  @architectures = "spacy.Tagger.v1"
@@ -130,8 +136,8 @@ factory = "tok2vec"
130
  [components.tok2vec.model.embed]
131
  @architectures = "spacy.MultiHashEmbed.v2"
132
  width = ${components.tok2vec.model.encode:width}
133
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
134
- rows = [5000,2500,2500,2500]
135
  include_static_vectors = true
136
 
137
  [components.tok2vec.model.encode]
@@ -145,22 +151,19 @@ maxout_pieces = 3
145
 
146
  [corpora.dev]
147
  @readers = "spacy.Corpus.v1"
148
- limit = 0
149
- max_length = 0
150
- path = ${paths:dev}
151
  gold_preproc = false
 
 
152
  augmenter = null
153
 
154
  [corpora.train]
155
  @readers = "spacy.Corpus.v1"
156
- path = ${paths:train}
157
- max_length = 5000
158
  gold_preproc = false
 
159
  limit = 0
160
-
161
- [corpora.train.augmenter]
162
- @augmenters = "spacy.lower_case.v1"
163
- level = 0.1
164
 
165
  [training]
166
  train_corpus = "corpora.train"
@@ -191,9 +194,8 @@ compound = 1.001
191
  t = 0.0
192
 
193
  [training.logger]
194
- @loggers = "spacy.WandbLogger.v1"
195
- project_name = "spacy-v3.0.0a2"
196
- remove_config_values = []
197
 
198
  [training.optimizer]
199
  @optimizers = "Adam.v1"
@@ -214,16 +216,17 @@ dep_las_per_type = null
214
  sents_p = null
215
  sents_r = null
216
  sents_f = 0.02
217
- lemma_acc = 0.33
218
- ents_f = 0.33
219
  ents_p = 0.0
220
  ents_r = 0.0
221
  ents_per_type = null
 
222
 
223
  [pretraining]
224
 
225
  [initialize]
226
- vocab_data = ${paths.vocab_data}
227
  vectors = ${paths.vectors}
228
  init_tok2vec = ${paths.init_tok2vec}
229
  before_init = null
 
1
  [paths]
2
+ train = null
3
+ dev = null
4
+ vectors = null
 
5
  init_tok2vec = null
 
6
 
7
  [system]
8
  gpu_allocator = null
 
22
 
23
  [components.attribute_ruler]
24
  factory = "attribute_ruler"
25
+ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
 
30
  mode = "lookup"
31
  model = null
32
  overwrite = false
33
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
39
+ scorer = {"@scorers":"spacy.ner_scorer.v1"}
40
  update_with_oracle_cut_size = 100
41
 
42
  [components.ner.model]
 
54
  [components.ner.model.tok2vec.embed]
55
  @architectures = "spacy.MultiHashEmbed.v2"
56
  width = 96
57
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
58
+ rows = [5000,2500,2500,2500,100]
59
  include_static_vectors = true
60
 
61
  [components.ner.model.tok2vec.encode]
 
70
  learn_tokens = false
71
  min_action_freq = 30
72
  moves = null
73
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
74
  update_with_oracle_cut_size = 100
75
 
76
  [components.parser.model]
 
89
 
90
  [components.senter]
91
  factory = "senter"
92
+ overwrite = false
93
+ scorer = {"@scorers":"spacy.senter_scorer.v1"}
94
 
95
  [components.senter.model]
96
  @architectures = "spacy.Tagger.v1"
 
102
  [components.senter.model.tok2vec.embed]
103
  @architectures = "spacy.MultiHashEmbed.v2"
104
  width = 16
105
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
106
+ rows = [1000,500,500,500,50]
107
  include_static_vectors = true
108
 
109
  [components.senter.model.tok2vec.encode]
 
115
 
116
  [components.tagger]
117
  factory = "tagger"
118
+ overwrite = false
119
+ scorer = {"@scorers":"spacy.tagger_scorer.v1"}
120
 
121
  [components.tagger.model]
122
  @architectures = "spacy.Tagger.v1"
 
136
  [components.tok2vec.model.embed]
137
  @architectures = "spacy.MultiHashEmbed.v2"
138
  width = ${components.tok2vec.model.encode:width}
139
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
140
+ rows = [5000,2500,2500,2500,100]
141
  include_static_vectors = true
142
 
143
  [components.tok2vec.model.encode]
 
151
 
152
  [corpora.dev]
153
  @readers = "spacy.Corpus.v1"
154
+ path = ${paths.dev}
 
 
155
  gold_preproc = false
156
+ max_length = 0
157
+ limit = 0
158
  augmenter = null
159
 
160
  [corpora.train]
161
  @readers = "spacy.Corpus.v1"
162
+ path = ${paths.train}
 
163
  gold_preproc = false
164
+ max_length = 0
165
  limit = 0
166
+ augmenter = null
 
 
 
167
 
168
  [training]
169
  train_corpus = "corpora.train"
 
194
  t = 0.0
195
 
196
  [training.logger]
197
+ @loggers = "spacy.ConsoleLogger.v1"
198
+ progress_bar = false
 
199
 
200
  [training.optimizer]
201
  @optimizers = "Adam.v1"
 
216
  sents_p = null
217
  sents_r = null
218
  sents_f = 0.02
219
+ lemma_acc = 0.5
220
+ ents_f = 0.16
221
  ents_p = 0.0
222
  ents_r = 0.0
223
  ents_per_type = null
224
+ speed = 0.0
225
 
226
  [pretraining]
227
 
228
  [initialize]
229
+ vocab_data = null
230
  vectors = ${paths.vectors}
231
  init_tok2vec = ${paths.init_tok2vec}
232
  before_init = null
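The config change above adds a fifth attribute, `SPACY`, to each `attrs` list and a fifth entry to each matching `rows` list. To my understanding, `spacy.MultiHashEmbed.v2` expects one `rows` entry per `attrs` entry, so the two lists have to grow together. The sketch below checks that pairing in plain Python, with values copied from the new side of the diff. The helper `pair_rows` is hypothetical and does not call spaCy.

```python
ATTRS = ["NORM", "PREFIX", "SUFFIX", "SHAPE", "SPACY"]

# Section names and row counts copied from the updated config.cfg.
embed_rows = {
    "components.ner.model.tok2vec.embed": [5000, 2500, 2500, 2500, 100],
    "components.senter.model.tok2vec.embed": [1000, 500, 500, 500, 50],
    "components.tok2vec.model.embed": [5000, 2500, 2500, 2500, 100],
}


def pair_rows(attrs, rows):
    """Map each attribute to its table size, rejecting mismatched lengths."""
    if len(attrs) != len(rows):
        raise ValueError(f"Mismatched lengths: {len(attrs)} attrs vs {len(rows)} rows")
    return dict(zip(attrs, rows))


tables = {section: pair_rows(ATTRS, rows) for section, rows in embed_rows.items()}

for section, table in tables.items():
    print(section, table)
```

Pairing the old four-entry `rows` list with the new five-entry `attrs` list would raise the `ValueError`, which is the kind of mismatch this update avoids.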
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"ro",
3
  "name":"core_news_md",
4
- "version":"3.1.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.1.0,<3.2.0",
11
- "spacy_git_version":"caba63b74",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
@@ -30,6 +30,7 @@
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
 
33
  "Afpf--n",
34
  "Afpfp-n",
35
  "Afpfp-ny",
@@ -131,6 +132,7 @@
131
  "Ds2ms-s",
132
  "Ds3---p",
133
  "Ds3---s",
 
134
  "Ds3fp-s",
135
  "Ds3fsos",
136
  "Ds3fsrs",
@@ -159,18 +161,23 @@
159
  "LSQR",
160
  "LT",
161
  "M",
162
- "Mc",
163
  "Mc-p-d",
164
  "Mc-p-l",
 
 
 
165
  "Mcfp-l",
166
  "Mcfp-ln",
167
  "Mcfprln",
168
  "Mcfprly",
169
  "Mcfsoln",
 
170
  "Mcfsrln",
 
171
  "Mcmp-l",
172
  "Mcms-ln",
173
  "Mcmsrl",
 
174
  "Mcmsrly",
175
  "Mffprln",
176
  "Mffsrln",
@@ -243,7 +250,6 @@
243
  "Pd3mpr--y",
244
  "Pd3mso",
245
  "Pd3msr",
246
- "Pi3",
247
  "Pi3--r",
248
  "Pi3-po",
249
  "Pi3-so",
@@ -289,6 +295,7 @@
289
  "Pp3-po--------s",
290
  "Pp3-sd--------w",
291
  "Pp3-sd--y-----w",
 
292
  "Pp3fpa--------w",
293
  "Pp3fpa--y-----w",
294
  "Pp3fpr--------s",
@@ -315,7 +322,6 @@
315
  "Ps2fp-s",
316
  "Ps2fsrp",
317
  "Ps2fsrs",
318
- "Ps2ms-s",
319
  "Ps3---p",
320
  "Ps3---s",
321
  "Ps3fp-s",
@@ -348,7 +354,6 @@
348
  "RPAR",
349
  "RSQR",
350
  "Rc",
351
- "Rgc",
352
  "Rgp",
353
  "Rgpy",
354
  "Rgs",
@@ -406,6 +411,7 @@
406
  "Va--3s",
407
  "Va--3s----y",
408
  "Vag",
 
409
  "Vaii1",
410
  "Vaii2s",
411
  "Vaii3p",
@@ -475,7 +481,7 @@
475
  "Vmp--sm",
476
  "Vmp--sm---y",
477
  "Vmsp1p",
478
- "Vmsp1s",
479
  "Vmsp2s",
480
  "Vmsp3",
481
  "Vmsp3-----y",
@@ -488,6 +494,7 @@
488
  "Ynmsoy",
489
  "Ynmsry",
490
  "Yp",
 
491
  "Yp-sr",
492
  "Yr"
493
  ],
@@ -525,14 +532,14 @@
525
  "iobj",
526
  "mark",
527
  "nmod",
528
- "nmod:agent",
529
- "nmod:pmod",
530
  "nmod:tmod",
531
  "nsubj",
532
  "nsubj:pass",
533
  "nummod",
534
  "obj",
535
  "obl",
 
 
536
  "orphan",
537
  "parataxis",
538
  "punct",
@@ -590,186 +597,65 @@
590
  ],
591
  "performance":{
592
  "token_acc":0.9990029326,
593
- "tag_acc":0.9730009254,
594
- "pos_acc":0.9642915465,
595
- "morph_acc":0.9742780152,
596
- "lemma_acc":0.8186589263,
597
- "dep_uas":0.8902760351,
598
- "dep_las":0.8442910916,
599
- "ents_p":0.7524828113,
600
- "ents_r":0.7568190549,
601
- "ents_f":0.7546447041,
602
- "sents_p":0.9598393574,
603
- "sents_r":0.9534574468,
604
- "sents_f":0.9566377585,
605
- "speed":8493.8160932984,
606
- "morph_per_feat":{
607
- "AdpType":{
608
- "p":0.997492687,
609
- "r":0.9933416563,
610
- "f":0.995412844
611
- },
612
- "Case":{
613
- "p":0.9877617623,
614
- "r":0.9809588116,
615
- "f":0.9843485331
616
- },
617
- "Variant":{
618
- "p":0.9846153846,
619
- "r":0.9241877256,
620
- "f":0.9534450652
621
- },
622
- "Gender":{
623
- "p":0.9798818233,
624
- "r":0.9754901961,
625
- "f":0.977681078
626
- },
627
- "Number":{
628
- "p":0.9811536265,
629
- "r":0.9754712696,
630
- "f":0.9783041968
631
- },
632
- "PronType":{
633
- "p":0.9943589744,
634
- "r":0.9892857143,
635
- "f":0.9918158568
636
- },
637
- "Definite":{
638
- "p":0.9773605743,
639
- "r":0.9711046086,
640
- "f":0.9742225484
641
- },
642
- "Degree":{
643
- "p":0.9527845036,
644
- "r":0.9369047619,
645
- "f":0.9447779112
646
- },
647
- "Polarity":{
648
- "p":0.9884318766,
649
- "r":0.9846350832,
650
- "f":0.9865298268
651
- },
652
- "Mood":{
653
- "p":0.9760869565,
654
- "r":0.9621428571,
655
- "f":0.9690647482
656
- },
657
- "Person":{
658
- "p":0.9822419534,
659
- "r":0.9696859021,
660
- "f":0.9759235435
661
- },
662
- "Tense":{
663
- "p":0.9691497366,
664
- "r":0.9491525424,
665
- "f":0.9590469099
666
- },
667
- "VerbForm":{
668
- "p":0.9661582459,
669
- "r":0.9579395085,
670
- "f":0.9620313242
671
- },
672
- "NumForm":{
673
- "p":0.9926650367,
674
- "r":0.9902439024,
675
- "f":0.9914529915
676
- },
677
- "NumType":{
678
- "p":0.9951807229,
679
- "r":0.9904076739,
680
- "f":0.9927884615
681
- },
682
- "PartType":{
683
- "p":0.9473684211,
684
- "r":0.9,
685
- "f":0.9230769231
686
- },
687
- "Strength":{
688
- "p":0.9931623932,
689
- "r":0.9781144781,
690
- "f":0.9855810008
691
- },
692
- "Reflex":{
693
- "p":0.9969135802,
694
- "r":0.990797546,
695
- "f":0.9938461538
696
- },
697
- "Poss":{
698
- "p":0.9826989619,
699
- "r":0.993006993,
700
- "f":0.987826087
701
- },
702
- "Position":{
703
- "p":0.9791666667,
704
- "r":0.9724137931,
705
- "f":0.9757785467
706
- },
707
- "Number[psor]":{
708
- "p":0.9436619718,
709
- "r":0.9710144928,
710
- "f":0.9571428571
711
- },
712
- "Abbr":{
713
- "p":0.9625,
714
- "r":0.9058823529,
715
- "f":0.9333333333
716
- },
717
- "Foreign":{
718
- "p":0.0,
719
- "r":0.0,
720
- "f":0.0
721
- }
722
- },
723
  "dep_las_per_type":{
724
  "case":{
725
- "p":0.9279192696,
726
- "r":0.9410331384,
727
- "f":0.934430196
728
  },
729
  "det":{
730
- "p":0.9426751592,
731
- "r":0.9736842105,
732
- "f":0.9579288026
733
  },
734
  "nmod:tmod":{
735
- "p":0.3333333333,
736
- "r":0.023255814,
737
- "f":0.0434782609
738
  },
739
  "amod":{
740
- "p":0.8767985612,
741
- "r":0.8847549909,
742
- "f":0.8807588076
743
  },
744
  "cc":{
745
- "p":0.8734693878,
746
- "r":0.8953974895,
747
- "f":0.8842975207
748
  },
749
  "conj":{
750
- "p":0.6020864382,
751
- "r":0.6102719033,
752
- "f":0.6061515379
753
  },
754
  "nmod":{
755
- "p":0.7827130852,
756
- "r":0.8242730721,
757
- "f":0.802955665
758
  },
759
  "mark":{
760
- "p":0.8881578947,
761
- "r":0.9101123596,
762
- "f":0.8990011099
763
  },
764
  "fixed":{
765
- "p":0.8504273504,
766
- "r":0.6945898778,
767
- "f":0.7646493756
768
  },
769
  "nsubj":{
770
- "p":0.8195386703,
771
- "r":0.7674714104,
772
- "f":0.7926509186
773
  },
774
  "advcl:tcl":{
775
  "p":0.0,
@@ -777,84 +663,84 @@
777
  "f":0.0
778
  },
779
  "obj":{
780
- "p":0.7511811024,
781
- "r":0.8139931741,
782
- "f":0.7813267813
783
  },
784
  "nummod":{
785
- "p":0.9028213166,
786
- "r":0.8861538462,
787
- "f":0.8944099379
788
  },
789
  "flat":{
790
- "p":0.765625,
791
- "r":0.7,
792
- "f":0.7313432836
793
  },
794
  "obl":{
795
- "p":0.6436548223,
796
- "r":0.7196367764,
797
- "f":0.679528403
798
  },
799
- "nmod:pmod":{
800
- "p":0.5454545455,
801
- "r":0.1384615385,
802
- "f":0.2208588957
803
  },
804
  "acl":{
805
- "p":0.6765498652,
806
- "r":0.7150997151,
807
- "f":0.6952908587
808
  },
809
  "advmod":{
810
- "p":0.7627785059,
811
- "r":0.75,
812
- "f":0.7563352827
813
  },
814
  "expl:pv":{
815
- "p":0.7788944724,
816
- "r":0.8288770053,
817
- "f":0.8031088083
818
  },
819
  "root":{
820
- "p":0.9196787149,
821
- "r":0.9135638298,
822
- "f":0.916611074
823
  },
824
  "advcl":{
825
- "p":0.5634920635,
826
- "r":0.5772357724,
827
- "f":0.5702811245
828
  },
829
  "iobj":{
830
- "p":0.7578125,
831
- "r":0.6554054054,
832
- "f":0.7028985507
833
  },
834
  "ccomp":{
835
- "p":0.7272727273,
836
- "r":0.808988764,
837
- "f":0.7659574468
838
  },
839
  "goeswith":{
840
- "p":0.7,
841
- "r":0.5833333333,
842
- "f":0.6363636364
843
  },
844
  "parataxis":{
845
- "p":0.7553191489,
846
- "r":0.5419847328,
847
- "f":0.6311111111
848
  },
849
  "expl:poss":{
850
- "p":0.6666666667,
851
- "r":0.6976744186,
852
- "f":0.6818181818
853
  },
854
  "cop":{
855
- "p":0.7607361963,
856
- "r":0.7654320988,
857
- "f":0.7630769231
858
  },
859
  "cc:preconj":{
860
  "p":0.0,
@@ -862,54 +748,49 @@
862
  "f":0.0
863
  },
864
  "aux":{
865
- "p":0.9772079772,
866
  "r":0.9122340426,
867
- "f":0.9436038514
868
  },
869
  "expl":{
870
- "p":0.5365853659,
871
- "r":0.511627907,
872
- "f":0.5238095238
873
  },
874
  "appos":{
875
- "p":0.5060240964,
876
- "r":0.4158415842,
877
- "f":0.4565217391
878
  },
879
  "xcomp":{
880
- "p":0.5737704918,
881
- "r":0.4268292683,
882
- "f":0.4895104895
883
  },
884
- "dep":{
885
- "p":0.0,
886
- "r":0.0,
887
- "f":0.0
888
  },
889
  "csubj":{
890
- "p":0.7966101695,
891
- "r":0.746031746,
892
- "f":0.7704918033
893
  },
894
- "nmod:agent":{
895
- "p":0.75,
896
- "r":0.7846153846,
897
- "f":0.7669172932
898
  },
899
  "aux:pass":{
900
- "p":0.75,
901
- "r":0.9,
902
- "f":0.8181818182
903
- },
904
- "nsubj:pass":{
905
- "p":0.6060606061,
906
- "r":0.6711409396,
907
- "f":0.6369426752
908
  },
909
- "ccomp:pmod":{
910
- "p":0.375,
911
- "r":0.2,
912
- "f":0.2608695652
913
  },
914
  "advmod:tmod":{
915
  "p":0.0,
@@ -921,10 +802,15 @@
921
  "r":0.6666666667,
922
  "f":0.5714285714
923
  },
 
 
 
 
 
924
  "expl:pass":{
925
- "p":0.6966292135,
926
- "r":0.6813186813,
927
- "f":0.6888888889
928
  },
929
  "orphan":{
930
  "p":0.0,
@@ -937,9 +823,9 @@
937
  "f":0.1666666667
938
  },
939
  "csubj:pass":{
940
- "p":0.5,
941
- "r":0.3333333333,
942
- "f":0.4
943
  },
944
  "vocative":{
945
  "p":0.0,
@@ -952,88 +838,215 @@
952
  "f":0.0
953
  }
954
  },
955
- "ents_per_type":{
956
- "DATETIME":{
957
- "p":0.762541806,
958
- "r":0.7944250871,
959
- "f":0.7781569966
 
 
 
 
 
960
  },
961
- "ORGANIZATION":{
962
- "p":0.6898734177,
963
- "r":0.6942675159,
964
- "f":0.6920634921
965
  },
966
- "FACILITY":{
967
- "p":0.536,
968
- "r":0.5114503817,
969
- "f":0.5234375
970
  },
971
- "NUMERIC_VALUE":{
972
- "p":0.9253112033,
973
- "r":0.9449152542,
974
- "f":0.9350104822
975
  },
976
- "ORDINAL":{
977
- "p":0.8653846154,
978
- "r":0.8181818182,
979
- "f":0.8411214953
980
  },
981
- "EVENT":{
982
- "p":0.6785714286,
983
- "r":0.5135135135,
984
- "f":0.5846153846
985
  },
986
- "GPE":{
987
- "p":0.8545454545,
988
- "r":0.8643678161,
989
- "f":0.8594285714
990
  },
991
- "PERSON":{
992
- "p":0.7031746032,
993
- "r":0.7432885906,
994
- "f":0.722675367
995
  },
996
- "NAT_REL_POL":{
997
- "p":0.9300699301,
998
- "r":0.8866666667,
999
- "f":0.9078498294
1000
  },
1001
- "MONEY":{
1002
- "p":0.9230769231,
1003
- "r":0.8275862069,
1004
- "f":0.8727272727
1005
  },
1006
  "PRODUCT":{
1007
- "p":0.536,
1008
- "r":0.4890510949,
1009
- "f":0.5114503817
1010
  },
1011
  "LOC":{
1012
- "p":0.4868421053,
1013
- "r":0.4868421053,
1014
- "f":0.4868421053
1015
  },
1016
  "WORK_OF_ART":{
1017
- "p":0.3157894737,
1018
- "r":0.3157894737,
1019
- "f":0.3157894737
1020
  },
1021
  "QUANTITY":{
1022
- "p":0.8571428571,
1023
- "r":0.9230769231,
1024
- "f":0.8888888889
1025
  },
1026
  "LANGUAGE":{
1027
- "p":0.5714285714,
1028
- "r":1.0,
1029
- "f":0.7272727273
1030
  },
1031
  "PERIOD":{
1032
- "p":0.8648648649,
1033
- "r":0.7619047619,
1034
- "f":0.8101265823
1035
  }
1036
- }
 
1037
  },
1038
  "sources":[
1039
  {
@@ -1043,7 +1056,7 @@
1043
  "author":"Michal M\u011bchura"
1044
  },
1045
  {
1046
- "name":"UD Romanian RRT v2.5",
1047
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1048
  "license":"CC BY-SA 4.0",
1049
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
 
1
  {
2
  "lang":"ro",
3
  "name":"core_news_md",
4
+ "version":"3.2.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.2.0,<3.3.0",
11
+ "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
 
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
33
+ "Afp-srn",
34
  "Afpf--n",
35
  "Afpfp-n",
36
  "Afpfp-ny",
 
132
  "Ds2ms-s",
133
  "Ds3---p",
134
  "Ds3---s",
135
+ "Ds3---sy",
136
  "Ds3fp-s",
137
  "Ds3fsos",
138
  "Ds3fsrs",
 
161
  "LSQR",
162
  "LT",
163
  "M",
 
164
  "Mc-p-d",
165
  "Mc-p-l",
166
+ "Mc-s-b",
167
+ "Mc-s-d",
168
+ "Mc-s-l",
169
  "Mcfp-l",
170
  "Mcfp-ln",
171
  "Mcfprln",
172
  "Mcfprly",
173
  "Mcfsoln",
174
+ "Mcfsrl",
175
  "Mcfsrln",
176
+ "Mcfsrly",
177
  "Mcmp-l",
178
  "Mcms-ln",
179
  "Mcmsrl",
180
+ "Mcmsrln",
181
  "Mcmsrly",
182
  "Mffprln",
183
  "Mffsrln",
 
250
  "Pd3mpr--y",
251
  "Pd3mso",
252
  "Pd3msr",
 
253
  "Pi3--r",
254
  "Pi3-po",
255
  "Pi3-so",
 
295
  "Pp3-po--------s",
296
  "Pp3-sd--------w",
297
  "Pp3-sd--y-----w",
298
+ "Pp3-so--------s",
299
  "Pp3fpa--------w",
300
  "Pp3fpa--y-----w",
301
  "Pp3fpr--------s",
 
322
  "Ps2fp-s",
323
  "Ps2fsrp",
324
  "Ps2fsrs",
 
325
  "Ps3---p",
326
  "Ps3---s",
327
  "Ps3fp-s",
 
354
  "RPAR",
355
  "RSQR",
356
  "Rc",
 
357
  "Rgp",
358
  "Rgpy",
359
  "Rgs",
 
411
  "Va--3s",
412
  "Va--3s----y",
413
  "Vag",
414
+ "Vag-------y",
415
  "Vaii1",
416
  "Vaii2s",
417
  "Vaii3p",
 
481
  "Vmp--sm",
482
  "Vmp--sm---y",
483
  "Vmsp1p",
484
+ "Vmsp2p",
485
  "Vmsp2s",
486
  "Vmsp3",
487
  "Vmsp3-----y",
 
494
  "Ynmsoy",
495
  "Ynmsry",
496
  "Yp",
497
+ "Yp,Yn",
498
  "Yp-sr",
499
  "Yr"
500
  ],
 
532
  "iobj",
533
  "mark",
534
  "nmod",
 
 
535
  "nmod:tmod",
536
  "nsubj",
537
  "nsubj:pass",
538
  "nummod",
539
  "obj",
540
  "obl",
541
+ "obl:agent",
542
+ "obl:pmod",
543
  "orphan",
544
  "parataxis",
545
  "punct",
 
597
  ],
598
  "performance":{
599
  "token_acc":0.9990029326,
600
+ "token_p":0.9967350492,
601
+ "token_r":0.9957244934,
602
+ "token_f":0.9959492157,
603
+ "tag_acc":0.9619726156,
604
+ "sents_p":0.9626168224,
605
+ "sents_r":0.9587765957,
606
+ "sents_f":0.9606928714,
607
+ "dep_uas":0.8893350063,
608
+ "dep_las":0.8388068128,
609
  "dep_las_per_type":{
610
  "case":{
611
+ "p":0.9337493999,
612
+ "r":0.9492435334,
613
+ "f":0.9414327202
614
  },
615
  "det":{
616
+ "p":0.9484425349,
617
+ "r":0.966083151,
618
+ "f":0.9571815718
619
  },
620
  "nmod:tmod":{
621
+ "p":0.6666666667,
622
+ "r":0.0930232558,
623
+ "f":0.1632653061
624
  },
625
  "amod":{
626
+ "p":0.8737690242,
627
+ "r":0.8864668483,
628
+ "f":0.8800721371
629
  },
630
  "cc":{
631
+ "p":0.877016129,
632
+ "r":0.910041841,
633
+ "f":0.8932238193
634
  },
635
  "conj":{
636
+ "p":0.5879699248,
637
+ "r":0.5915279879,
638
+ "f":0.5897435897
639
  },
640
  "nmod":{
641
+ "p":0.7885679164,
642
+ "r":0.8099747475,
643
+ "f":0.7991279975
644
  },
645
  "mark":{
646
+ "p":0.9161147903,
647
+ "r":0.9222222222,
648
+ "f":0.919158361
649
  },
650
  "fixed":{
651
+ "p":0.8559322034,
652
+ "r":0.7163120567,
653
+ "f":0.7799227799
654
  },
655
  "nsubj":{
656
+ "p":0.8134920635,
657
+ "r":0.7824427481,
658
+ "f":0.7976653696
659
  },
660
  "advcl:tcl":{
661
  "p":0.0,
 
663
  "f":0.0
664
  },
665
  "obj":{
666
+ "p":0.7793880837,
667
+ "r":0.8273504274,
668
+ "f":0.8026533997
669
  },
670
  "nummod":{
671
+ "p":0.8892405063,
672
+ "r":0.8619631902,
673
+ "f":0.8753894081
674
  },
675
  "flat":{
676
+ "p":0.7441860465,
677
+ "r":0.6857142857,
678
+ "f":0.7137546468
679
  },
680
  "obl":{
681
+ "p":0.6402378593,
682
+ "r":0.731596829,
683
+ "f":0.6828752643
684
  },
685
+ "obl:pmod":{
686
+ "p":0.4375,
687
+ "r":0.1615384615,
688
+ "f":0.2359550562
689
  },
690
  "acl":{
691
+ "p":0.7222222222,
692
+ "r":0.7303370787,
693
+ "f":0.7262569832
694
  },
695
  "advmod":{
696
+ "p":0.8060686016,
697
+ "r":0.7823303457,
698
+ "f":0.7940220923
699
  },
700
  "expl:pv":{
701
+ "p":0.7777777778,
702
+ "r":0.8191489362,
703
+ "f":0.7979274611
704
  },
705
  "root":{
706
+ "p":0.9103078983,
707
+ "r":0.9042553191,
708
+ "f":0.9072715143
709
  },
710
  "advcl":{
711
+ "p":0.5579710145,
712
+ "r":0.6260162602,
713
+ "f":0.5900383142
714
  },
715
  "iobj":{
716
+ "p":0.7966101695,
717
+ "r":0.6394557823,
718
+ "f":0.7094339623
719
  },
720
  "ccomp":{
721
+ "p":0.6995073892,
722
+ "r":0.802259887,
723
+ "f":0.7473684211
724
  },
725
  "goeswith":{
726
+ "p":0.25,
727
+ "r":0.1428571429,
728
+ "f":0.1818181818
729
  },
730
  "parataxis":{
731
+ "p":0.8494623656,
732
+ "r":0.6030534351,
733
+ "f":0.7053571429
734
  },
735
  "expl:poss":{
736
+ "p":0.6086956522,
737
+ "r":0.6511627907,
738
+ "f":0.6292134831
739
  },
740
  "cop":{
741
+ "p":0.75,
742
+ "r":0.773006135,
743
+ "f":0.7613293051
744
  },
745
  "cc:preconj":{
746
  "p":0.0,
 
748
  "f":0.0
749
  },
750
  "aux":{
751
+ "p":0.9661971831,
752
  "r":0.9122340426,
753
+ "f":0.9384404925
754
  },
755
  "expl":{
756
+ "p":0.5714285714,
757
+ "r":0.4761904762,
758
+ "f":0.5194805195
759
  },
760
  "appos":{
761
+ "p":0.4691358025,
762
+ "r":0.3762376238,
763
+ "f":0.4175824176
764
  },
765
  "xcomp":{
766
+ "p":0.5538461538,
767
+ "r":0.4337349398,
768
+ "f":0.4864864865
769
  },
770
+ "nsubj:pass":{
771
+ "p":0.5878787879,
772
+ "r":0.6381578947,
773
+ "f":0.6119873817
774
  },
775
  "csubj":{
776
+ "p":0.8448275862,
777
+ "r":0.7777777778,
778
+ "f":0.8099173554
779
  },
780
+ "obl:agent":{
781
+ "p":0.7538461538,
782
+ "r":0.7538461538,
783
+ "f":0.7538461538
784
  },
785
  "aux:pass":{
786
+ "p":0.7428571429,
787
+ "r":0.8666666667,
788
+ "f":0.8
 
 
 
 
 
789
  },
790
+ "dep":{
791
+ "p":0.0,
792
+ "r":0.0,
793
+ "f":0.0
794
  },
795
  "advmod:tmod":{
796
  "p":0.0,
 
802
  "r":0.6666666667,
803
  "f":0.5714285714
804
  },
805
+ "ccomp:pmod":{
806
+ "p":0.5,
807
+ "r":0.1875,
808
+ "f":0.2727272727
809
+ },
810
  "expl:pass":{
811
+ "p":0.6808510638,
812
+ "r":0.7032967033,
813
+ "f":0.6918918919
814
  },
815
  "orphan":{
816
  "p":0.0,
 
823
  "f":0.1666666667
824
  },
825
  "csubj:pass":{
826
+ "p":0.6666666667,
827
+ "r":0.6666666667,
828
+ "f":0.6666666667
829
  },
830
  "vocative":{
831
  "p":0.0,
 
838
  "f":0.0
839
  }
840
  },
841
+ "pos_acc":0.9381923087,
842
+ "morph_acc":0.9469023954,
843
+ "morph_micro_p":0.9870716332,
844
+ "morph_micro_r":0.9558096483,
845
+ "morph_micro_f":0.9683797083,
846
+ "morph_per_feat":{
847
+ "AdpType":{
848
+ "p":0.9954051796,
849
+ "r":0.9941593659,
850
+ "f":0.9947818827
851
  },
852
+ "Case":{
853
+ "p":0.9873727088,
854
+ "r":0.9820391627,
855
+ "f":0.9846987136
856
  },
857
+ "Variant":{
858
+ "p":0.976744186,
859
+ "r":0.9130434783,
860
+ "f":0.9438202247
861
  },
862
+ "Gender":{
863
+ "p":0.9821478774,
864
+ "r":0.9776129845,
865
+ "f":0.9798751841
866
  },
867
+ "Number":{
868
+ "p":0.9810964083,
869
+ "r":0.9438508752,
870
+ "f":0.9621133125
871
  },
872
+ "PronType":{
873
+ "p":0.9902862986,
874
+ "r":0.9872579001,
875
+ "f":0.9887697805
876
  },
877
+ "Definite":{
878
+ "p":0.9788447388,
879
+ "r":0.9734723747,
880
+ "f":0.9761511649
881
  },
882
+ "Degree":{
883
+ "p":0.9568913175,
884
+ "r":0.9347568209,
885
+ "f":0.9456945695
886
  },
887
+ "Polarity":{
888
+ "p":0.9884318766,
889
+ "r":0.9858974359,
890
+ "f":0.9871630295
891
  },
892
+ "Mood":{
893
+ "p":0.9740072202,
894
+ "r":0.9677187948,
895
+ "f":0.9708528248
896
+ },
897
+ "Person":{
898
+ "p":0.9764359352,
899
+ "r":0.9696526508,
900
+ "f":0.9730324711
901
+ },
902
+ "Tense":{
903
+ "p":0.9707207207,
904
+ "r":0.9563609467,
905
+ "f":0.9634873323
906
+ },
907
+ "VerbForm":{
908
+ "p":0.9714013346,
909
+ "r":0.9622285175,
910
+ "f":0.9667931689
911
+ },
912
+ "NumForm":{
913
+ "p":0.9758064516,
914
+ "r":0.2929782082,
915
+ "f":0.4506517691
916
+ },
917
+ "NumType":{
918
+ "p":0.9846153846,
919
+ "r":0.3054892601,
920
+ "f":0.4663023679
921
+ },
922
+ "PartType":{
923
+ "p":0.9473684211,
924
+ "r":0.9230769231,
925
+ "f":0.9350649351
926
+ },
927
+ "Strength":{
928
+ "p":0.9914675768,
929
+ "r":0.97319933,
930
+ "f":0.9822485207
931
+ },
932
+ "Reflex":{
933
+ "p":0.9938461538,
934
+ "r":0.9877675841,
935
+ "f":0.990797546
936
+ },
937
+ "Poss":{
938
+ "p":0.986013986,
939
+ "r":0.986013986,
940
+ "f":0.986013986
941
+ },
942
+ "Position":{
943
+ "p":0.986013986,
944
+ "r":0.9724137931,
945
+ "f":0.9791666667
946
+ },
947
+ "Number[psor]":{
948
+ "p":0.9420289855,
949
+ "r":0.9558823529,
950
+ "f":0.9489051095
951
+ },
952
+ "Foreign":{
953
+ "p":0.0,
954
+ "r":0.0,
955
+ "f":0.0
956
+ },
957
+ "Abbr":{
958
+ "p":0.9620253165,
959
+ "r":0.9156626506,
960
+ "f":0.9382716049
961
+ }
962
+ },
963
+ "lemma_acc":0.8183070924,
+ "ents_p":0.7485865058,
+ "ents_r":0.7629658087,
+ "ents_f":0.7557077626,
+ "ents_per_type":{
+ "DATETIME":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "PERSON":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "PRODUCT":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "LOC":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "GPE":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "ORDINAL":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "NUMERIC_VALUE":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "ORGANIZATION":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "NAT_REL_POL":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "WORK_OF_ART":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "EVENT":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "FACILITY":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "QUANTITY":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
+ },
+ "MONEY":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "LANGUAGE":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  },
  "PERIOD":{
+ "p":0.0,
+ "r":0.0,
+ "f":0.0
  }
+ },
+ "speed":8391.5537539766
  },
  "sources":[
  {
 
  "author":"Michal M\u011bchura"
  },
  {
+ "name":"UD Romanian RRT v2.8",
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
  "license":"CC BY-SA 4.0",
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
ner/model CHANGED
Binary files a/ner/model and b/ner/model differ
 
parser/model CHANGED
Binary files a/parser/model and b/parser/model differ
 
parser/moves CHANGED
@@ -1 +1 @@
- ��moves�{"0":{"":85972},"1":{"":90580},"2":{"case":22318,"punct":9077,"det":9009,"nsubj":7125,"advmod":6350,"cc":5364,"mark":5291,"aux":4018,"obl":2015,"nummod":1880,"expl:pv":1798,"cop":1706,"amod":1376,"aux:pass":1369,"nsubj:pass":963,"expl:pass":909,"parataxis":877,"obj":866,"advcl":710,"iobj":567,"expl:poss":464,"expl":390,"nmod":204,"nsubj||csubj":154,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":85,"nmod:pmod":74,"cc:preconj":63,"csubj":58,"nsubj:pass||csubj":57,"obj||ccomp":44,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14423,"amod":9673,"obl":7745,"conj":7281,"fixed":5595,"obj":5457,"acl":4102,"advmod":2145,"advcl":2043,"ccomp":1929,"nummod":1646,"nsubj":1278,"nmod:pmod":1208,"flat":1160,"det":1031,"appos":915,"xcomp":886,"iobj":804,"nmod:agent":718,"csubj":626,"nsubj:pass":546,"case":442,"parataxis":426,"nmod:tmod":286,"goeswith":245,"ccomp:pmod":174,"cc":124,"cop":100,"expl:pv":86,"expl":55,"advcl:tcl":52,"compound":50,"csubj:pass":49,"expl:poss":36,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
 
+ ��moves� {"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
ro_core_news_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fae7ea902dd93fd735d8eff776b80e36a68a76d2760760463d6e124b9c14a6a5
- size 45557410
+ oid sha256:76845c3f800eae6e4f52e3066ee5a28bf103bdbbe4db9b0649a72f4b8d897f98
+ size 46220322
senter/cfg CHANGED
@@ -1,3 +1,3 @@
  {
-
+ "overwrite":false
  }
senter/model CHANGED
Binary files a/senter/model and b/senter/model differ
 
tagger/cfg CHANGED
@@ -10,6 +10,7 @@
  "Afp",
  "Afp-p-n",
  "Afp-poy",
+ "Afp-srn",
  "Afpf--n",
  "Afpfp-n",
  "Afpfp-ny",
@@ -111,6 +112,7 @@
  "Ds2ms-s",
  "Ds3---p",
  "Ds3---s",
+ "Ds3---sy",
  "Ds3fp-s",
  "Ds3fsos",
  "Ds3fsrs",
@@ -139,18 +141,23 @@
  "LSQR",
  "LT",
  "M",
- "Mc",
  "Mc-p-d",
  "Mc-p-l",
+ "Mc-s-b",
+ "Mc-s-d",
+ "Mc-s-l",
  "Mcfp-l",
  "Mcfp-ln",
  "Mcfprln",
  "Mcfprly",
  "Mcfsoln",
+ "Mcfsrl",
  "Mcfsrln",
+ "Mcfsrly",
  "Mcmp-l",
  "Mcms-ln",
  "Mcmsrl",
+ "Mcmsrln",
  "Mcmsrly",
  "Mffprln",
  "Mffsrln",
@@ -223,7 +230,6 @@
  "Pd3mpr--y",
  "Pd3mso",
  "Pd3msr",
- "Pi3",
  "Pi3--r",
  "Pi3-po",
  "Pi3-so",
@@ -269,6 +275,7 @@
  "Pp3-po--------s",
  "Pp3-sd--------w",
  "Pp3-sd--y-----w",
+ "Pp3-so--------s",
  "Pp3fpa--------w",
  "Pp3fpa--y-----w",
  "Pp3fpr--------s",
@@ -295,7 +302,6 @@
  "Ps2fp-s",
  "Ps2fsrp",
  "Ps2fsrs",
- "Ps2ms-s",
  "Ps3---p",
  "Ps3---s",
  "Ps3fp-s",
@@ -328,7 +334,6 @@
  "RPAR",
  "RSQR",
  "Rc",
- "Rgc",
  "Rgp",
  "Rgpy",
  "Rgs",
@@ -386,6 +391,7 @@
  "Va--3s",
  "Va--3s----y",
  "Vag",
+ "Vag-------y",
  "Vaii1",
  "Vaii2s",
  "Vaii3p",
@@ -455,7 +461,7 @@
  "Vmp--sm",
  "Vmp--sm---y",
  "Vmsp1p",
- "Vmsp1s",
+ "Vmsp2p",
  "Vmsp2s",
  "Vmsp3",
  "Vmsp3-----y",
@@ -468,7 +474,9 @@
  "Ynmsoy",
  "Ynmsry",
  "Yp",
+ "Yp,Yn",
  "Yp-sr",
  "Yr"
- ]
+ ],
+ "overwrite":false
  }
tagger/model CHANGED
Binary files a/tagger/model and b/tagger/model differ
 
tok2vec/model CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
 
tokenizer CHANGED
@@ -1,3 +1,3 @@
- ��prefix_search�
  ��A�
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o
��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�
l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
 
+ ��prefix_search�
  ��A�
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o
��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�
l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0442198e6d05377364bc6e0ce4f78c69ae3b1d2ee6feb4c1265384ca182a1dbb
- size 8420995
+ oid sha256:4534edb1d1b8e8017538d692a57054e6179b5b351805c50502b2f0ef77b79ec7
+ size 10070837
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ {
+ "mode":"default"
+ }