Commit
·
5516623
1
Parent(s):
1b5232e
Update spaCy pipeline
Browse files- LICENSES_SOURCES +1 -1
- README.md +34 -28
- accuracy.json +315 -309
- attribute_ruler/patterns +0 -0
- config.cfg +29 -26
- meta.json +333 -320
- ner/model +0 -0
- parser/model +0 -0
- parser/moves +1 -1
- ro_core_news_md-any-py3-none-any.whl +2 -2
- senter/cfg +1 -1
- senter/model +0 -0
- tagger/cfg +14 -6
- tagger/model +0 -0
- tok2vec/model +0 -0
- tokenizer +2 -2
- vocab/strings.json +2 -2
- vocab/vectors.cfg +3 -0
LICENSES_SOURCES
CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
|
|
549 |
|
550 |
|
551 |
|
552 |
-
# UD Romanian RRT v2.
|
553 |
|
554 |
* Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
|
555 |
* URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
|
|
|
549 |
|
550 |
|
551 |
|
552 |
+
# UD Romanian RRT v2.8
|
553 |
|
554 |
* Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
|
555 |
* URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
|
README.md
CHANGED
@@ -4,7 +4,7 @@ tags:
|
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
-
license:
|
8 |
model-index:
|
9 |
- name: ro_core_news_md
|
10 |
results:
|
@@ -14,47 +14,47 @@ model-index:
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
-
value: 0.
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
-
value: 0.
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
-
value: 0.
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
-
value: 0.
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
-
value: 0.
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
-
value: 0.
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
-
value: 0.
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
-
value: 0.
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
-
value: 0.
|
58 |
---
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_md
|
60 |
|
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_md` |
|
66 |
-
| **Version** | `3.
|
67 |
-
| **spaCy** | `>=3.
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
|
71 |
-
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
|
74 |
|
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
-
<summary>View label scheme (
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
-
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-
|
84 |
-
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
-
| `
|
96 |
-
| `
|
97 |
-
| `
|
98 |
-
| `
|
99 |
-
| `
|
100 |
-
| `
|
101 |
-
| `
|
102 |
-
| `
|
103 |
-
| `
|
104 |
-
| `
|
105 |
-
| `
|
106 |
-
| `
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
+
license: cc-by-sa-4.0
|
8 |
model-index:
|
9 |
- name: ro_core_news_md
|
10 |
results:
|
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
+
value: 0.7485865058
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
+
value: 0.7629658087
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
+
value: 0.7557077626
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
+
value: 0.9619726156
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
+
value: 0.9626168224
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
+
value: 0.9587765957
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
+
value: 0.9606928714
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
+
value: 0.8893350063
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
+
value: 0.8893350063
|
58 |
---
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_md
|
60 |
|
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_md` |
|
66 |
+
| **Version** | `3.2.0` |
|
67 |
+
| **spaCy** | `>=3.2.0,<3.3.0` |
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
|
71 |
+
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
|
74 |
|
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
+
<summary>View label scheme (541 labels for 4 components)</summary>
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
+
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
|
84 |
+
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
+
| `TOKEN_P` | 99.67 |
|
96 |
+
| `TOKEN_R` | 99.57 |
|
97 |
+
| `TOKEN_F` | 99.59 |
|
98 |
+
| `TAG_ACC` | 96.20 |
|
99 |
+
| `SENTS_P` | 96.26 |
|
100 |
+
| `SENTS_R` | 95.88 |
|
101 |
+
| `SENTS_F` | 96.07 |
|
102 |
+
| `DEP_UAS` | 88.93 |
|
103 |
+
| `DEP_LAS` | 83.88 |
|
104 |
+
| `POS_ACC` | 93.82 |
|
105 |
+
| `MORPH_ACC` | 94.69 |
|
106 |
+
| `MORPH_MICRO_P` | 98.71 |
|
107 |
+
| `MORPH_MICRO_R` | 95.58 |
|
108 |
+
| `MORPH_MICRO_F` | 96.84 |
|
109 |
+
| `LEMMA_ACC` | 81.83 |
|
110 |
+
| `ENTS_P` | 74.86 |
|
111 |
+
| `ENTS_R` | 76.30 |
|
112 |
+
| `ENTS_F` | 75.57 |
|
accuracy.json
CHANGED
@@ -1,185 +1,64 @@
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
-
"
|
4 |
-
"
|
5 |
-
"
|
6 |
-
"
|
7 |
-
"
|
8 |
-
"
|
9 |
-
"
|
10 |
-
"
|
11 |
-
"
|
12 |
-
"sents_p": 0.9598393574,
|
13 |
-
"sents_r": 0.9534574468,
|
14 |
-
"sents_f": 0.9566377585,
|
15 |
-
"speed": 8493.8160932984,
|
16 |
-
"morph_per_feat": {
|
17 |
-
"AdpType": {
|
18 |
-
"p": 0.997492687,
|
19 |
-
"r": 0.9933416563,
|
20 |
-
"f": 0.995412844
|
21 |
-
},
|
22 |
-
"Case": {
|
23 |
-
"p": 0.9877617623,
|
24 |
-
"r": 0.9809588116,
|
25 |
-
"f": 0.9843485331
|
26 |
-
},
|
27 |
-
"Variant": {
|
28 |
-
"p": 0.9846153846,
|
29 |
-
"r": 0.9241877256,
|
30 |
-
"f": 0.9534450652
|
31 |
-
},
|
32 |
-
"Gender": {
|
33 |
-
"p": 0.9798818233,
|
34 |
-
"r": 0.9754901961,
|
35 |
-
"f": 0.977681078
|
36 |
-
},
|
37 |
-
"Number": {
|
38 |
-
"p": 0.9811536265,
|
39 |
-
"r": 0.9754712696,
|
40 |
-
"f": 0.9783041968
|
41 |
-
},
|
42 |
-
"PronType": {
|
43 |
-
"p": 0.9943589744,
|
44 |
-
"r": 0.9892857143,
|
45 |
-
"f": 0.9918158568
|
46 |
-
},
|
47 |
-
"Definite": {
|
48 |
-
"p": 0.9773605743,
|
49 |
-
"r": 0.9711046086,
|
50 |
-
"f": 0.9742225484
|
51 |
-
},
|
52 |
-
"Degree": {
|
53 |
-
"p": 0.9527845036,
|
54 |
-
"r": 0.9369047619,
|
55 |
-
"f": 0.9447779112
|
56 |
-
},
|
57 |
-
"Polarity": {
|
58 |
-
"p": 0.9884318766,
|
59 |
-
"r": 0.9846350832,
|
60 |
-
"f": 0.9865298268
|
61 |
-
},
|
62 |
-
"Mood": {
|
63 |
-
"p": 0.9760869565,
|
64 |
-
"r": 0.9621428571,
|
65 |
-
"f": 0.9690647482
|
66 |
-
},
|
67 |
-
"Person": {
|
68 |
-
"p": 0.9822419534,
|
69 |
-
"r": 0.9696859021,
|
70 |
-
"f": 0.9759235435
|
71 |
-
},
|
72 |
-
"Tense": {
|
73 |
-
"p": 0.9691497366,
|
74 |
-
"r": 0.9491525424,
|
75 |
-
"f": 0.9590469099
|
76 |
-
},
|
77 |
-
"VerbForm": {
|
78 |
-
"p": 0.9661582459,
|
79 |
-
"r": 0.9579395085,
|
80 |
-
"f": 0.9620313242
|
81 |
-
},
|
82 |
-
"NumForm": {
|
83 |
-
"p": 0.9926650367,
|
84 |
-
"r": 0.9902439024,
|
85 |
-
"f": 0.9914529915
|
86 |
-
},
|
87 |
-
"NumType": {
|
88 |
-
"p": 0.9951807229,
|
89 |
-
"r": 0.9904076739,
|
90 |
-
"f": 0.9927884615
|
91 |
-
},
|
92 |
-
"PartType": {
|
93 |
-
"p": 0.9473684211,
|
94 |
-
"r": 0.9,
|
95 |
-
"f": 0.9230769231
|
96 |
-
},
|
97 |
-
"Strength": {
|
98 |
-
"p": 0.9931623932,
|
99 |
-
"r": 0.9781144781,
|
100 |
-
"f": 0.9855810008
|
101 |
-
},
|
102 |
-
"Reflex": {
|
103 |
-
"p": 0.9969135802,
|
104 |
-
"r": 0.990797546,
|
105 |
-
"f": 0.9938461538
|
106 |
-
},
|
107 |
-
"Poss": {
|
108 |
-
"p": 0.9826989619,
|
109 |
-
"r": 0.993006993,
|
110 |
-
"f": 0.987826087
|
111 |
-
},
|
112 |
-
"Position": {
|
113 |
-
"p": 0.9791666667,
|
114 |
-
"r": 0.9724137931,
|
115 |
-
"f": 0.9757785467
|
116 |
-
},
|
117 |
-
"Number[psor]": {
|
118 |
-
"p": 0.9436619718,
|
119 |
-
"r": 0.9710144928,
|
120 |
-
"f": 0.9571428571
|
121 |
-
},
|
122 |
-
"Abbr": {
|
123 |
-
"p": 0.9625,
|
124 |
-
"r": 0.9058823529,
|
125 |
-
"f": 0.9333333333
|
126 |
-
},
|
127 |
-
"Foreign": {
|
128 |
-
"p": 0.0,
|
129 |
-
"r": 0.0,
|
130 |
-
"f": 0.0
|
131 |
-
}
|
132 |
-
},
|
133 |
"dep_las_per_type": {
|
134 |
"case": {
|
135 |
-
"p": 0.
|
136 |
-
"r": 0.
|
137 |
-
"f": 0.
|
138 |
},
|
139 |
"det": {
|
140 |
-
"p": 0.
|
141 |
-
"r": 0.
|
142 |
-
"f": 0.
|
143 |
},
|
144 |
"nmod:tmod": {
|
145 |
-
"p": 0.
|
146 |
-
"r": 0.
|
147 |
-
"f": 0.
|
148 |
},
|
149 |
"amod": {
|
150 |
-
"p": 0.
|
151 |
-
"r": 0.
|
152 |
-
"f": 0.
|
153 |
},
|
154 |
"cc": {
|
155 |
-
"p": 0.
|
156 |
-
"r": 0.
|
157 |
-
"f": 0.
|
158 |
},
|
159 |
"conj": {
|
160 |
-
"p": 0.
|
161 |
-
"r": 0.
|
162 |
-
"f": 0.
|
163 |
},
|
164 |
"nmod": {
|
165 |
-
"p": 0.
|
166 |
-
"r": 0.
|
167 |
-
"f": 0.
|
168 |
},
|
169 |
"mark": {
|
170 |
-
"p": 0.
|
171 |
-
"r": 0.
|
172 |
-
"f": 0.
|
173 |
},
|
174 |
"fixed": {
|
175 |
-
"p": 0.
|
176 |
-
"r": 0.
|
177 |
-
"f": 0.
|
178 |
},
|
179 |
"nsubj": {
|
180 |
-
"p": 0.
|
181 |
-
"r": 0.
|
182 |
-
"f": 0.
|
183 |
},
|
184 |
"advcl:tcl": {
|
185 |
"p": 0.0,
|
@@ -187,84 +66,84 @@
|
|
187 |
"f": 0.0
|
188 |
},
|
189 |
"obj": {
|
190 |
-
"p": 0.
|
191 |
-
"r": 0.
|
192 |
-
"f": 0.
|
193 |
},
|
194 |
"nummod": {
|
195 |
-
"p": 0.
|
196 |
-
"r": 0.
|
197 |
-
"f": 0.
|
198 |
},
|
199 |
"flat": {
|
200 |
-
"p": 0.
|
201 |
-
"r": 0.
|
202 |
-
"f": 0.
|
203 |
},
|
204 |
"obl": {
|
205 |
-
"p": 0.
|
206 |
-
"r": 0.
|
207 |
-
"f": 0.
|
208 |
},
|
209 |
-
"
|
210 |
-
"p": 0.
|
211 |
-
"r": 0.
|
212 |
-
"f": 0.
|
213 |
},
|
214 |
"acl": {
|
215 |
-
"p": 0.
|
216 |
-
"r": 0.
|
217 |
-
"f": 0.
|
218 |
},
|
219 |
"advmod": {
|
220 |
-
"p": 0.
|
221 |
-
"r": 0.
|
222 |
-
"f": 0.
|
223 |
},
|
224 |
"expl:pv": {
|
225 |
-
"p": 0.
|
226 |
-
"r": 0.
|
227 |
-
"f": 0.
|
228 |
},
|
229 |
"root": {
|
230 |
-
"p": 0.
|
231 |
-
"r": 0.
|
232 |
-
"f": 0.
|
233 |
},
|
234 |
"advcl": {
|
235 |
-
"p": 0.
|
236 |
-
"r": 0.
|
237 |
-
"f": 0.
|
238 |
},
|
239 |
"iobj": {
|
240 |
-
"p": 0.
|
241 |
-
"r": 0.
|
242 |
-
"f": 0.
|
243 |
},
|
244 |
"ccomp": {
|
245 |
-
"p": 0.
|
246 |
-
"r": 0.
|
247 |
-
"f": 0.
|
248 |
},
|
249 |
"goeswith": {
|
250 |
-
"p": 0.
|
251 |
-
"r": 0.
|
252 |
-
"f": 0.
|
253 |
},
|
254 |
"parataxis": {
|
255 |
-
"p": 0.
|
256 |
-
"r": 0.
|
257 |
-
"f": 0.
|
258 |
},
|
259 |
"expl:poss": {
|
260 |
-
"p": 0.
|
261 |
-
"r": 0.
|
262 |
-
"f": 0.
|
263 |
},
|
264 |
"cop": {
|
265 |
-
"p": 0.
|
266 |
-
"r": 0.
|
267 |
-
"f": 0.
|
268 |
},
|
269 |
"cc:preconj": {
|
270 |
"p": 0.0,
|
@@ -272,54 +151,49 @@
|
|
272 |
"f": 0.0
|
273 |
},
|
274 |
"aux": {
|
275 |
-
"p": 0.
|
276 |
"r": 0.9122340426,
|
277 |
-
"f": 0.
|
278 |
},
|
279 |
"expl": {
|
280 |
-
"p": 0.
|
281 |
-
"r": 0.
|
282 |
-
"f": 0.
|
283 |
},
|
284 |
"appos": {
|
285 |
-
"p": 0.
|
286 |
-
"r": 0.
|
287 |
-
"f": 0.
|
288 |
},
|
289 |
"xcomp": {
|
290 |
-
"p": 0.
|
291 |
-
"r": 0.
|
292 |
-
"f": 0.
|
293 |
},
|
294 |
-
"
|
295 |
-
"p": 0.
|
296 |
-
"r": 0.
|
297 |
-
"f": 0.
|
298 |
},
|
299 |
"csubj": {
|
300 |
-
"p": 0.
|
301 |
-
"r": 0.
|
302 |
-
"f": 0.
|
303 |
},
|
304 |
-
"
|
305 |
-
"p": 0.
|
306 |
-
"r": 0.
|
307 |
-
"f": 0.
|
308 |
},
|
309 |
"aux:pass": {
|
310 |
-
"p": 0.
|
311 |
-
"r": 0.
|
312 |
-
"f": 0.
|
313 |
-
},
|
314 |
-
"nsubj:pass": {
|
315 |
-
"p": 0.6060606061,
|
316 |
-
"r": 0.6711409396,
|
317 |
-
"f": 0.6369426752
|
318 |
},
|
319 |
-
"
|
320 |
-
"p": 0.
|
321 |
-
"r": 0.
|
322 |
-
"f": 0.
|
323 |
},
|
324 |
"advmod:tmod": {
|
325 |
"p": 0.0,
|
@@ -331,10 +205,15 @@
|
|
331 |
"r": 0.6666666667,
|
332 |
"f": 0.5714285714
|
333 |
},
|
|
|
|
|
|
|
|
|
|
|
334 |
"expl:pass": {
|
335 |
-
"p": 0.
|
336 |
-
"r": 0.
|
337 |
-
"f": 0.
|
338 |
},
|
339 |
"orphan": {
|
340 |
"p": 0.0,
|
@@ -347,9 +226,9 @@
|
|
347 |
"f": 0.1666666667
|
348 |
},
|
349 |
"csubj:pass": {
|
350 |
-
"p": 0.
|
351 |
-
"r": 0.
|
352 |
-
"f": 0.
|
353 |
},
|
354 |
"vocative": {
|
355 |
"p": 0.0,
|
@@ -362,86 +241,213 @@
|
|
362 |
"f": 0.0
|
363 |
}
|
364 |
},
|
365 |
-
"
|
366 |
-
|
367 |
-
|
368 |
-
|
369 |
-
|
|
|
|
|
|
|
|
|
|
|
370 |
},
|
371 |
-
"
|
372 |
-
"p": 0.
|
373 |
-
"r": 0.
|
374 |
-
"f": 0.
|
375 |
},
|
376 |
-
"
|
377 |
-
"p": 0.
|
378 |
-
"r": 0.
|
379 |
-
"f": 0.
|
380 |
},
|
381 |
-
"
|
382 |
-
"p": 0.
|
383 |
-
"r": 0.
|
384 |
-
"f": 0.
|
385 |
},
|
386 |
-
"
|
387 |
-
"p": 0.
|
388 |
-
"r": 0.
|
389 |
-
"f": 0.
|
390 |
},
|
391 |
-
"
|
392 |
-
"p": 0.
|
393 |
-
"r": 0.
|
394 |
-
"f": 0.
|
395 |
},
|
396 |
-
"
|
397 |
-
"p": 0.
|
398 |
-
"r": 0.
|
399 |
-
"f": 0.
|
400 |
},
|
401 |
-
"
|
402 |
-
"p": 0.
|
403 |
-
"r": 0.
|
404 |
-
"f": 0.
|
405 |
},
|
406 |
-
"
|
407 |
-
"p": 0.
|
408 |
-
"r": 0.
|
409 |
-
"f": 0.
|
410 |
},
|
411 |
-
"
|
412 |
-
"p": 0.
|
413 |
-
"r": 0.
|
414 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
415 |
},
|
416 |
"PRODUCT": {
|
417 |
-
"p": 0.
|
418 |
-
"r": 0.
|
419 |
-
"f": 0.
|
420 |
},
|
421 |
"LOC": {
|
422 |
-
"p": 0.
|
423 |
-
"r": 0.
|
424 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
425 |
},
|
426 |
"WORK_OF_ART": {
|
427 |
-
"p": 0.
|
428 |
-
"r": 0.
|
429 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
430 |
},
|
431 |
"QUANTITY": {
|
432 |
-
"p": 0.
|
433 |
-
"r": 0.
|
434 |
-
"f": 0.
|
|
|
|
|
|
|
|
|
|
|
435 |
},
|
436 |
"LANGUAGE": {
|
437 |
-
"p": 0.
|
438 |
-
"r":
|
439 |
-
"f": 0.
|
440 |
},
|
441 |
"PERIOD": {
|
442 |
-
"p": 0.
|
443 |
-
"r": 0.
|
444 |
-
"f": 0.
|
445 |
}
|
446 |
-
}
|
|
|
447 |
}
|
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
+
"token_p": 0.9967350492,
|
4 |
+
"token_r": 0.9957244934,
|
5 |
+
"token_f": 0.9959492157,
|
6 |
+
"tag_acc": 0.9619726156,
|
7 |
+
"sents_p": 0.9626168224,
|
8 |
+
"sents_r": 0.9587765957,
|
9 |
+
"sents_f": 0.9606928714,
|
10 |
+
"dep_uas": 0.8893350063,
|
11 |
+
"dep_las": 0.8388068128,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12 |
"dep_las_per_type": {
|
13 |
"case": {
|
14 |
+
"p": 0.9337493999,
|
15 |
+
"r": 0.9492435334,
|
16 |
+
"f": 0.9414327202
|
17 |
},
|
18 |
"det": {
|
19 |
+
"p": 0.9484425349,
|
20 |
+
"r": 0.966083151,
|
21 |
+
"f": 0.9571815718
|
22 |
},
|
23 |
"nmod:tmod": {
|
24 |
+
"p": 0.6666666667,
|
25 |
+
"r": 0.0930232558,
|
26 |
+
"f": 0.1632653061
|
27 |
},
|
28 |
"amod": {
|
29 |
+
"p": 0.8737690242,
|
30 |
+
"r": 0.8864668483,
|
31 |
+
"f": 0.8800721371
|
32 |
},
|
33 |
"cc": {
|
34 |
+
"p": 0.877016129,
|
35 |
+
"r": 0.910041841,
|
36 |
+
"f": 0.8932238193
|
37 |
},
|
38 |
"conj": {
|
39 |
+
"p": 0.5879699248,
|
40 |
+
"r": 0.5915279879,
|
41 |
+
"f": 0.5897435897
|
42 |
},
|
43 |
"nmod": {
|
44 |
+
"p": 0.7885679164,
|
45 |
+
"r": 0.8099747475,
|
46 |
+
"f": 0.7991279975
|
47 |
},
|
48 |
"mark": {
|
49 |
+
"p": 0.9161147903,
|
50 |
+
"r": 0.9222222222,
|
51 |
+
"f": 0.919158361
|
52 |
},
|
53 |
"fixed": {
|
54 |
+
"p": 0.8559322034,
|
55 |
+
"r": 0.7163120567,
|
56 |
+
"f": 0.7799227799
|
57 |
},
|
58 |
"nsubj": {
|
59 |
+
"p": 0.8134920635,
|
60 |
+
"r": 0.7824427481,
|
61 |
+
"f": 0.7976653696
|
62 |
},
|
63 |
"advcl:tcl": {
|
64 |
"p": 0.0,
|
|
|
66 |
"f": 0.0
|
67 |
},
|
68 |
"obj": {
|
69 |
+
"p": 0.7793880837,
|
70 |
+
"r": 0.8273504274,
|
71 |
+
"f": 0.8026533997
|
72 |
},
|
73 |
"nummod": {
|
74 |
+
"p": 0.8892405063,
|
75 |
+
"r": 0.8619631902,
|
76 |
+
"f": 0.8753894081
|
77 |
},
|
78 |
"flat": {
|
79 |
+
"p": 0.7441860465,
|
80 |
+
"r": 0.6857142857,
|
81 |
+
"f": 0.7137546468
|
82 |
},
|
83 |
"obl": {
|
84 |
+
"p": 0.6402378593,
|
85 |
+
"r": 0.731596829,
|
86 |
+
"f": 0.6828752643
|
87 |
},
|
88 |
+
"obl:pmod": {
|
89 |
+
"p": 0.4375,
|
90 |
+
"r": 0.1615384615,
|
91 |
+
"f": 0.2359550562
|
92 |
},
|
93 |
"acl": {
|
94 |
+
"p": 0.7222222222,
|
95 |
+
"r": 0.7303370787,
|
96 |
+
"f": 0.7262569832
|
97 |
},
|
98 |
"advmod": {
|
99 |
+
"p": 0.8060686016,
|
100 |
+
"r": 0.7823303457,
|
101 |
+
"f": 0.7940220923
|
102 |
},
|
103 |
"expl:pv": {
|
104 |
+
"p": 0.7777777778,
|
105 |
+
"r": 0.8191489362,
|
106 |
+
"f": 0.7979274611
|
107 |
},
|
108 |
"root": {
|
109 |
+
"p": 0.9103078983,
|
110 |
+
"r": 0.9042553191,
|
111 |
+
"f": 0.9072715143
|
112 |
},
|
113 |
"advcl": {
|
114 |
+
"p": 0.5579710145,
|
115 |
+
"r": 0.6260162602,
|
116 |
+
"f": 0.5900383142
|
117 |
},
|
118 |
"iobj": {
|
119 |
+
"p": 0.7966101695,
|
120 |
+
"r": 0.6394557823,
|
121 |
+
"f": 0.7094339623
|
122 |
},
|
123 |
"ccomp": {
|
124 |
+
"p": 0.6995073892,
|
125 |
+
"r": 0.802259887,
|
126 |
+
"f": 0.7473684211
|
127 |
},
|
128 |
"goeswith": {
|
129 |
+
"p": 0.25,
|
130 |
+
"r": 0.1428571429,
|
131 |
+
"f": 0.1818181818
|
132 |
},
|
133 |
"parataxis": {
|
134 |
+
"p": 0.8494623656,
|
135 |
+
"r": 0.6030534351,
|
136 |
+
"f": 0.7053571429
|
137 |
},
|
138 |
"expl:poss": {
|
139 |
+
"p": 0.6086956522,
|
140 |
+
"r": 0.6511627907,
|
141 |
+
"f": 0.6292134831
|
142 |
},
|
143 |
"cop": {
|
144 |
+
"p": 0.75,
|
145 |
+
"r": 0.773006135,
|
146 |
+
"f": 0.7613293051
|
147 |
},
|
148 |
"cc:preconj": {
|
149 |
"p": 0.0,
|
|
|
151 |
"f": 0.0
|
152 |
},
|
153 |
"aux": {
|
154 |
+
"p": 0.9661971831,
|
155 |
"r": 0.9122340426,
|
156 |
+
"f": 0.9384404925
|
157 |
},
|
158 |
"expl": {
|
159 |
+
"p": 0.5714285714,
|
160 |
+
"r": 0.4761904762,
|
161 |
+
"f": 0.5194805195
|
162 |
},
|
163 |
"appos": {
|
164 |
+
"p": 0.4691358025,
|
165 |
+
"r": 0.3762376238,
|
166 |
+
"f": 0.4175824176
|
167 |
},
|
168 |
"xcomp": {
|
169 |
+
"p": 0.5538461538,
|
170 |
+
"r": 0.4337349398,
|
171 |
+
"f": 0.4864864865
|
172 |
},
|
173 |
+
"nsubj:pass": {
|
174 |
+
"p": 0.5878787879,
|
175 |
+
"r": 0.6381578947,
|
176 |
+
"f": 0.6119873817
|
177 |
},
|
178 |
"csubj": {
|
179 |
+
"p": 0.8448275862,
|
180 |
+
"r": 0.7777777778,
|
181 |
+
"f": 0.8099173554
|
182 |
},
|
183 |
+
"obl:agent": {
|
184 |
+
"p": 0.7538461538,
|
185 |
+
"r": 0.7538461538,
|
186 |
+
"f": 0.7538461538
|
187 |
},
|
188 |
"aux:pass": {
|
189 |
+
"p": 0.7428571429,
|
190 |
+
"r": 0.8666666667,
|
191 |
+
"f": 0.8
|
|
|
|
|
|
|
|
|
|
|
192 |
},
|
193 |
+
"dep": {
|
194 |
+
"p": 0.0,
|
195 |
+
"r": 0.0,
|
196 |
+
"f": 0.0
|
197 |
},
|
198 |
"advmod:tmod": {
|
199 |
"p": 0.0,
|
|
|
205 |
"r": 0.6666666667,
|
206 |
"f": 0.5714285714
|
207 |
},
|
208 |
+
"ccomp:pmod": {
|
209 |
+
"p": 0.5,
|
210 |
+
"r": 0.1875,
|
211 |
+
"f": 0.2727272727
|
212 |
+
},
|
213 |
"expl:pass": {
|
214 |
+
"p": 0.6808510638,
|
215 |
+
"r": 0.7032967033,
|
216 |
+
"f": 0.6918918919
|
217 |
},
|
218 |
"orphan": {
|
219 |
"p": 0.0,
|
|
|
226 |
"f": 0.1666666667
|
227 |
},
|
228 |
"csubj:pass": {
|
229 |
+
"p": 0.6666666667,
|
230 |
+
"r": 0.6666666667,
|
231 |
+
"f": 0.6666666667
|
232 |
},
|
233 |
"vocative": {
|
234 |
"p": 0.0,
|
|
|
241 |
"f": 0.0
|
242 |
}
|
243 |
},
|
244 |
+
"pos_acc": 0.9381923087,
|
245 |
+
"morph_acc": 0.9469023954,
|
246 |
+
"morph_micro_p": 0.9870716332,
|
247 |
+
"morph_micro_r": 0.9558096483,
|
248 |
+
"morph_micro_f": 0.9683797083,
|
249 |
+
"morph_per_feat": {
|
250 |
+
"AdpType": {
|
251 |
+
"p": 0.9954051796,
|
252 |
+
"r": 0.9941593659,
|
253 |
+
"f": 0.9947818827
|
254 |
},
|
255 |
+
"Case": {
|
256 |
+
"p": 0.9873727088,
|
257 |
+
"r": 0.9820391627,
|
258 |
+
"f": 0.9846987136
|
259 |
},
|
260 |
+
"Variant": {
|
261 |
+
"p": 0.976744186,
|
262 |
+
"r": 0.9130434783,
|
263 |
+
"f": 0.9438202247
|
264 |
},
|
265 |
+
"Gender": {
|
266 |
+
"p": 0.9821478774,
|
267 |
+
"r": 0.9776129845,
|
268 |
+
"f": 0.9798751841
|
269 |
},
|
270 |
+
"Number": {
|
271 |
+
"p": 0.9810964083,
|
272 |
+
"r": 0.9438508752,
|
273 |
+
"f": 0.9621133125
|
274 |
},
|
275 |
+
"PronType": {
|
276 |
+
"p": 0.9902862986,
|
277 |
+
"r": 0.9872579001,
|
278 |
+
"f": 0.9887697805
|
279 |
},
|
280 |
+
"Definite": {
|
281 |
+
"p": 0.9788447388,
|
282 |
+
"r": 0.9734723747,
|
283 |
+
"f": 0.9761511649
|
284 |
},
|
285 |
+
"Degree": {
|
286 |
+
"p": 0.9568913175,
|
287 |
+
"r": 0.9347568209,
|
288 |
+
"f": 0.9456945695
|
289 |
},
|
290 |
+
"Polarity": {
|
291 |
+
"p": 0.9884318766,
|
292 |
+
"r": 0.9858974359,
|
293 |
+
"f": 0.9871630295
|
294 |
},
|
295 |
+
"Mood": {
|
296 |
+
"p": 0.9740072202,
|
297 |
+
"r": 0.9677187948,
|
298 |
+
"f": 0.9708528248
|
299 |
+
},
|
300 |
+
"Person": {
|
301 |
+
"p": 0.9764359352,
|
302 |
+
"r": 0.9696526508,
|
303 |
+
"f": 0.9730324711
|
304 |
+
},
|
305 |
+
"Tense": {
|
306 |
+
"p": 0.9707207207,
|
307 |
+
"r": 0.9563609467,
|
308 |
+
"f": 0.9634873323
|
309 |
+
},
|
310 |
+
"VerbForm": {
|
311 |
+
"p": 0.9714013346,
|
312 |
+
"r": 0.9622285175,
|
313 |
+
"f": 0.9667931689
|
314 |
+
},
|
315 |
+
"NumForm": {
|
316 |
+
"p": 0.9758064516,
|
317 |
+
"r": 0.2929782082,
|
318 |
+
"f": 0.4506517691
|
319 |
+
},
|
320 |
+
"NumType": {
|
321 |
+
"p": 0.9846153846,
|
322 |
+
"r": 0.3054892601,
|
323 |
+
"f": 0.4663023679
|
324 |
+
},
|
325 |
+
"PartType": {
|
326 |
+
"p": 0.9473684211,
|
327 |
+
"r": 0.9230769231,
|
328 |
+
"f": 0.9350649351
|
329 |
+
},
|
330 |
+
"Strength": {
|
331 |
+
"p": 0.9914675768,
|
332 |
+
"r": 0.97319933,
|
333 |
+
"f": 0.9822485207
|
334 |
+
},
|
335 |
+
"Reflex": {
|
336 |
+
"p": 0.9938461538,
|
337 |
+
"r": 0.9877675841,
|
338 |
+
"f": 0.990797546
|
339 |
+
},
|
340 |
+
"Poss": {
|
341 |
+
"p": 0.986013986,
|
342 |
+
"r": 0.986013986,
|
343 |
+
"f": 0.986013986
|
344 |
+
},
|
345 |
+
"Position": {
|
346 |
+
"p": 0.986013986,
|
347 |
+
"r": 0.9724137931,
|
348 |
+
"f": 0.9791666667
|
349 |
+
},
|
350 |
+
"Number[psor]": {
|
351 |
+
"p": 0.9420289855,
|
352 |
+
"r": 0.9558823529,
|
353 |
+
"f": 0.9489051095
|
354 |
+
},
|
355 |
+
"Foreign": {
|
356 |
+
"p": 0.0,
|
357 |
+
"r": 0.0,
|
358 |
+
"f": 0.0
|
359 |
+
},
|
360 |
+
"Abbr": {
|
361 |
+
"p": 0.9620253165,
|
362 |
+
"r": 0.9156626506,
|
363 |
+
"f": 0.9382716049
|
364 |
+
}
|
365 |
+
},
|
366 |
+
"lemma_acc": 0.8183070924,
|
367 |
+
"ents_p": 0.7485865058,
|
368 |
+
"ents_r": 0.7629658087,
|
369 |
+
"ents_f": 0.7557077626,
|
370 |
+
"ents_per_type": {
|
371 |
+
"DATETIME": {
|
372 |
+
"p": 0.0,
|
373 |
+
"r": 0.0,
|
374 |
+
"f": 0.0
|
375 |
+
},
|
376 |
+
"PERSON": {
|
377 |
+
"p": 0.0,
|
378 |
+
"r": 0.0,
|
379 |
+
"f": 0.0
|
380 |
},
|
381 |
"PRODUCT": {
|
382 |
+
"p": 0.0,
|
383 |
+
"r": 0.0,
|
384 |
+
"f": 0.0
|
385 |
},
|
386 |
"LOC": {
|
387 |
+
"p": 0.0,
|
388 |
+
"r": 0.0,
|
389 |
+
"f": 0.0
|
390 |
+
},
|
391 |
+
"GPE": {
|
392 |
+
"p": 0.0,
|
393 |
+
"r": 0.0,
|
394 |
+
"f": 0.0
|
395 |
+
},
|
396 |
+
"ORDINAL": {
|
397 |
+
"p": 0.0,
|
398 |
+
"r": 0.0,
|
399 |
+
"f": 0.0
|
400 |
+
},
|
401 |
+
"NUMERIC_VALUE": {
|
402 |
+
"p": 0.0,
|
403 |
+
"r": 0.0,
|
404 |
+
"f": 0.0
|
405 |
+
},
|
406 |
+
"ORGANIZATION": {
|
407 |
+
"p": 0.0,
|
408 |
+
"r": 0.0,
|
409 |
+
"f": 0.0
|
410 |
+
},
|
411 |
+
"NAT_REL_POL": {
|
412 |
+
"p": 0.0,
|
413 |
+
"r": 0.0,
|
414 |
+
"f": 0.0
|
415 |
},
|
416 |
"WORK_OF_ART": {
|
417 |
+
"p": 0.0,
|
418 |
+
"r": 0.0,
|
419 |
+
"f": 0.0
|
420 |
+
},
|
421 |
+
"EVENT": {
|
422 |
+
"p": 0.0,
|
423 |
+
"r": 0.0,
|
424 |
+
"f": 0.0
|
425 |
+
},
|
426 |
+
"FACILITY": {
|
427 |
+
"p": 0.0,
|
428 |
+
"r": 0.0,
|
429 |
+
"f": 0.0
|
430 |
},
|
431 |
"QUANTITY": {
|
432 |
+
"p": 0.0,
|
433 |
+
"r": 0.0,
|
434 |
+
"f": 0.0
|
435 |
+
},
|
436 |
+
"MONEY": {
|
437 |
+
"p": 0.0,
|
438 |
+
"r": 0.0,
|
439 |
+
"f": 0.0
|
440 |
},
|
441 |
"LANGUAGE": {
|
442 |
+
"p": 0.0,
|
443 |
+
"r": 0.0,
|
444 |
+
"f": 0.0
|
445 |
},
|
446 |
"PERIOD": {
|
447 |
+
"p": 0.0,
|
448 |
+
"r": 0.0,
|
449 |
+
"f": 0.0
|
450 |
}
|
451 |
+
},
|
452 |
+
"speed": 8391.5537539766
|
453 |
}
|
attribute_ruler/patterns
CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
|
|
config.cfg
CHANGED
@@ -1,10 +1,8 @@
|
|
1 |
[paths]
|
2 |
-
train =
|
3 |
-
dev =
|
4 |
-
vectors =
|
5 |
-
raw = null
|
6 |
init_tok2vec = null
|
7 |
-
vocab_data = null
|
8 |
|
9 |
[system]
|
10 |
gpu_allocator = null
|
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
|
|
24 |
|
25 |
[components.attribute_ruler]
|
26 |
factory = "attribute_ruler"
|
|
|
27 |
validate = false
|
28 |
|
29 |
[components.lemmatizer]
|
@@ -31,11 +30,13 @@ factory = "lemmatizer"
|
|
31 |
mode = "lookup"
|
32 |
model = null
|
33 |
overwrite = false
|
|
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
|
|
39 |
update_with_oracle_cut_size = 100
|
40 |
|
41 |
[components.ner.model]
|
@@ -53,8 +54,8 @@ nO = null
|
|
53 |
[components.ner.model.tok2vec.embed]
|
54 |
@architectures = "spacy.MultiHashEmbed.v2"
|
55 |
width = 96
|
56 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
57 |
-
rows = [5000,2500,2500,2500]
|
58 |
include_static_vectors = true
|
59 |
|
60 |
[components.ner.model.tok2vec.encode]
|
@@ -69,6 +70,7 @@ factory = "parser"
|
|
69 |
learn_tokens = false
|
70 |
min_action_freq = 30
|
71 |
moves = null
|
|
|
72 |
update_with_oracle_cut_size = 100
|
73 |
|
74 |
[components.parser.model]
|
@@ -87,6 +89,8 @@ upstream = "tok2vec"
|
|
87 |
|
88 |
[components.senter]
|
89 |
factory = "senter"
|
|
|
|
|
90 |
|
91 |
[components.senter.model]
|
92 |
@architectures = "spacy.Tagger.v1"
|
@@ -98,8 +102,8 @@ nO = null
|
|
98 |
[components.senter.model.tok2vec.embed]
|
99 |
@architectures = "spacy.MultiHashEmbed.v2"
|
100 |
width = 16
|
101 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
102 |
-
rows = [1000,500,500,500]
|
103 |
include_static_vectors = true
|
104 |
|
105 |
[components.senter.model.tok2vec.encode]
|
@@ -111,6 +115,8 @@ maxout_pieces = 2
|
|
111 |
|
112 |
[components.tagger]
|
113 |
factory = "tagger"
|
|
|
|
|
114 |
|
115 |
[components.tagger.model]
|
116 |
@architectures = "spacy.Tagger.v1"
|
@@ -130,8 +136,8 @@ factory = "tok2vec"
|
|
130 |
[components.tok2vec.model.embed]
|
131 |
@architectures = "spacy.MultiHashEmbed.v2"
|
132 |
width = ${components.tok2vec.model.encode:width}
|
133 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
134 |
-
rows = [5000,2500,2500,2500]
|
135 |
include_static_vectors = true
|
136 |
|
137 |
[components.tok2vec.model.encode]
|
@@ -145,22 +151,19 @@ maxout_pieces = 3
|
|
145 |
|
146 |
[corpora.dev]
|
147 |
@readers = "spacy.Corpus.v1"
|
148 |
-
|
149 |
-
max_length = 0
|
150 |
-
path = ${paths:dev}
|
151 |
gold_preproc = false
|
|
|
|
|
152 |
augmenter = null
|
153 |
|
154 |
[corpora.train]
|
155 |
@readers = "spacy.Corpus.v1"
|
156 |
-
path = ${paths
|
157 |
-
max_length = 5000
|
158 |
gold_preproc = false
|
|
|
159 |
limit = 0
|
160 |
-
|
161 |
-
[corpora.train.augmenter]
|
162 |
-
@augmenters = "spacy.lower_case.v1"
|
163 |
-
level = 0.1
|
164 |
|
165 |
[training]
|
166 |
train_corpus = "corpora.train"
|
@@ -191,9 +194,8 @@ compound = 1.001
|
|
191 |
t = 0.0
|
192 |
|
193 |
[training.logger]
|
194 |
-
@loggers = "spacy.
|
195 |
-
|
196 |
-
remove_config_values = []
|
197 |
|
198 |
[training.optimizer]
|
199 |
@optimizers = "Adam.v1"
|
@@ -214,16 +216,17 @@ dep_las_per_type = null
|
|
214 |
sents_p = null
|
215 |
sents_r = null
|
216 |
sents_f = 0.02
|
217 |
-
lemma_acc = 0.
|
218 |
-
ents_f = 0.
|
219 |
ents_p = 0.0
|
220 |
ents_r = 0.0
|
221 |
ents_per_type = null
|
|
|
222 |
|
223 |
[pretraining]
|
224 |
|
225 |
[initialize]
|
226 |
-
vocab_data =
|
227 |
vectors = ${paths.vectors}
|
228 |
init_tok2vec = ${paths.init_tok2vec}
|
229 |
before_init = null
|
|
|
1 |
[paths]
|
2 |
+
train = null
|
3 |
+
dev = null
|
4 |
+
vectors = null
|
|
|
5 |
init_tok2vec = null
|
|
|
6 |
|
7 |
[system]
|
8 |
gpu_allocator = null
|
|
|
22 |
|
23 |
[components.attribute_ruler]
|
24 |
factory = "attribute_ruler"
|
25 |
+
scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
|
26 |
validate = false
|
27 |
|
28 |
[components.lemmatizer]
|
|
|
30 |
mode = "lookup"
|
31 |
model = null
|
32 |
overwrite = false
|
33 |
+
scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
39 |
+
scorer = {"@scorers":"spacy.ner_scorer.v1"}
|
40 |
update_with_oracle_cut_size = 100
|
41 |
|
42 |
[components.ner.model]
|
|
|
54 |
[components.ner.model.tok2vec.embed]
|
55 |
@architectures = "spacy.MultiHashEmbed.v2"
|
56 |
width = 96
|
57 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
58 |
+
rows = [5000,2500,2500,2500,100]
|
59 |
include_static_vectors = true
|
60 |
|
61 |
[components.ner.model.tok2vec.encode]
|
|
|
70 |
learn_tokens = false
|
71 |
min_action_freq = 30
|
72 |
moves = null
|
73 |
+
scorer = {"@scorers":"spacy.parser_scorer.v1"}
|
74 |
update_with_oracle_cut_size = 100
|
75 |
|
76 |
[components.parser.model]
|
|
|
89 |
|
90 |
[components.senter]
|
91 |
factory = "senter"
|
92 |
+
overwrite = false
|
93 |
+
scorer = {"@scorers":"spacy.senter_scorer.v1"}
|
94 |
|
95 |
[components.senter.model]
|
96 |
@architectures = "spacy.Tagger.v1"
|
|
|
102 |
[components.senter.model.tok2vec.embed]
|
103 |
@architectures = "spacy.MultiHashEmbed.v2"
|
104 |
width = 16
|
105 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
106 |
+
rows = [1000,500,500,500,50]
|
107 |
include_static_vectors = true
|
108 |
|
109 |
[components.senter.model.tok2vec.encode]
|
|
|
115 |
|
116 |
[components.tagger]
|
117 |
factory = "tagger"
|
118 |
+
overwrite = false
|
119 |
+
scorer = {"@scorers":"spacy.tagger_scorer.v1"}
|
120 |
|
121 |
[components.tagger.model]
|
122 |
@architectures = "spacy.Tagger.v1"
|
|
|
136 |
[components.tok2vec.model.embed]
|
137 |
@architectures = "spacy.MultiHashEmbed.v2"
|
138 |
width = ${components.tok2vec.model.encode:width}
|
139 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
140 |
+
rows = [5000,2500,2500,2500,100]
|
141 |
include_static_vectors = true
|
142 |
|
143 |
[components.tok2vec.model.encode]
|
|
|
151 |
|
152 |
[corpora.dev]
|
153 |
@readers = "spacy.Corpus.v1"
|
154 |
+
path = ${paths.dev}
|
|
|
|
|
155 |
gold_preproc = false
|
156 |
+
max_length = 0
|
157 |
+
limit = 0
|
158 |
augmenter = null
|
159 |
|
160 |
[corpora.train]
|
161 |
@readers = "spacy.Corpus.v1"
|
162 |
+
path = ${paths.train}
|
|
|
163 |
gold_preproc = false
|
164 |
+
max_length = 0
|
165 |
limit = 0
|
166 |
+
augmenter = null
|
|
|
|
|
|
|
167 |
|
168 |
[training]
|
169 |
train_corpus = "corpora.train"
|
|
|
194 |
t = 0.0
|
195 |
|
196 |
[training.logger]
|
197 |
+
@loggers = "spacy.ConsoleLogger.v1"
|
198 |
+
progress_bar = false
|
|
|
199 |
|
200 |
[training.optimizer]
|
201 |
@optimizers = "Adam.v1"
|
|
|
216 |
sents_p = null
|
217 |
sents_r = null
|
218 |
sents_f = 0.02
|
219 |
+
lemma_acc = 0.5
|
220 |
+
ents_f = 0.16
|
221 |
ents_p = 0.0
|
222 |
ents_r = 0.0
|
223 |
ents_per_type = null
|
224 |
+
speed = 0.0
|
225 |
|
226 |
[pretraining]
|
227 |
|
228 |
[initialize]
|
229 |
+
vocab_data = null
|
230 |
vectors = ${paths.vectors}
|
231 |
init_tok2vec = ${paths.init_tok2vec}
|
232 |
before_init = null
|
meta.json
CHANGED
@@ -1,14 +1,14 @@
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_md",
|
4 |
-
"version":"3.
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
-
"spacy_version":">=3.
|
11 |
-
"spacy_git_version":"
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":20000,
|
@@ -30,6 +30,7 @@
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
|
|
33 |
"Afpf--n",
|
34 |
"Afpfp-n",
|
35 |
"Afpfp-ny",
|
@@ -131,6 +132,7 @@
|
|
131 |
"Ds2ms-s",
|
132 |
"Ds3---p",
|
133 |
"Ds3---s",
|
|
|
134 |
"Ds3fp-s",
|
135 |
"Ds3fsos",
|
136 |
"Ds3fsrs",
|
@@ -159,18 +161,23 @@
|
|
159 |
"LSQR",
|
160 |
"LT",
|
161 |
"M",
|
162 |
-
"Mc",
|
163 |
"Mc-p-d",
|
164 |
"Mc-p-l",
|
|
|
|
|
|
|
165 |
"Mcfp-l",
|
166 |
"Mcfp-ln",
|
167 |
"Mcfprln",
|
168 |
"Mcfprly",
|
169 |
"Mcfsoln",
|
|
|
170 |
"Mcfsrln",
|
|
|
171 |
"Mcmp-l",
|
172 |
"Mcms-ln",
|
173 |
"Mcmsrl",
|
|
|
174 |
"Mcmsrly",
|
175 |
"Mffprln",
|
176 |
"Mffsrln",
|
@@ -243,7 +250,6 @@
|
|
243 |
"Pd3mpr--y",
|
244 |
"Pd3mso",
|
245 |
"Pd3msr",
|
246 |
-
"Pi3",
|
247 |
"Pi3--r",
|
248 |
"Pi3-po",
|
249 |
"Pi3-so",
|
@@ -289,6 +295,7 @@
|
|
289 |
"Pp3-po--------s",
|
290 |
"Pp3-sd--------w",
|
291 |
"Pp3-sd--y-----w",
|
|
|
292 |
"Pp3fpa--------w",
|
293 |
"Pp3fpa--y-----w",
|
294 |
"Pp3fpr--------s",
|
@@ -315,7 +322,6 @@
|
|
315 |
"Ps2fp-s",
|
316 |
"Ps2fsrp",
|
317 |
"Ps2fsrs",
|
318 |
-
"Ps2ms-s",
|
319 |
"Ps3---p",
|
320 |
"Ps3---s",
|
321 |
"Ps3fp-s",
|
@@ -348,7 +354,6 @@
|
|
348 |
"RPAR",
|
349 |
"RSQR",
|
350 |
"Rc",
|
351 |
-
"Rgc",
|
352 |
"Rgp",
|
353 |
"Rgpy",
|
354 |
"Rgs",
|
@@ -406,6 +411,7 @@
|
|
406 |
"Va--3s",
|
407 |
"Va--3s----y",
|
408 |
"Vag",
|
|
|
409 |
"Vaii1",
|
410 |
"Vaii2s",
|
411 |
"Vaii3p",
|
@@ -475,7 +481,7 @@
|
|
475 |
"Vmp--sm",
|
476 |
"Vmp--sm---y",
|
477 |
"Vmsp1p",
|
478 |
-
"
|
479 |
"Vmsp2s",
|
480 |
"Vmsp3",
|
481 |
"Vmsp3-----y",
|
@@ -488,6 +494,7 @@
|
|
488 |
"Ynmsoy",
|
489 |
"Ynmsry",
|
490 |
"Yp",
|
|
|
491 |
"Yp-sr",
|
492 |
"Yr"
|
493 |
],
|
@@ -525,14 +532,14 @@
|
|
525 |
"iobj",
|
526 |
"mark",
|
527 |
"nmod",
|
528 |
-
"nmod:agent",
|
529 |
-
"nmod:pmod",
|
530 |
"nmod:tmod",
|
531 |
"nsubj",
|
532 |
"nsubj:pass",
|
533 |
"nummod",
|
534 |
"obj",
|
535 |
"obl",
|
|
|
|
|
536 |
"orphan",
|
537 |
"parataxis",
|
538 |
"punct",
|
@@ -590,186 +597,65 @@
|
|
590 |
],
|
591 |
"performance":{
|
592 |
"token_acc":0.9990029326,
|
593 |
-
"
|
594 |
-
"
|
595 |
-
"
|
596 |
-
"
|
597 |
-
"
|
598 |
-
"
|
599 |
-
"
|
600 |
-
"
|
601 |
-
"
|
602 |
-
"sents_p":0.9598393574,
|
603 |
-
"sents_r":0.9534574468,
|
604 |
-
"sents_f":0.9566377585,
|
605 |
-
"speed":8493.8160932984,
|
606 |
-
"morph_per_feat":{
|
607 |
-
"AdpType":{
|
608 |
-
"p":0.997492687,
|
609 |
-
"r":0.9933416563,
|
610 |
-
"f":0.995412844
|
611 |
-
},
|
612 |
-
"Case":{
|
613 |
-
"p":0.9877617623,
|
614 |
-
"r":0.9809588116,
|
615 |
-
"f":0.9843485331
|
616 |
-
},
|
617 |
-
"Variant":{
|
618 |
-
"p":0.9846153846,
|
619 |
-
"r":0.9241877256,
|
620 |
-
"f":0.9534450652
|
621 |
-
},
|
622 |
-
"Gender":{
|
623 |
-
"p":0.9798818233,
|
624 |
-
"r":0.9754901961,
|
625 |
-
"f":0.977681078
|
626 |
-
},
|
627 |
-
"Number":{
|
628 |
-
"p":0.9811536265,
|
629 |
-
"r":0.9754712696,
|
630 |
-
"f":0.9783041968
|
631 |
-
},
|
632 |
-
"PronType":{
|
633 |
-
"p":0.9943589744,
|
634 |
-
"r":0.9892857143,
|
635 |
-
"f":0.9918158568
|
636 |
-
},
|
637 |
-
"Definite":{
|
638 |
-
"p":0.9773605743,
|
639 |
-
"r":0.9711046086,
|
640 |
-
"f":0.9742225484
|
641 |
-
},
|
642 |
-
"Degree":{
|
643 |
-
"p":0.9527845036,
|
644 |
-
"r":0.9369047619,
|
645 |
-
"f":0.9447779112
|
646 |
-
},
|
647 |
-
"Polarity":{
|
648 |
-
"p":0.9884318766,
|
649 |
-
"r":0.9846350832,
|
650 |
-
"f":0.9865298268
|
651 |
-
},
|
652 |
-
"Mood":{
|
653 |
-
"p":0.9760869565,
|
654 |
-
"r":0.9621428571,
|
655 |
-
"f":0.9690647482
|
656 |
-
},
|
657 |
-
"Person":{
|
658 |
-
"p":0.9822419534,
|
659 |
-
"r":0.9696859021,
|
660 |
-
"f":0.9759235435
|
661 |
-
},
|
662 |
-
"Tense":{
|
663 |
-
"p":0.9691497366,
|
664 |
-
"r":0.9491525424,
|
665 |
-
"f":0.9590469099
|
666 |
-
},
|
667 |
-
"VerbForm":{
|
668 |
-
"p":0.9661582459,
|
669 |
-
"r":0.9579395085,
|
670 |
-
"f":0.9620313242
|
671 |
-
},
|
672 |
-
"NumForm":{
|
673 |
-
"p":0.9926650367,
|
674 |
-
"r":0.9902439024,
|
675 |
-
"f":0.9914529915
|
676 |
-
},
|
677 |
-
"NumType":{
|
678 |
-
"p":0.9951807229,
|
679 |
-
"r":0.9904076739,
|
680 |
-
"f":0.9927884615
|
681 |
-
},
|
682 |
-
"PartType":{
|
683 |
-
"p":0.9473684211,
|
684 |
-
"r":0.9,
|
685 |
-
"f":0.9230769231
|
686 |
-
},
|
687 |
-
"Strength":{
|
688 |
-
"p":0.9931623932,
|
689 |
-
"r":0.9781144781,
|
690 |
-
"f":0.9855810008
|
691 |
-
},
|
692 |
-
"Reflex":{
|
693 |
-
"p":0.9969135802,
|
694 |
-
"r":0.990797546,
|
695 |
-
"f":0.9938461538
|
696 |
-
},
|
697 |
-
"Poss":{
|
698 |
-
"p":0.9826989619,
|
699 |
-
"r":0.993006993,
|
700 |
-
"f":0.987826087
|
701 |
-
},
|
702 |
-
"Position":{
|
703 |
-
"p":0.9791666667,
|
704 |
-
"r":0.9724137931,
|
705 |
-
"f":0.9757785467
|
706 |
-
},
|
707 |
-
"Number[psor]":{
|
708 |
-
"p":0.9436619718,
|
709 |
-
"r":0.9710144928,
|
710 |
-
"f":0.9571428571
|
711 |
-
},
|
712 |
-
"Abbr":{
|
713 |
-
"p":0.9625,
|
714 |
-
"r":0.9058823529,
|
715 |
-
"f":0.9333333333
|
716 |
-
},
|
717 |
-
"Foreign":{
|
718 |
-
"p":0.0,
|
719 |
-
"r":0.0,
|
720 |
-
"f":0.0
|
721 |
-
}
|
722 |
-
},
|
723 |
"dep_las_per_type":{
|
724 |
"case":{
|
725 |
-
"p":0.
|
726 |
-
"r":0.
|
727 |
-
"f":0.
|
728 |
},
|
729 |
"det":{
|
730 |
-
"p":0.
|
731 |
-
"r":0.
|
732 |
-
"f":0.
|
733 |
},
|
734 |
"nmod:tmod":{
|
735 |
-
"p":0.
|
736 |
-
"r":0.
|
737 |
-
"f":0.
|
738 |
},
|
739 |
"amod":{
|
740 |
-
"p":0.
|
741 |
-
"r":0.
|
742 |
-
"f":0.
|
743 |
},
|
744 |
"cc":{
|
745 |
-
"p":0.
|
746 |
-
"r":0.
|
747 |
-
"f":0.
|
748 |
},
|
749 |
"conj":{
|
750 |
-
"p":0.
|
751 |
-
"r":0.
|
752 |
-
"f":0.
|
753 |
},
|
754 |
"nmod":{
|
755 |
-
"p":0.
|
756 |
-
"r":0.
|
757 |
-
"f":0.
|
758 |
},
|
759 |
"mark":{
|
760 |
-
"p":0.
|
761 |
-
"r":0.
|
762 |
-
"f":0.
|
763 |
},
|
764 |
"fixed":{
|
765 |
-
"p":0.
|
766 |
-
"r":0.
|
767 |
-
"f":0.
|
768 |
},
|
769 |
"nsubj":{
|
770 |
-
"p":0.
|
771 |
-
"r":0.
|
772 |
-
"f":0.
|
773 |
},
|
774 |
"advcl:tcl":{
|
775 |
"p":0.0,
|
@@ -777,84 +663,84 @@
|
|
777 |
"f":0.0
|
778 |
},
|
779 |
"obj":{
|
780 |
-
"p":0.
|
781 |
-
"r":0.
|
782 |
-
"f":0.
|
783 |
},
|
784 |
"nummod":{
|
785 |
-
"p":0.
|
786 |
-
"r":0.
|
787 |
-
"f":0.
|
788 |
},
|
789 |
"flat":{
|
790 |
-
"p":0.
|
791 |
-
"r":0.
|
792 |
-
"f":0.
|
793 |
},
|
794 |
"obl":{
|
795 |
-
"p":0.
|
796 |
-
"r":0.
|
797 |
-
"f":0.
|
798 |
},
|
799 |
-
"
|
800 |
-
"p":0.
|
801 |
-
"r":0.
|
802 |
-
"f":0.
|
803 |
},
|
804 |
"acl":{
|
805 |
-
"p":0.
|
806 |
-
"r":0.
|
807 |
-
"f":0.
|
808 |
},
|
809 |
"advmod":{
|
810 |
-
"p":0.
|
811 |
-
"r":0.
|
812 |
-
"f":0.
|
813 |
},
|
814 |
"expl:pv":{
|
815 |
-
"p":0.
|
816 |
-
"r":0.
|
817 |
-
"f":0.
|
818 |
},
|
819 |
"root":{
|
820 |
-
"p":0.
|
821 |
-
"r":0.
|
822 |
-
"f":0.
|
823 |
},
|
824 |
"advcl":{
|
825 |
-
"p":0.
|
826 |
-
"r":0.
|
827 |
-
"f":0.
|
828 |
},
|
829 |
"iobj":{
|
830 |
-
"p":0.
|
831 |
-
"r":0.
|
832 |
-
"f":0.
|
833 |
},
|
834 |
"ccomp":{
|
835 |
-
"p":0.
|
836 |
-
"r":0.
|
837 |
-
"f":0.
|
838 |
},
|
839 |
"goeswith":{
|
840 |
-
"p":0.
|
841 |
-
"r":0.
|
842 |
-
"f":0.
|
843 |
},
|
844 |
"parataxis":{
|
845 |
-
"p":0.
|
846 |
-
"r":0.
|
847 |
-
"f":0.
|
848 |
},
|
849 |
"expl:poss":{
|
850 |
-
"p":0.
|
851 |
-
"r":0.
|
852 |
-
"f":0.
|
853 |
},
|
854 |
"cop":{
|
855 |
-
"p":0.
|
856 |
-
"r":0.
|
857 |
-
"f":0.
|
858 |
},
|
859 |
"cc:preconj":{
|
860 |
"p":0.0,
|
@@ -862,54 +748,49 @@
|
|
862 |
"f":0.0
|
863 |
},
|
864 |
"aux":{
|
865 |
-
"p":0.
|
866 |
"r":0.9122340426,
|
867 |
-
"f":0.
|
868 |
},
|
869 |
"expl":{
|
870 |
-
"p":0.
|
871 |
-
"r":0.
|
872 |
-
"f":0.
|
873 |
},
|
874 |
"appos":{
|
875 |
-
"p":0.
|
876 |
-
"r":0.
|
877 |
-
"f":0.
|
878 |
},
|
879 |
"xcomp":{
|
880 |
-
"p":0.
|
881 |
-
"r":0.
|
882 |
-
"f":0.
|
883 |
},
|
884 |
-
"
|
885 |
-
"p":0.
|
886 |
-
"r":0.
|
887 |
-
"f":0.
|
888 |
},
|
889 |
"csubj":{
|
890 |
-
"p":0.
|
891 |
-
"r":0.
|
892 |
-
"f":0.
|
893 |
},
|
894 |
-
"
|
895 |
-
"p":0.
|
896 |
-
"r":0.
|
897 |
-
"f":0.
|
898 |
},
|
899 |
"aux:pass":{
|
900 |
-
"p":0.
|
901 |
-
"r":0.
|
902 |
-
"f":0.
|
903 |
-
},
|
904 |
-
"nsubj:pass":{
|
905 |
-
"p":0.6060606061,
|
906 |
-
"r":0.6711409396,
|
907 |
-
"f":0.6369426752
|
908 |
},
|
909 |
-
"
|
910 |
-
"p":0.
|
911 |
-
"r":0.
|
912 |
-
"f":0.
|
913 |
},
|
914 |
"advmod:tmod":{
|
915 |
"p":0.0,
|
@@ -921,10 +802,15 @@
|
|
921 |
"r":0.6666666667,
|
922 |
"f":0.5714285714
|
923 |
},
|
|
|
|
|
|
|
|
|
|
|
924 |
"expl:pass":{
|
925 |
-
"p":0.
|
926 |
-
"r":0.
|
927 |
-
"f":0.
|
928 |
},
|
929 |
"orphan":{
|
930 |
"p":0.0,
|
@@ -937,9 +823,9 @@
|
|
937 |
"f":0.1666666667
|
938 |
},
|
939 |
"csubj:pass":{
|
940 |
-
"p":0.
|
941 |
-
"r":0.
|
942 |
-
"f":0.
|
943 |
},
|
944 |
"vocative":{
|
945 |
"p":0.0,
|
@@ -952,88 +838,215 @@
|
|
952 |
"f":0.0
|
953 |
}
|
954 |
},
|
955 |
-
"
|
956 |
-
|
957 |
-
|
958 |
-
|
959 |
-
|
|
|
|
|
|
|
|
|
|
|
960 |
},
|
961 |
-
"
|
962 |
-
"p":0.
|
963 |
-
"r":0.
|
964 |
-
"f":0.
|
965 |
},
|
966 |
-
"
|
967 |
-
"p":0.
|
968 |
-
"r":0.
|
969 |
-
"f":0.
|
970 |
},
|
971 |
-
"
|
972 |
-
"p":0.
|
973 |
-
"r":0.
|
974 |
-
"f":0.
|
975 |
},
|
976 |
-
"
|
977 |
-
"p":0.
|
978 |
-
"r":0.
|
979 |
-
"f":0.
|
980 |
},
|
981 |
-
"
|
982 |
-
"p":0.
|
983 |
-
"r":0.
|
984 |
-
"f":0.
|
985 |
},
|
986 |
-
"
|
987 |
-
"p":0.
|
988 |
-
"r":0.
|
989 |
-
"f":0.
|
990 |
},
|
991 |
-
"
|
992 |
-
"p":0.
|
993 |
-
"r":0.
|
994 |
-
"f":0.
|
995 |
},
|
996 |
-
"
|
997 |
-
"p":0.
|
998 |
-
"r":0.
|
999 |
-
"f":0.
|
1000 |
},
|
1001 |
-
"
|
1002 |
-
"p":0.
|
1003 |
-
"r":0.
|
1004 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1005 |
},
|
1006 |
"PRODUCT":{
|
1007 |
-
"p":0.
|
1008 |
-
"r":0.
|
1009 |
-
"f":0.
|
1010 |
},
|
1011 |
"LOC":{
|
1012 |
-
"p":0.
|
1013 |
-
"r":0.
|
1014 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1015 |
},
|
1016 |
"WORK_OF_ART":{
|
1017 |
-
"p":0.
|
1018 |
-
"r":0.
|
1019 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1020 |
},
|
1021 |
"QUANTITY":{
|
1022 |
-
"p":0.
|
1023 |
-
"r":0.
|
1024 |
-
"f":0.
|
|
|
|
|
|
|
|
|
|
|
1025 |
},
|
1026 |
"LANGUAGE":{
|
1027 |
-
"p":0.
|
1028 |
-
"r":
|
1029 |
-
"f":0.
|
1030 |
},
|
1031 |
"PERIOD":{
|
1032 |
-
"p":0.
|
1033 |
-
"r":0.
|
1034 |
-
"f":0.
|
1035 |
}
|
1036 |
-
}
|
|
|
1037 |
},
|
1038 |
"sources":[
|
1039 |
{
|
@@ -1043,7 +1056,7 @@
|
|
1043 |
"author":"Michal M\u011bchura"
|
1044 |
},
|
1045 |
{
|
1046 |
-
"name":"UD Romanian RRT v2.
|
1047 |
"url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
|
1048 |
"license":"CC BY-SA 4.0",
|
1049 |
"author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
|
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_md",
|
4 |
+
"version":"3.2.0",
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"[email protected]",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
+
"spacy_version":">=3.2.0,<3.3.0",
|
11 |
+
"spacy_git_version":"bb26550e2",
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":20000,
|
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
33 |
+
"Afp-srn",
|
34 |
"Afpf--n",
|
35 |
"Afpfp-n",
|
36 |
"Afpfp-ny",
|
|
|
132 |
"Ds2ms-s",
|
133 |
"Ds3---p",
|
134 |
"Ds3---s",
|
135 |
+
"Ds3---sy",
|
136 |
"Ds3fp-s",
|
137 |
"Ds3fsos",
|
138 |
"Ds3fsrs",
|
|
|
161 |
"LSQR",
|
162 |
"LT",
|
163 |
"M",
|
|
|
164 |
"Mc-p-d",
|
165 |
"Mc-p-l",
|
166 |
+
"Mc-s-b",
|
167 |
+
"Mc-s-d",
|
168 |
+
"Mc-s-l",
|
169 |
"Mcfp-l",
|
170 |
"Mcfp-ln",
|
171 |
"Mcfprln",
|
172 |
"Mcfprly",
|
173 |
"Mcfsoln",
|
174 |
+
"Mcfsrl",
|
175 |
"Mcfsrln",
|
176 |
+
"Mcfsrly",
|
177 |
"Mcmp-l",
|
178 |
"Mcms-ln",
|
179 |
"Mcmsrl",
|
180 |
+
"Mcmsrln",
|
181 |
"Mcmsrly",
|
182 |
"Mffprln",
|
183 |
"Mffsrln",
|
|
|
250 |
"Pd3mpr--y",
|
251 |
"Pd3mso",
|
252 |
"Pd3msr",
|
|
|
253 |
"Pi3--r",
|
254 |
"Pi3-po",
|
255 |
"Pi3-so",
|
|
|
295 |
"Pp3-po--------s",
|
296 |
"Pp3-sd--------w",
|
297 |
"Pp3-sd--y-----w",
|
298 |
+
"Pp3-so--------s",
|
299 |
"Pp3fpa--------w",
|
300 |
"Pp3fpa--y-----w",
|
301 |
"Pp3fpr--------s",
|
|
|
322 |
"Ps2fp-s",
|
323 |
"Ps2fsrp",
|
324 |
"Ps2fsrs",
|
|
|
325 |
"Ps3---p",
|
326 |
"Ps3---s",
|
327 |
"Ps3fp-s",
|
|
|
354 |
"RPAR",
|
355 |
"RSQR",
|
356 |
"Rc",
|
|
|
357 |
"Rgp",
|
358 |
"Rgpy",
|
359 |
"Rgs",
|
|
|
411 |
"Va--3s",
|
412 |
"Va--3s----y",
|
413 |
"Vag",
|
414 |
+
"Vag-------y",
|
415 |
"Vaii1",
|
416 |
"Vaii2s",
|
417 |
"Vaii3p",
|
|
|
481 |
"Vmp--sm",
|
482 |
"Vmp--sm---y",
|
483 |
"Vmsp1p",
|
484 |
+
"Vmsp2p",
|
485 |
"Vmsp2s",
|
486 |
"Vmsp3",
|
487 |
"Vmsp3-----y",
|
|
|
494 |
"Ynmsoy",
|
495 |
"Ynmsry",
|
496 |
"Yp",
|
497 |
+
"Yp,Yn",
|
498 |
"Yp-sr",
|
499 |
"Yr"
|
500 |
],
|
|
|
532 |
"iobj",
|
533 |
"mark",
|
534 |
"nmod",
|
|
|
|
|
535 |
"nmod:tmod",
|
536 |
"nsubj",
|
537 |
"nsubj:pass",
|
538 |
"nummod",
|
539 |
"obj",
|
540 |
"obl",
|
541 |
+
"obl:agent",
|
542 |
+
"obl:pmod",
|
543 |
"orphan",
|
544 |
"parataxis",
|
545 |
"punct",
|
|
|
597 |
],
|
598 |
"performance":{
|
599 |
"token_acc":0.9990029326,
|
600 |
+
"token_p":0.9967350492,
|
601 |
+
"token_r":0.9957244934,
|
602 |
+
"token_f":0.9959492157,
|
603 |
+
"tag_acc":0.9619726156,
|
604 |
+
"sents_p":0.9626168224,
|
605 |
+
"sents_r":0.9587765957,
|
606 |
+
"sents_f":0.9606928714,
|
607 |
+
"dep_uas":0.8893350063,
|
608 |
+
"dep_las":0.8388068128,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
609 |
"dep_las_per_type":{
|
610 |
"case":{
|
611 |
+
"p":0.9337493999,
|
612 |
+
"r":0.9492435334,
|
613 |
+
"f":0.9414327202
|
614 |
},
|
615 |
"det":{
|
616 |
+
"p":0.9484425349,
|
617 |
+
"r":0.966083151,
|
618 |
+
"f":0.9571815718
|
619 |
},
|
620 |
"nmod:tmod":{
|
621 |
+
"p":0.6666666667,
|
622 |
+
"r":0.0930232558,
|
623 |
+
"f":0.1632653061
|
624 |
},
|
625 |
"amod":{
|
626 |
+
"p":0.8737690242,
|
627 |
+
"r":0.8864668483,
|
628 |
+
"f":0.8800721371
|
629 |
},
|
630 |
"cc":{
|
631 |
+
"p":0.877016129,
|
632 |
+
"r":0.910041841,
|
633 |
+
"f":0.8932238193
|
634 |
},
|
635 |
"conj":{
|
636 |
+
"p":0.5879699248,
|
637 |
+
"r":0.5915279879,
|
638 |
+
"f":0.5897435897
|
639 |
},
|
640 |
"nmod":{
|
641 |
+
"p":0.7885679164,
|
642 |
+
"r":0.8099747475,
|
643 |
+
"f":0.7991279975
|
644 |
},
|
645 |
"mark":{
|
646 |
+
"p":0.9161147903,
|
647 |
+
"r":0.9222222222,
|
648 |
+
"f":0.919158361
|
649 |
},
|
650 |
"fixed":{
|
651 |
+
"p":0.8559322034,
|
652 |
+
"r":0.7163120567,
|
653 |
+
"f":0.7799227799
|
654 |
},
|
655 |
"nsubj":{
|
656 |
+
"p":0.8134920635,
|
657 |
+
"r":0.7824427481,
|
658 |
+
"f":0.7976653696
|
659 |
},
|
660 |
"advcl:tcl":{
|
661 |
"p":0.0,
|
|
|
663 |
"f":0.0
|
664 |
},
|
665 |
"obj":{
|
666 |
+
"p":0.7793880837,
|
667 |
+
"r":0.8273504274,
|
668 |
+
"f":0.8026533997
|
669 |
},
|
670 |
"nummod":{
|
671 |
+
"p":0.8892405063,
|
672 |
+
"r":0.8619631902,
|
673 |
+
"f":0.8753894081
|
674 |
},
|
675 |
"flat":{
|
676 |
+
"p":0.7441860465,
|
677 |
+
"r":0.6857142857,
|
678 |
+
"f":0.7137546468
|
679 |
},
|
680 |
"obl":{
|
681 |
+
"p":0.6402378593,
|
682 |
+
"r":0.731596829,
|
683 |
+
"f":0.6828752643
|
684 |
},
|
685 |
+
"obl:pmod":{
|
686 |
+
"p":0.4375,
|
687 |
+
"r":0.1615384615,
|
688 |
+
"f":0.2359550562
|
689 |
},
|
690 |
"acl":{
|
691 |
+
"p":0.7222222222,
|
692 |
+
"r":0.7303370787,
|
693 |
+
"f":0.7262569832
|
694 |
},
|
695 |
"advmod":{
|
696 |
+
"p":0.8060686016,
|
697 |
+
"r":0.7823303457,
|
698 |
+
"f":0.7940220923
|
699 |
},
|
700 |
"expl:pv":{
|
701 |
+
"p":0.7777777778,
|
702 |
+
"r":0.8191489362,
|
703 |
+
"f":0.7979274611
|
704 |
},
|
705 |
"root":{
|
706 |
+
"p":0.9103078983,
|
707 |
+
"r":0.9042553191,
|
708 |
+
"f":0.9072715143
|
709 |
},
|
710 |
"advcl":{
|
711 |
+
"p":0.5579710145,
|
712 |
+
"r":0.6260162602,
|
713 |
+
"f":0.5900383142
|
714 |
},
|
715 |
"iobj":{
|
716 |
+
"p":0.7966101695,
|
717 |
+
"r":0.6394557823,
|
718 |
+
"f":0.7094339623
|
719 |
},
|
720 |
"ccomp":{
|
721 |
+
"p":0.6995073892,
|
722 |
+
"r":0.802259887,
|
723 |
+
"f":0.7473684211
|
724 |
},
|
725 |
"goeswith":{
|
726 |
+
"p":0.25,
|
727 |
+
"r":0.1428571429,
|
728 |
+
"f":0.1818181818
|
729 |
},
|
730 |
"parataxis":{
|
731 |
+
"p":0.8494623656,
|
732 |
+
"r":0.6030534351,
|
733 |
+
"f":0.7053571429
|
734 |
},
|
735 |
"expl:poss":{
|
736 |
+
"p":0.6086956522,
|
737 |
+
"r":0.6511627907,
|
738 |
+
"f":0.6292134831
|
739 |
},
|
740 |
"cop":{
|
741 |
+
"p":0.75,
|
742 |
+
"r":0.773006135,
|
743 |
+
"f":0.7613293051
|
744 |
},
|
745 |
"cc:preconj":{
|
746 |
"p":0.0,
|
|
|
748 |
"f":0.0
|
749 |
},
|
750 |
"aux":{
|
751 |
+
"p":0.9661971831,
|
752 |
"r":0.9122340426,
|
753 |
+
"f":0.9384404925
|
754 |
},
|
755 |
"expl":{
|
756 |
+
"p":0.5714285714,
|
757 |
+
"r":0.4761904762,
|
758 |
+
"f":0.5194805195
|
759 |
},
|
760 |
"appos":{
|
761 |
+
"p":0.4691358025,
|
762 |
+
"r":0.3762376238,
|
763 |
+
"f":0.4175824176
|
764 |
},
|
765 |
"xcomp":{
|
766 |
+
"p":0.5538461538,
|
767 |
+
"r":0.4337349398,
|
768 |
+
"f":0.4864864865
|
769 |
},
|
770 |
+
"nsubj:pass":{
|
771 |
+
"p":0.5878787879,
|
772 |
+
"r":0.6381578947,
|
773 |
+
"f":0.6119873817
|
774 |
},
|
775 |
"csubj":{
|
776 |
+
"p":0.8448275862,
|
777 |
+
"r":0.7777777778,
|
778 |
+
"f":0.8099173554
|
779 |
},
|
780 |
+
"obl:agent":{
|
781 |
+
"p":0.7538461538,
|
782 |
+
"r":0.7538461538,
|
783 |
+
"f":0.7538461538
|
784 |
},
|
785 |
"aux:pass":{
|
786 |
+
"p":0.7428571429,
|
787 |
+
"r":0.8666666667,
|
788 |
+
"f":0.8
|
|
|
|
|
|
|
|
|
|
|
789 |
},
|
790 |
+
"dep":{
|
791 |
+
"p":0.0,
|
792 |
+
"r":0.0,
|
793 |
+
"f":0.0
|
794 |
},
|
795 |
"advmod:tmod":{
|
796 |
"p":0.0,
|
|
|
802 |
"r":0.6666666667,
|
803 |
"f":0.5714285714
|
804 |
},
|
805 |
+
"ccomp:pmod":{
|
806 |
+
"p":0.5,
|
807 |
+
"r":0.1875,
|
808 |
+
"f":0.2727272727
|
809 |
+
},
|
810 |
"expl:pass":{
|
811 |
+
"p":0.6808510638,
|
812 |
+
"r":0.7032967033,
|
813 |
+
"f":0.6918918919
|
814 |
},
|
815 |
"orphan":{
|
816 |
"p":0.0,
|
|
|
823 |
"f":0.1666666667
|
824 |
},
|
825 |
"csubj:pass":{
|
826 |
+
"p":0.6666666667,
|
827 |
+
"r":0.6666666667,
|
828 |
+
"f":0.6666666667
|
829 |
},
|
830 |
"vocative":{
|
831 |
"p":0.0,
|
|
|
838 |
"f":0.0
|
839 |
}
|
840 |
},
|
841 |
+
"pos_acc":0.9381923087,
|
842 |
+
"morph_acc":0.9469023954,
|
843 |
+
"morph_micro_p":0.9870716332,
|
844 |
+
"morph_micro_r":0.9558096483,
|
845 |
+
"morph_micro_f":0.9683797083,
|
846 |
+
"morph_per_feat":{
|
847 |
+
"AdpType":{
|
848 |
+
"p":0.9954051796,
|
849 |
+
"r":0.9941593659,
|
850 |
+
"f":0.9947818827
|
851 |
},
|
852 |
+
"Case":{
|
853 |
+
"p":0.9873727088,
|
854 |
+
"r":0.9820391627,
|
855 |
+
"f":0.9846987136
|
856 |
},
|
857 |
+
"Variant":{
|
858 |
+
"p":0.976744186,
|
859 |
+
"r":0.9130434783,
|
860 |
+
"f":0.9438202247
|
861 |
},
|
862 |
+
"Gender":{
|
863 |
+
"p":0.9821478774,
|
864 |
+
"r":0.9776129845,
|
865 |
+
"f":0.9798751841
|
866 |
},
|
867 |
+
"Number":{
|
868 |
+
"p":0.9810964083,
|
869 |
+
"r":0.9438508752,
|
870 |
+
"f":0.9621133125
|
871 |
},
|
872 |
+
"PronType":{
|
873 |
+
"p":0.9902862986,
|
874 |
+
"r":0.9872579001,
|
875 |
+
"f":0.9887697805
|
876 |
},
|
877 |
+
"Definite":{
|
878 |
+
"p":0.9788447388,
|
879 |
+
"r":0.9734723747,
|
880 |
+
"f":0.9761511649
|
881 |
},
|
882 |
+
"Degree":{
|
883 |
+
"p":0.9568913175,
|
884 |
+
"r":0.9347568209,
|
885 |
+
"f":0.9456945695
|
886 |
},
|
887 |
+
"Polarity":{
|
888 |
+
"p":0.9884318766,
|
889 |
+
"r":0.9858974359,
|
890 |
+
"f":0.9871630295
|
891 |
},
|
892 |
+
"Mood":{
|
893 |
+
"p":0.9740072202,
|
894 |
+
"r":0.9677187948,
|
895 |
+
"f":0.9708528248
|
896 |
+
},
|
897 |
+
"Person":{
|
898 |
+
"p":0.9764359352,
|
899 |
+
"r":0.9696526508,
|
900 |
+
"f":0.9730324711
|
901 |
+
},
|
902 |
+
"Tense":{
|
903 |
+
"p":0.9707207207,
|
904 |
+
"r":0.9563609467,
|
905 |
+
"f":0.9634873323
|
906 |
+
},
|
907 |
+
"VerbForm":{
|
908 |
+
"p":0.9714013346,
|
909 |
+
"r":0.9622285175,
|
910 |
+
"f":0.9667931689
|
911 |
+
},
|
912 |
+
"NumForm":{
|
913 |
+
"p":0.9758064516,
|
914 |
+
"r":0.2929782082,
|
915 |
+
"f":0.4506517691
|
916 |
+
},
|
917 |
+
"NumType":{
|
918 |
+
"p":0.9846153846,
|
919 |
+
"r":0.3054892601,
|
920 |
+
"f":0.4663023679
|
921 |
+
},
|
922 |
+
"PartType":{
|
923 |
+
"p":0.9473684211,
|
924 |
+
"r":0.9230769231,
|
925 |
+
"f":0.9350649351
|
926 |
+
},
|
927 |
+
"Strength":{
|
928 |
+
"p":0.9914675768,
|
929 |
+
"r":0.97319933,
|
930 |
+
"f":0.9822485207
|
931 |
+
},
|
932 |
+
"Reflex":{
|
933 |
+
"p":0.9938461538,
|
934 |
+
"r":0.9877675841,
|
935 |
+
"f":0.990797546
|
936 |
+
},
|
937 |
+
"Poss":{
|
938 |
+
"p":0.986013986,
|
939 |
+
"r":0.986013986,
|
940 |
+
"f":0.986013986
|
941 |
+
},
|
942 |
+
"Position":{
|
943 |
+
"p":0.986013986,
|
944 |
+
"r":0.9724137931,
|
945 |
+
"f":0.9791666667
|
946 |
+
},
|
947 |
+
"Number[psor]":{
|
948 |
+
"p":0.9420289855,
|
949 |
+
"r":0.9558823529,
|
950 |
+
"f":0.9489051095
|
951 |
+
},
|
952 |
+
"Foreign":{
|
953 |
+
"p":0.0,
|
954 |
+
"r":0.0,
|
955 |
+
"f":0.0
|
956 |
+
},
|
957 |
+
"Abbr":{
|
958 |
+
"p":0.9620253165,
|
959 |
+
"r":0.9156626506,
|
960 |
+
"f":0.9382716049
|
961 |
+
}
|
962 |
+
},
|
963 |
+
"lemma_acc":0.8183070924,
|
964 |
+
"ents_p":0.7485865058,
|
965 |
+
"ents_r":0.7629658087,
|
966 |
+
"ents_f":0.7557077626,
|
967 |
+
"ents_per_type":{
|
968 |
+
"DATETIME":{
|
969 |
+
"p":0.0,
|
970 |
+
"r":0.0,
|
971 |
+
"f":0.0
|
972 |
+
},
|
973 |
+
"PERSON":{
|
974 |
+
"p":0.0,
|
975 |
+
"r":0.0,
|
976 |
+
"f":0.0
|
977 |
},
|
978 |
"PRODUCT":{
|
979 |
+
"p":0.0,
|
980 |
+
"r":0.0,
|
981 |
+
"f":0.0
|
982 |
},
|
983 |
"LOC":{
|
984 |
+
"p":0.0,
|
985 |
+
"r":0.0,
|
986 |
+
"f":0.0
|
987 |
+
},
|
988 |
+
"GPE":{
|
989 |
+
"p":0.0,
|
990 |
+
"r":0.0,
|
991 |
+
"f":0.0
|
992 |
+
},
|
993 |
+
"ORDINAL":{
|
994 |
+
"p":0.0,
|
995 |
+
"r":0.0,
|
996 |
+
"f":0.0
|
997 |
+
},
|
998 |
+
"NUMERIC_VALUE":{
|
999 |
+
"p":0.0,
|
1000 |
+
"r":0.0,
|
1001 |
+
"f":0.0
|
1002 |
+
},
|
1003 |
+
"ORGANIZATION":{
|
1004 |
+
"p":0.0,
|
1005 |
+
"r":0.0,
|
1006 |
+
"f":0.0
|
1007 |
+
},
|
1008 |
+
"NAT_REL_POL":{
|
1009 |
+
"p":0.0,
|
1010 |
+
"r":0.0,
|
1011 |
+
"f":0.0
|
1012 |
},
|
1013 |
"WORK_OF_ART":{
|
1014 |
+
"p":0.0,
|
1015 |
+
"r":0.0,
|
1016 |
+
"f":0.0
|
1017 |
+
},
|
1018 |
+
"EVENT":{
|
1019 |
+
"p":0.0,
|
1020 |
+
"r":0.0,
|
1021 |
+
"f":0.0
|
1022 |
+
},
|
1023 |
+
"FACILITY":{
|
1024 |
+
"p":0.0,
|
1025 |
+
"r":0.0,
|
1026 |
+
"f":0.0
|
1027 |
},
|
1028 |
"QUANTITY":{
|
1029 |
+
"p":0.0,
|
1030 |
+
"r":0.0,
|
1031 |
+
"f":0.0
|
1032 |
+
},
|
1033 |
+
"MONEY":{
|
1034 |
+
"p":0.0,
|
1035 |
+
"r":0.0,
|
1036 |
+
"f":0.0
|
1037 |
},
|
1038 |
"LANGUAGE":{
|
1039 |
+
"p":0.0,
|
1040 |
+
"r":0.0,
|
1041 |
+
"f":0.0
|
1042 |
},
|
1043 |
"PERIOD":{
|
1044 |
+
"p":0.0,
|
1045 |
+
"r":0.0,
|
1046 |
+
"f":0.0
|
1047 |
}
|
1048 |
+
},
|
1049 |
+
"speed":8391.5537539766
|
1050 |
},
|
1051 |
"sources":[
|
1052 |
{
|
|
|
1056 |
"author":"Michal M\u011bchura"
|
1057 |
},
|
1058 |
{
|
1059 |
+
"name":"UD Romanian RRT v2.8",
|
1060 |
"url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
|
1061 |
"license":"CC BY-SA 4.0",
|
1062 |
"author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
|
ner/model
CHANGED
Binary files a/ner/model and b/ner/model differ
|
|
parser/model
CHANGED
Binary files a/parser/model and b/parser/model differ
|
|
parser/moves
CHANGED
@@ -1 +1 @@
|
|
1 |
-
��moves
|
|
|
1 |
+
��moves�{"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
|
ro_core_news_md-any-py3-none-any.whl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76845c3f800eae6e4f52e3066ee5a28bf103bdbbe4db9b0649a72f4b8d897f98
|
3 |
+
size 46220322
|
senter/cfg
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
{
|
2 |
-
|
3 |
}
|
|
|
1 |
{
|
2 |
+
"overwrite":false
|
3 |
}
|
senter/model
CHANGED
Binary files a/senter/model and b/senter/model differ
|
|
tagger/cfg
CHANGED
@@ -10,6 +10,7 @@
|
|
10 |
"Afp",
|
11 |
"Afp-p-n",
|
12 |
"Afp-poy",
|
|
|
13 |
"Afpf--n",
|
14 |
"Afpfp-n",
|
15 |
"Afpfp-ny",
|
@@ -111,6 +112,7 @@
|
|
111 |
"Ds2ms-s",
|
112 |
"Ds3---p",
|
113 |
"Ds3---s",
|
|
|
114 |
"Ds3fp-s",
|
115 |
"Ds3fsos",
|
116 |
"Ds3fsrs",
|
@@ -139,18 +141,23 @@
|
|
139 |
"LSQR",
|
140 |
"LT",
|
141 |
"M",
|
142 |
-
"Mc",
|
143 |
"Mc-p-d",
|
144 |
"Mc-p-l",
|
|
|
|
|
|
|
145 |
"Mcfp-l",
|
146 |
"Mcfp-ln",
|
147 |
"Mcfprln",
|
148 |
"Mcfprly",
|
149 |
"Mcfsoln",
|
|
|
150 |
"Mcfsrln",
|
|
|
151 |
"Mcmp-l",
|
152 |
"Mcms-ln",
|
153 |
"Mcmsrl",
|
|
|
154 |
"Mcmsrly",
|
155 |
"Mffprln",
|
156 |
"Mffsrln",
|
@@ -223,7 +230,6 @@
|
|
223 |
"Pd3mpr--y",
|
224 |
"Pd3mso",
|
225 |
"Pd3msr",
|
226 |
-
"Pi3",
|
227 |
"Pi3--r",
|
228 |
"Pi3-po",
|
229 |
"Pi3-so",
|
@@ -269,6 +275,7 @@
|
|
269 |
"Pp3-po--------s",
|
270 |
"Pp3-sd--------w",
|
271 |
"Pp3-sd--y-----w",
|
|
|
272 |
"Pp3fpa--------w",
|
273 |
"Pp3fpa--y-----w",
|
274 |
"Pp3fpr--------s",
|
@@ -295,7 +302,6 @@
|
|
295 |
"Ps2fp-s",
|
296 |
"Ps2fsrp",
|
297 |
"Ps2fsrs",
|
298 |
-
"Ps2ms-s",
|
299 |
"Ps3---p",
|
300 |
"Ps3---s",
|
301 |
"Ps3fp-s",
|
@@ -328,7 +334,6 @@
|
|
328 |
"RPAR",
|
329 |
"RSQR",
|
330 |
"Rc",
|
331 |
-
"Rgc",
|
332 |
"Rgp",
|
333 |
"Rgpy",
|
334 |
"Rgs",
|
@@ -386,6 +391,7 @@
|
|
386 |
"Va--3s",
|
387 |
"Va--3s----y",
|
388 |
"Vag",
|
|
|
389 |
"Vaii1",
|
390 |
"Vaii2s",
|
391 |
"Vaii3p",
|
@@ -455,7 +461,7 @@
|
|
455 |
"Vmp--sm",
|
456 |
"Vmp--sm---y",
|
457 |
"Vmsp1p",
|
458 |
-
"
|
459 |
"Vmsp2s",
|
460 |
"Vmsp3",
|
461 |
"Vmsp3-----y",
|
@@ -468,7 +474,9 @@
|
|
468 |
"Ynmsoy",
|
469 |
"Ynmsry",
|
470 |
"Yp",
|
|
|
471 |
"Yp-sr",
|
472 |
"Yr"
|
473 |
-
]
|
|
|
474 |
}
|
|
|
10 |
"Afp",
|
11 |
"Afp-p-n",
|
12 |
"Afp-poy",
|
13 |
+
"Afp-srn",
|
14 |
"Afpf--n",
|
15 |
"Afpfp-n",
|
16 |
"Afpfp-ny",
|
|
|
112 |
"Ds2ms-s",
|
113 |
"Ds3---p",
|
114 |
"Ds3---s",
|
115 |
+
"Ds3---sy",
|
116 |
"Ds3fp-s",
|
117 |
"Ds3fsos",
|
118 |
"Ds3fsrs",
|
|
|
141 |
"LSQR",
|
142 |
"LT",
|
143 |
"M",
|
|
|
144 |
"Mc-p-d",
|
145 |
"Mc-p-l",
|
146 |
+
"Mc-s-b",
|
147 |
+
"Mc-s-d",
|
148 |
+
"Mc-s-l",
|
149 |
"Mcfp-l",
|
150 |
"Mcfp-ln",
|
151 |
"Mcfprln",
|
152 |
"Mcfprly",
|
153 |
"Mcfsoln",
|
154 |
+
"Mcfsrl",
|
155 |
"Mcfsrln",
|
156 |
+
"Mcfsrly",
|
157 |
"Mcmp-l",
|
158 |
"Mcms-ln",
|
159 |
"Mcmsrl",
|
160 |
+
"Mcmsrln",
|
161 |
"Mcmsrly",
|
162 |
"Mffprln",
|
163 |
"Mffsrln",
|
|
|
230 |
"Pd3mpr--y",
|
231 |
"Pd3mso",
|
232 |
"Pd3msr",
|
|
|
233 |
"Pi3--r",
|
234 |
"Pi3-po",
|
235 |
"Pi3-so",
|
|
|
275 |
"Pp3-po--------s",
|
276 |
"Pp3-sd--------w",
|
277 |
"Pp3-sd--y-----w",
|
278 |
+
"Pp3-so--------s",
|
279 |
"Pp3fpa--------w",
|
280 |
"Pp3fpa--y-----w",
|
281 |
"Pp3fpr--------s",
|
|
|
302 |
"Ps2fp-s",
|
303 |
"Ps2fsrp",
|
304 |
"Ps2fsrs",
|
|
|
305 |
"Ps3---p",
|
306 |
"Ps3---s",
|
307 |
"Ps3fp-s",
|
|
|
334 |
"RPAR",
|
335 |
"RSQR",
|
336 |
"Rc",
|
|
|
337 |
"Rgp",
|
338 |
"Rgpy",
|
339 |
"Rgs",
|
|
|
391 |
"Va--3s",
|
392 |
"Va--3s----y",
|
393 |
"Vag",
|
394 |
+
"Vag-------y",
|
395 |
"Vaii1",
|
396 |
"Vaii2s",
|
397 |
"Vaii3p",
|
|
|
461 |
"Vmp--sm",
|
462 |
"Vmp--sm---y",
|
463 |
"Vmsp1p",
|
464 |
+
"Vmsp2p",
|
465 |
"Vmsp2s",
|
466 |
"Vmsp3",
|
467 |
"Vmsp3-----y",
|
|
|
474 |
"Ynmsoy",
|
475 |
"Ynmsry",
|
476 |
"Yp",
|
477 |
+
"Yp,Yn",
|
478 |
"Yp-sr",
|
479 |
"Yr"
|
480 |
+
],
|
481 |
+
"overwrite":false
|
482 |
}
|
tagger/model
CHANGED
Binary files a/tagger/model and b/tagger/model differ
|
|
tok2vec/model
CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
|
|
tokenizer
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
-
��prefix_search�
|
2 |
��A�
|
3 |
-
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)
|
|
|
1 |
+
��prefix_search�
|
2 |
��A�
|
3 |
+
� ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
|
vocab/strings.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4534edb1d1b8e8017538d692a57054e6179b5b351805c50502b2f0ef77b79ec7
|
3 |
+
size 10070837
|
vocab/vectors.cfg
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"mode":"default"
|
3 |
+
}
|