Treebank Statistics: UD_Estonian-EDT: Features: NumForm
This feature is language-specific.
It occurs with 3 different values: Digit, Roman, Word.
11421 tokens (3%) have a non-empty value of NumForm.
2116 types (3%) occur at least once with a non-empty value of NumForm.
1781 lemmas (4%) occur at least once with a non-empty value of NumForm.
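The three counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `numform_coverage` and the inline `SAMPLE` are illustrative assumptions, and it assumes that a "type" is a case-sensitive word form.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|NumForm=Word' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def numform_coverage(conllu_text):
    """Count tokens, types (word forms) and lemmas that carry NumForm."""
    tokens = 0
    types, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        if "NumForm" in parse_feats(cols[5]):
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return {"tokens": tokens, "types": len(types), "lemmas": len(lemmas)}


# Hypothetical four-token sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = kaks ja 2 kahe",
    "1\tkaks\tkaks\tNUM\t_\tCase=Nom|NumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tja\tja\tCCONJ\t_\t_\t3\tcc\t_\t_",
    "3\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
    "4\tkahe\tkaks\tNUM\t_\tCase=Gen|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
])

coverage = numform_coverage(SAMPLE)
```

The percentages on this page would then follow by dividing each count by the corresponding total over the whole treebank.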
The feature is used with 4 part-of-speech tags: NUM (8907; 2% instances), ADJ (2484; 1% instances), PROPN (24; 0% instances), SYM (6; 0% instances).
NUM
8907 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (8246; 93%), Case=EMPTY (5516; 62%), Number=EMPTY (5516; 62%).
NUM tokens may have the following values of NumForm:
Digit (5548; 62% of non-empty NumForm): 1, 2, 10, 3, 4, 5, 15, 20, 6, 12
Roman (3; 0% of non-empty NumForm): I, IX, XII
Word (3356; 38% of non-empty NumForm): kaks, üks, kolm, kahe, ühe, miljonit, viis, miljoni, neli, kolme
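A per-value breakdown like the one above (count plus most frequent word forms) could be computed roughly as follows. The function name `numform_values`, its `top_n` parameter and the inline `SAMPLE` are assumptions made for illustration; the sketch reads plain CoNLL-U columns and does not depend on any UD tooling.

```python
from collections import Counter, defaultdict


def numform_values(conllu_text, upos, top_n=10):
    """Map each NumForm value to (token count, most frequent forms) for one UPOS."""
    forms_by_value = defaultdict(Counter)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments, blank lines, ranges and empty nodes have no plain integer ID.
        if not cols[0].isdigit() or cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            if pair.startswith("NumForm="):
                value = pair.split("=", 1)[1]
                forms_by_value[value][cols[1]] += 1
    return {
        value: (sum(forms.values()), [f for f, _ in forms.most_common(top_n)])
        for value, forms in sorted(forms_by_value.items())
    }


# Hypothetical sample, not taken from the treebank.
SAMPLE = "\n".join([
    "1\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t0\troot\t_\t_",
    "2\tkaks\tkaks\tNUM\t_\tCase=Nom|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "3\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
    "4\tesimene\tesimene\tADJ\t_\tCase=Nom|NumForm=Word|NumType=Ord\t1\tconj\t_\t_",
])

num_values = numform_values(SAMPLE, "NUM")
```

Dividing each count by the sum of all counts gives the "% of non-empty NumForm" figures.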
NumForm seems to be a lexical feature of NUM. 100% of lemmas (1436) occur with only one value of NumForm.
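The "lexical feature" observation rests on how many lemmas are seen with exactly one NumForm value. A minimal sketch of that check is given below; `single_value_lemma_share` and `SAMPLE` are hypothetical names, and the sketch assumes that only tokens with a non-empty NumForm are considered.

```python
from collections import defaultdict


def single_value_lemma_share(conllu_text, upos):
    """Return (number of lemmas, share of lemmas seen with exactly one NumForm value)."""
    values_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments, blank lines, ranges and empty nodes have no plain integer ID.
        if not cols[0].isdigit() or cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            if pair.startswith("NumForm="):
                values_by_lemma[cols[2]].add(pair.split("=", 1)[1])
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return len(values_by_lemma), single / len(values_by_lemma)


# Hypothetical sample, not taken from the treebank.
SAMPLE = "\n".join([
    "1\tkaks\tkaks\tNUM\t_\tCase=Nom|NumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tkahe\tkaks\tNUM\t_\tCase=Gen|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "3\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
])

lemma_count, share = single_value_lemma_share(SAMPLE, "NUM")
```

A share of 1.0 corresponds to the "100% of lemmas" statement: no lemma alternates between Digit, Roman and Word.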
ADJ
2484 ADJ tokens (7% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Tense=EMPTY (2484; 100%), VerbForm=EMPTY (2484; 100%), Voice=EMPTY (2484; 100%), Degree=EMPTY (2474; 100%), Case=EMPTY (1531; 62%), Number=EMPTY (1531; 62%).
ADJ tokens may have the following values of NumForm:
Digit (1457; 59% of non-empty NumForm): 1., 2000., 2., 1997., 1999., 3., 1996., 1998., 1992., 1995.
Roman (114; 5% of non-empty NumForm): II, I, III, XI, VII, XX, VI, XII, IV, MDCXXXII
Word (913; 37% of non-empty NumForm): esimene, esimest, esimese, teine, teise, esimesel, esimesed, esimeses, teisel, kolmas
NumForm seems to be a lexical feature of ADJ. 100% of lemmas (358) occur with only one value of NumForm.
PROPN
24 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which PROPN and NumForm co-occurred: Number=Sing (21; 88%).
PROPN tokens may have the following values of NumForm:
Digit (1; 4% of non-empty NumForm): 8
Roman (4; 17% of non-empty NumForm): ADV, CX, M, XM
Word (19; 79% of non-empty NumForm): Teist, Teise, Kolmanda, Esimene, Esimese, Kolme, Neljanda, Neljandal, Teisel
NumForm seems to be a lexical feature of PROPN. 100% of lemmas (10) occur with only one value of NumForm.
SYM
6 SYM tokens (1% of all SYM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which SYM and NumForm co-occurred: Abbr=EMPTY (6; 100%).
SYM tokens may have the following values of NumForm:
Digit (6; 100% of non-empty NumForm): %
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (374; 98%),
NUM –[flat]–> NUM (96; 94%),
ADJ –[conj]–> ADJ (58; 83%),
NUM –[nummod]–> NUM (41; 85%),
NUM –[parataxis]–> NUM (7; 100%),
NUM –[orphan]–> NUM (6; 100%),
NUM –[obl]–> NUM (3; 100%),
ADJ –[compound]–> NUM (2; 100%),
ADJ –[flat]–> NUM (1; 100%),
ADJ –[orphan]–> ADJ (1; 100%).
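Agreement figures of this kind could be derived roughly as sketched below. The sketch assumes that each entry reports the number of agreeing parent-child pairs and their share among pairs in which both nodes carry NumForm; that reading of the percentages is an assumption, as are the names `numform_of`, `agreement_by_relation` and `SAMPLE`.

```python
from collections import Counter


def numform_of(feats):
    """Return the NumForm value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("NumForm="):
            return pair.split("=", 1)[1]
    return None


def agreement_by_relation(conllu_text):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, share of agreeing pairs)."""
    agree, both = Counter(), Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            # Comments, ranges and empty nodes have no plain integer ID.
            if not cols[0].isdigit():
                continue
            nodes[cols[0]] = (cols[3], numform_of(cols[5]), cols[6], cols[7])
        for upos, value, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (HEAD = 0)
            if value is None or parent is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if value == parent[1]:
                agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}


# Hypothetical two-sentence sample, not taken from the treebank.
SAMPLE = "\n".join([
    "1\t1\t1\tNUM\t_\tNumForm=Digit|NumType=Card\t0\troot\t_\t_",
    "2\tja\tja\tCCONJ\t_\t_\t3\tcc\t_\t_",
    "3\tkaks\tkaks\tNUM\t_\tCase=Nom|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "",
    "1\t2\t2\tNUM\t_\tNumForm=Digit|NumType=Card\t0\troot\t_\t_",
    "2\t3\t3\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
])

agreement = agreement_by_relation(SAMPLE)
```

In the sample, one of the two NUM –[conj]–> NUM pairs agrees (Digit with Digit) and the other does not (Digit with Word), so the relation comes out at 1 agreeing pair and a share of 0.5.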