home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

This is a layered feature with the following layers: Number, Number[psor].

242356 tokens (55%) have a non-empty value of Number. 75284 types (94%) occur at least once with a non-empty value of Number. 36166 lemmas (86%) occur at least once with a non-empty value of Number. The feature is used with 9 part-of-speech tags: NOUN (114210; 26% instances), ADJ (30477; 7% instances), VERB (25223; 6% instances), PROPN (24850; 6% instances), PRON (22800; 5% instances), AUX (14430; 3% instances), DET (6851; 2% instances), NUM (3457; 1% instances), SYM (58; 0% instances).

NOUN

114210 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm aastaSingPlur
Case=Ablaastalt
Case=Adeaastal, aastalgiaastatel, aastail
Case=Allaastale
Case=Comaastagaaastatega
Case=Elaaastastaastatest, aastaist
Case=Genaastaaastate
Case=Illaastasseaastatesse
Case=Ineaastasaastates
Case=Nomaastaaastad
Case=Paraastat, aastas, aastatkiaastaid
Case=Teraastaniaastateni
Case=Traaastaksaastateks

ADJ

30477 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Tense=EMPTY (26256; 86%), Voice=EMPTY (26230; 86%), VerbForm=EMPTY (26223; 86%), Degree=Pos (26015; 85%).

ADJ tokens may have the following values of Number:

Paradigm suurSingPlur
Case=Ablsuurelt
Case=Addsuurde
Case=Adesuurelsuurtel
Case=Allsuurelesuurtele
Case=Elasuurestsuurtest
Case=Gensuuresuurte
Case=Illsuurtesse
Case=Inesuuressuurtes
Case=Nomsuursuured
Case=Parsuurtsuuri
Case=Trasuurekssuurteks

VERB

25223 VERB tokens (53% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (25223; 100%), Voice=Act (25213; 100%), Mood=Ind (24426; 97%), Person=3 (21238; 84%), Tense=Pres (14037; 56%).

VERB tokens may have the following values of Number:

Paradigm saamaSingPlur
Mood=Cnd|Person=1|Tense=Pressaaksinsaaksime
Mood=Cnd|Person=2|Tense=Pressaaksite
Mood=Cnd|Person=3|Tense=Pressaaksid
Mood=Imp|Person=1|Tense=Pressaagem
Mood=Imp|Person=2|Tense=PresSaasaage
Mood=Imp|Person=3|Tense=Pressaagu
Mood=Ind|Person=1|Tense=Pastsain, sai, saingisaime, saimegi
Mood=Ind|Person=1|Tense=Pressaansaame, saamegi
Mood=Ind|Person=2|Tense=Pastsaidsaite
Mood=Ind|Person=2|Tense=Pressaadsaate
Mood=Ind|Person=3|Tense=Pastsai, saigisaid
Mood=Ind|Person=3|Tense=Pressaab, saabkisaavad

PROPN

24850 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm MaaSingPlur
Case=AblMaalt
Case=AdeMaal
Case=AllMaale
Case=ComMaaga
Case=ElaMaast
Case=GenMaa, MAA
Case=NomMaaMaad
Case=ParMaad

Number seems to be lexical feature of PROPN. 99% lemmas (6955) occur only with one value of Number.

PRON

22800 PRON tokens (100% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=EMPTY (13767; 60%), PronType=Prs (11712; 51%).

PRON tokens may have the following values of Number:

Paradigm temaSingPlur
Case=Abl|Person=3|PronType=Prstemalt, taltneilt
Case=Ade|Person=3|PronType=Prstal, temal, temalgineil, nendel
Case=All|Person=3|PronType=Prstalle, temaleneile, nendele, neilegi, nendelegi
Case=Com|Person=3|PronType=Prstemaganendega
Case=Ela|Person=3|PronType=Prstemast, tast, temastkineist, nendest, neistki
Case=Gen|Person=3|PronType=Prstema, ta, temaginende
Case=Gen|PronType=Demnende
Case=Ill|Person=3|PronType=Prstemasseneisse
Case=Ine|Person=3|PronType=Prstemasneis
Case=Nom|Person=3|PronType=Prsta, tema, temagi, taginad, nemad, nemadki
Case=Par|Person=3|PronType=Prsteda, tedagineid, neidki
Case=Par|PronType=Demneid
Case=Ter|Person=3|PronType=Prstemani
Case=Tra|Person=3|PronType=Prstemaksnendeks

AUX

14430 AUX tokens (65% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (14430; 100%), Voice=Act (14424; 100%), Polarity=EMPTY (14392; 100%), Mood=Ind (14140; 98%), Person=3 (13465; 93%), Tense=Pres (11364; 79%).

AUX tokens may have the following values of Number:

Paradigm olemaSingPlur
Mood=Cnd|Person=1|Tense=Presoleksinoleksime
Mood=Cnd|Person=2|Tense=Presoleksid
Mood=Cnd|Person=3|Tense=Presoleksoleksid, oleksidki
Mood=Imp|Person=1|Tense=Presolgem
Mood=Imp|Person=2|Tense=PresoleOlge
Mood=Imp|Person=3|Tense=Presoleolgu, ole
Mood=Imp|Tense=Presolgu
Mood=Ind|Person=1|Tense=Pastolinolime, olimegi
Mood=Ind|Person=1|Tense=Presolen, olengioleme, olemegi
Mood=Ind|Person=2|Tense=Pastolid, olidkiolite
Mood=Ind|Person=2|Tense=Presoled, oledkiolete, oletegi
Mood=Ind|Person=3|Tense=Pastoli, oligiolid, olidki
Mood=Ind|Person=3|Tense=Preson, ongi, ole, om, onson, ongi

DET

6851 DET tokens (95% of all DET tokens) have a non-empty value of Number.

DET tokens may have the following values of Number:

Paradigm seeSingPlur
Case=Abl|PronType=Demsellelt, selleltkineilt
Case=Ade|PronType=Demsel, sellel, Selgineil, nendel
Case=All|PronType=Demselleleneile, nendele
Case=Ela|PronType=Demsellest, sestneist, nendest
Case=Gen|Person=3|PronType=Prsnende
Case=Gen|PronType=Demsellenende
Case=Ill|PronType=Demsellesseneisse, nendesse
Case=Ine|PronType=Demselles, ses, Selleskineis, nendes
Case=Nom|PronType=Demsee, seegineed
Case=Par|PronType=Demsedaneid, neidki
Case=Tra|PronType=Demselleks, SeksNendeks

NUM

3457 NUM tokens (38% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (3344; 97%), NumType=Card (3251; 94%).

NUM tokens may have the following values of Number:

Paradigm kolmSingPlur
Case=Ablkolmelt
Case=Adekolmel
Case=Allkolmele
Case=Elakolmest
Case=Genkolme
Case=Inekolmes
Case=Nomkolm
Case=Parkolmekolmesid
Case=Terkolmeni
Case=Trakolmeks

Number seems to be lexical feature of NUM. 93% lemmas (138) occur only with one value of Number.

SYM

58 SYM tokens (8% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Abbr=EMPTY (51; 88%).

SYM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (18986; 89%), NOUN –[nmod]–> NOUN (15520; 63%), VERB –[nsubj]–> NOUN (10425; 70%), NOUN –[det]–> DET (6266; 94%), NOUN –[conj]–> NOUN (6144; 79%), NOUN –[nmod]–> PROPN (5118; 75%), VERB –[nsubj]–> PRON (4738; 66%), PROPN –[flat]–> PROPN (3701; 92%), NOUN –[acl]–> ADJ (3336; 52%), NOUN –[cop]–> AUX (2906; 64%).