Treebank Statistics: UD_Lithuanian-HSE: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
2745 tokens (51%) have a non-empty value of Number.
1943 types (84%) occur at least once with a non-empty value of Number.
1249 lemmas (78%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: NOUN (1102; 21% instances), VERB (538; 10% instances), ADJ (392; 7% instances), PROPN (300; 6% instances), PRON (208; 4% instances), AUX (106; 2% instances), DET (93; 2% instances), NUM (6; 0% instances).
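Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout; the three sample rows and the helper names (`parse_feats`, `read_tokens`, `number_coverage`) are illustrative and are not taken from the treebank or from the script that generated this page.

```python
from collections import Counter

# Hand-made sample rows (hypothetical, not copied from the treebank files).
ROWS = [
    ["1", "tauta", "tauta", "NOUN", "_", "Case=Nom|Gender=Fem|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "gali", "galėti", "VERB", "_", "Mood=Ind|Number=Sing|Person=3", "0", "root", "_", "_"],
    ["3", "būti", "būti", "AUX", "_", "VerbForm=Inf", "2", "xcomp", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn 'Case=Nom|Number=Sing' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for regular word lines."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def number_coverage(tokens, feature="Number"):
    """Count tokens, types and lemmas that carry a non-empty value of the feature."""
    tokens = list(tokens)
    marked = [t for t in tokens if feature in t[3]]
    return {
        "tokens": len(marked),
        "token_share": len(marked) / len(tokens),
        "types": len({t[0] for t in marked}),
        "lemmas": len({t[1] for t in marked}),
        "by_upos": Counter(t[2] for t in marked),
    }


stats = number_coverage(read_tokens(SAMPLE))
print(stats)
```

On the real data, the per-tag counts listed above (1102 + 538 + 392 + 300 + 208 + 106 + 93 + 6) add up to the 2745 tokens reported for the whole treebank.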
NOUN
1102 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (682; 62%).
NOUN tokens may have the following values of Number:
Plur (388; 35% of non-empty Number): laikais, metų, pilotų, prietaisai, žodžiai, žydų, dienų, imigrantų, intelektualų, interesus
Sing (714; 65% of non-empty Number): tautos, tauta, tiesa, valstybės, pasaulyje, tautą, tolerancijos, abejonės, amžiaus, daugelis
EMPTY (3): m, pusėn
Paradigm tauta | Sing | Plur |
---|---|---|
Case=Acc | tautą | tautas |
Case=Dat | tautai | |
Case=Gen | tautos | |
Case=Nom | tauta | |
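A paradigm table like the one above groups the observed forms of one lemma by their remaining features and spreads them over the values of Number. The sketch below shows one way to do that; the token list is a hypothetical reconstruction from the forms shown in the table, and `paradigm` and `render` are illustrative names, not part of any UD tool.

```python
from collections import defaultdict

# Hypothetical annotated tokens: (form, lemma, feature dict).
TOKENS = [
    ("tautą", "tauta", {"Case": "Acc", "Number": "Sing"}),
    ("tautas", "tauta", {"Case": "Acc", "Number": "Plur"}),
    ("tautai", "tauta", {"Case": "Dat", "Number": "Sing"}),
    ("tautos", "tauta", {"Case": "Gen", "Number": "Sing"}),
    ("tauta", "tauta", {"Case": "Nom", "Number": "Sing"}),
]


def paradigm(tokens, lemma, feature="Number"):
    """Map each remaining-feature string to {feature value: set of forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[rest][feats[feature]].add(form)
    return table


def render(table, values=("Sing", "Plur")):
    """Render one pipe-separated row per remaining-feature string."""
    rows = []
    for rest in sorted(table):
        cells = [", ".join(sorted(table[rest][v])) for v in values]
        rows.append(" | ".join([rest] + cells) + " |")
    return rows


rows = render(paradigm(TOKENS, "tauta"))
for row in rows:
    print(row)
```

Cells stay empty when a combination is simply unattested in the corpus, which is why small treebanks produce sparse paradigms.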
VERB
538 VERB tokens (76% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (478; 89%), Polarity=Pos (453; 84%), Definite=EMPTY (393; 73%), Case=EMPTY (392; 73%), Gender=EMPTY (383; 71%), VerbForm=Fin (381; 71%), Mood=Ind (350; 65%), Person=3 (313; 58%).
VERB tokens may have the following values of Number:
Plur (164; 30% of non-empty Number): vadinami, gali, turi, analizuoja, draudžiame, girdėję, gyvename, pastebimi, žinome, Grįžkime
Sing (374; 70% of non-empty Number): gali, negali, sako, žino, bando, dera, dūstu, kalba, nėra, reikia
EMPTY (170): žinoma, būti, galima, mylėti, turėti, vadinti, atsirasti, bandoma, diskriminuoti, gaminti
Paradigm galėti | Sing | Plur |
---|---|---|
Case=Nom|Definite=Ind|Gender=Masc|Polarity=Pos|Tense=Pres|VerbForm=Part|Voice=Pass | galimas | |
Mood=Cnd|Person=3|Polarity=Pos|VerbForm=Fin|Voice=Act | galėtų | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act | galiu | galime |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin|Voice=Act | galėsi | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin|Voice=Act | negalėjo | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin|Voice=Act | negali | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Past|VerbForm=Fin|Voice=Act | galėjo | galėjo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act | gali, negali | gali |
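The co-occurrence figures reported for VERB (for example Voice=Act with 478 of 538 tokens, about 89%) count how often another feature value, or its absence (EMPTY), appears on tokens that carry Number. A minimal sketch of such a count follows. The sample tokens are invented, and the 50% cut-off for "most frequent" is an assumption made for this illustration, not a documented rule of the statistics generator.

```python
from collections import Counter

# Invented sample: (UPOS, feature dict).
TOKENS = [
    ("VERB", {"Number": "Sing", "Voice": "Act", "Mood": "Ind"}),
    ("VERB", {"Number": "Plur", "Voice": "Act"}),
    ("VERB", {"Number": "Sing", "Voice": "Pass", "Case": "Nom"}),
    ("VERB", {"VerbForm": "Inf"}),
]


def cooccurrence(tokens, upos, feature="Number", threshold=0.5):
    """Return {"Feat=Value": (count, share)} for values above the threshold."""
    selected = [f for u, f in tokens if u == upos and feature in f]
    # Every other feature seen with this tag; a token lacking it counts as EMPTY.
    others = {k for u, f in tokens if u == upos for k in f} - {feature}
    counts = Counter()
    for feats in selected:
        for name in others:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    total = len(selected)
    return {
        pair: (n, n / total)
        for pair, n in counts.items()
        if n / total > threshold
    }


result = cooccurrence(TOKENS, "VERB")
for pair, (n, share) in sorted(result.items()):
    print(f"{pair} ({n}; {share:.0%})")
```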
ADJ
392 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (367; 94%), Definite=Ind (359; 92%), Gender=Masc (235; 60%).
ADJ tokens may have the following values of Number:
Plur (147; 38% of non-empty Number): kitų, lietuvius, lietuvių, įvairiais, dešiniaisiais, genetiškus, kitais, kitokių, lenkų, paskutiniai
Sing (245; 63% of non-empty Number): vienas, lietuvis, gero, lietuviu, tautinė, tautinės, viena, vienintelis, Laikinosios, blogesnis
EMPTY (22): 1939, XIX, nesunku, sunku, šiaip, 1941, 1944, 1961, 2002, 25
Paradigm kitas | Sing | Plur |
---|---|---|
Case=Acc|Gender=Fem | kita, kitas | |
Case=Dat|Gender=Fem | kitai | |
Case=Gen|Gender=Masc | kito | kitų |
Case=Gen|Gender=Fem | kitos | |
Case=Ins|Gender=Masc | kitais | |
Case=Loc|Gender=Masc | kitame | kituose |
Case=Nom|Gender=Masc | kitas | kiti |
Case=Nom|Gender=Fem | kitos |
PROPN
300 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (187; 62%).
PROPN tokens may have the following values of Number:
Plur (13; 4% of non-empty Number): Atėnuose, Sovietų, Vakarų, Atėnus, Dionizijų, Faidonais, Faidrais, Kritonais, Rytų, Vakarai
Sing (287; 96% of non-empty Number): Lietuvos, Strepsiado, Sokratas, Sokrato, Europos, Strepsiadas, Rusijos, Tu-154, Aristofano, Lietuva
EMPTY (23): BM, MARS, KGB, R., A., JAV, MN-61, NATO, SSRS, TSRS
Number seems to be a lexical feature of PROPN: 100% of lemmas (142) occur with only one value of Number.
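The "lexical feature" observation rests on a simple check: for each lemma, collect the set of Number values it occurs with and count the lemmas whose set has exactly one member. The sketch below illustrates that check. The (lemma, value) pairs are hypothetical, including the lemma assigned to the Athens forms, and `single_value_share` is an illustrative name.

```python
from collections import defaultdict

# Hypothetical (lemma, Number value) pairs for tokens that carry Number.
PAIRS = [
    ("Lietuva", "Sing"),
    ("Lietuva", "Sing"),
    ("Atėnai", "Plur"),
    ("Atėnai", "Plur"),
    ("tauta", "Sing"),
    ("tauta", "Plur"),
]


def single_value_share(pairs):
    """Return (single-valued lemmas, all lemmas, share of single-valued lemmas)."""
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), single / len(values)


single, total, share = single_value_share(PAIRS)
print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value")
```

A share of 100% is weaker evidence in a small treebank, where many lemmas occur only once and therefore cannot show more than one value.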
PRON
208 PRON tokens (82% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Gender=Masc (105; 50%).
PRON tokens may have the following values of Number:
Plur (88; 42% of non-empty Number): juos, jų, jie, mes, mus, mums, kurie, jiems, jus, kuriuos
Sing (120; 58% of non-empty Number): jis, ji, to, kuris, jo, man, ją, nieko, aš, jį
EMPTY (46): tai, kas, ką, save, kuo, sau, savęs, tuo, ko, niekam
Paradigm kuris | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | kurį | kuriuos |
Case=Acc|Gender=Fem | kurią | kurias |
Case=Dat|Gender=Masc | kuriems | |
Case=Gen|Gender=Masc | kurio | kurių |
Case=Gen|Gender=Fem | kurios | |
Case=Ins|Gender=Masc | kuriais | |
Case=Loc|Gender=Fem | kurioje | |
Case=Nom|Gender=Masc | kuris | kurie |
Case=Nom|Gender=Fem | kuri | kurios |
AUX
106 AUX tokens (95% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (106; 100%), VerbForm=Fin (105; 99%), Person=3 (98; 92%), Mood=Ind (92; 87%), Polarity=Pos (84; 79%).
AUX tokens may have the following values of Number:
Plur (17; 16% of non-empty Number): buvo, yra, Esame, būsime, būtų, nesame
Sing (89; 84% of non-empty Number): yra, buvo, nėra, būtų, nebūtų, būna, esu, nebuvo, bus, esąs
EMPTY (6): būti, Esama, esą
Paradigm būti | Sing | Plur |
---|---|---|
Aspect=Hab|Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | nebūna | |
Case=Nom|Definite=Ind|Gender=Masc|Polarity=Pos|Tense=Pres|VerbForm=Part | esąs | |
Mood=Cnd|Person=3|Polarity=Neg|VerbForm=Fin | nebūtų | |
Mood=Cnd|Person=3|Polarity=Pos|VerbForm=Fin | būtų | būtų |
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nesame | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | būsime | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | esu | Esame |
Mood=Ind|Person=2|Polarity=Neg|Tense=Fut|VerbForm=Fin | nebūsi | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Fut|VerbForm=Fin | nebus | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin | nebuvo | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | nėra | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bus | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Past|VerbForm=Fin | buvo | buvo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | yra, būna | yra |
DET
93 DET tokens (56% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Gender=Masc (51; 55%).
DET tokens may have the following values of Number:
Plur (33; 35% of non-empty Number): jokių, tie, tų, kokias, tokias, Tokios, Tos, jų, kitokių, kokie
Sing (60; 65% of non-empty Number): tą, kokia, tas, tokia, to, tokią, viso, kiekvienas, pats, tokiai
EMPTY (73): mūsų, savo, jo, jų, jos, mano, tavo, šių, tam, tuo
Paradigm tas | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | tą | |
Case=Acc|Gender=Fem | tą | |
Case=Gen|Definite=Ind|Gender=Masc | tų | |
Case=Gen|Gender=Masc | to | |
Case=Gen|Gender=Fem | tos | |
Case=Ins|Gender=Masc | tais | |
Case=Loc|Gender=Masc | tame | |
Case=Nom|Gender=Masc | tas | tie |
Case=Nom|Gender=Fem | toji | Tos |
Case=Nom|Person=3 | tie |
NUM
6 NUM tokens (25% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: Case=Acc (5; 83%), Gender=Masc (5; 83%).
NUM tokens may have the following values of Number:
Plur (4; 67% of non-empty Number): šimtus, Dešimtys, tūkstančius
Sing (2; 33% of non-empty Number): vieną, šimtą
EMPTY (18): du, penkiasdešimt, trijų, trys, 1994, 30, 4151, 52, 7, 92
Paradigm šimtas | Sing | Plur |
---|---|---|
| | šimtą | šimtus |
Relations with Agreement in Number
The 10 most frequent relations where the parent and child nodes agree in Number:
NOUN –[amod]–> ADJ (221; 87%),
NOUN –[conj]–> NOUN (113; 86%),
VERB –[nsubj]–> NOUN (113; 80%),
NOUN –[nmod]–> NOUN (112; 57%),
VERB –[nsubj]–> PRON (61; 79%),
VERB –[conj]–> VERB (60; 76%),
NOUN –[nmod]–> PROPN (59; 64%),
VERB –[nsubj]–> PROPN (41; 84%),
PROPN –[flat]–> PROPN (37; 90%),
NOUN –[cop]–> AUX (34; 85%).
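Agreement figures such as those listed above can be approximated by walking each dependency edge and comparing the Number value of the parent with that of the child. The following is a sketch under stated assumptions: the sentence is invented, and the percentage is computed over edges where both nodes carry Number, which may differ from the denominator used by the tool that generated this page.

```python
from collections import Counter

# Invented sentence: (id, UPOS, head id, relation, Number value or None).
SENTENCE = [
    (1, "ADJ", 2, "amod", "Sing"),
    (2, "NOUN", 3, "nsubj", "Sing"),
    (3, "VERB", 0, "root", "Sing"),
    (4, "NOUN", 3, "obj", "Plur"),
    (5, "ADV", 3, "advmod", None),
]


def agreement(sentence):
    """Return {"PARENT --[rel]--> CHILD": (agreeing edges, share of comparable edges)}."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _tid, upos, head, deprel, number in sentence:
        parent = by_id.get(head)  # None for the root, whose head is 0
        if parent is None or number is None or parent[4] is None:
            continue  # only edges where both nodes carry the feature
        key = f"{parent[1]} --[{deprel}]--> {upos}"
        total[key] += 1
        if parent[4] == number:
            agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


report = agreement(SENTENCE)
for key, (n, share) in sorted(report.items()):
    print(f"{key} ({n}; {share:.0%})")
```

For a whole treebank, the same counters would be accumulated across all sentences before the shares are computed.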