Treebank Statistics: UD_Slovenian-SSJ: Features: Number
This feature is universal.
It occurs with 3 different values: Dual
, Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
148225 tokens (55%) have a non-empty value of Number
.
48958 types (101%) occur at least once with a non-empty value of Number
.
22215 lemmas (87%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (56865; 21% instances), ADJ (28426; 11% instances), VERB (22411; 8% instances), AUX (15812; 6% instances), PROPN (10239; 4% instances), DET (7978; 3% instances), PRON (5026; 2% instances), NUM (1468; 1% instances).
NOUN
56865 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Dual
(599; 1% of non-emptyNumber
): leti, letoma, meseca, letih, otroka, policista, primerih, starša, strani, državiPlur
(15865; 28% of non-emptyNumber
): let, ljudi, letih, dni, tolarjev, milijonov, ljudje, odstotkov, oči, podatkovSing
(40401; 71% of non-emptyNumber
): leta, strani, delo, primer, dan, leto, čas, del, mesto, času
Paradigm leto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | leto | leti | leta |
Case=Dat | letom | ||
Case=Gen | leta | let | let |
Case=Ins | letom | letoma | leti |
Case=Loc | letu | letih | letih |
Case=Nom | leto | leta |
ADJ
28426 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (25970; 91%), VerbForm=EMPTY (24781; 87%), Definite=EMPTY (24330; 86%).
ADJ
tokens may have the following values of Number
:
Dual
(280; 1% of non-emptyNumber
): trebušni, izgubljeni, novi, posamični, zadnja, desnima, dodatna, drugačna, drugih, ediniPlur
(8805; 31% of non-emptyNumber
): drugih, druge, različnih, nove, drugi, novih, sami, drugimi, zadnjih, slovenskihSing
(19341; 68% of non-emptyNumber
): prvi, mogoče, drugi, sam, novo, veliko, prva, drugo, potrebno, drugega
Paradigm drug | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Def|Gender=Masc | drugi | ||
Case=Acc|Definite=Ind|Gender=Masc | drug | ||
Case=Acc|Gender=Masc | drugega | druge | |
Case=Acc|Gender=Fem | drugo | druge | |
Case=Acc|Gender=Neut | drugo | druga | |
Case=Dat|Gender=Masc | drugemu | drugim | |
Case=Dat|Gender=Fem | drugi | ||
Case=Gen|Gender=Masc | drugega | drugih | |
Case=Gen|Gender=Fem | druge | drugih | |
Case=Gen|Gender=Neut | drugega | drugih | |
Case=Ins|Gender=Masc | drugim | drugimi | |
Case=Ins|Gender=Fem | drugo | drugimi | |
Case=Ins|Gender=Neut | drugim | drugimi | |
Case=Loc|Gender=Masc | drugem | drugih | drugih |
Case=Loc|Gender=Fem | drugi | drugih | drugih |
Case=Loc|Gender=Neut | drugem | drugih | |
Case=Nom|Definite=Def|Gender=Masc | drugi | ||
Case=Nom|Definite=Ind|Gender=Masc | drug | ||
Case=Nom|Gender=Masc | drugi | ||
Case=Nom|Gender=Fem | druga | drugi | druge |
Case=Nom|Gender=Neut | drugo | druga |
VERB
22411 VERB tokens (91% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Tense=EMPTY (11880; 53%), Mood=EMPTY (11423; 51%), Person=EMPTY (11423; 51%), VerbForm=Part (11423; 51%).
VERB
tokens may have the following values of Number
:
Dual
(650; 3% of non-emptyNumber
): imata, sta, imela, morala, bila, odšla, bili, dobila, začela, morataPlur
(7663; 34% of non-emptyNumber
): so, imajo, imeli, morali, moramo, morajo, začeli, imamo, dobili, biliSing
(14098; 63% of non-emptyNumber
): je, ima, bilo, ni, gre, bo, mora, imel, pomeni, praviEMPTY
(2181): videti, imeti, biti, vedeti, dobiti, narediti, povedati, reči, sprejeti, najti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | bijejo | ||
Gender=Masc|VerbForm=Part | bil | bila, bla | bili |
Gender=Fem|VerbForm=Part | bila | bili | bile |
Gender=Neut|VerbForm=Part | bilo, blo | bili | |
Mood=Imp|Person=2|VerbForm=Fin | bodi | ||
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nismo | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bomo | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | boste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si, s | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | niso | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
AUX
15812 AUX tokens (91% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (14371; 91%), Mood=Ind (14359; 91%), Polarity=Pos (13271; 84%), Tense=Pres (12883; 81%), Person=3 (12848; 81%).
AUX
tokens may have the following values of Number
:
Dual
(487; 3% of non-emptyNumber
): sta, sva, bila, bosta, nista, nisva, bili, bovaPlur
(4221; 27% of non-emptyNumber
): so, bodo, smo, niso, bili, bomo, boste, ste, bile, nismoSing
(11104; 70% of non-emptyNumber
): je, bo, ni, sem, bil, bila, bilo, bom, nisem, siEMPTY
(1515): bi, biti, b
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Gender=Masc|VerbForm=Part | bil | bila | bili, bli |
Gender=Fem|VerbForm=Part | bila, bla | bili | bile |
Gender=Neut|VerbForm=Part | bilo, blo | bili | bila |
Mood=Imp|Person=2|VerbForm=Fin | bodi | bodite | |
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | bosta | boste |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si, as | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
PROPN
10239 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Gender=Masc (6668; 65%), Case=Nom (5770; 56%).
PROPN
tokens may have the following values of Number
:
Dual
(7; 0% of non-emptyNumber
): Francoza, Belokranjca, Egipčana, Francozov, Litijana, MakedoncaPlur
(530; 5% of non-emptyNumber
): ZDA, Slovenci, Slovencev, Nemci, Francozi, Rusi, Slovence, Američani, Aten, AtenahSing
(9702; 95% of non-emptyNumber
): Slovenije, Sloveniji, EU, Slovenija, Evropi, Ljubljana, Ljubljani, Evrope, Slovenijo, Maribor
Paradigm Francoz | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | Francoza | Francoze | |
Case=Dat | Francozom | ||
Case=Gen | Francozov | Francozov | |
Case=Ins | Francozom | ||
Case=Nom | Francoz | Francoza | Francozi |
Number
seems to be lexical feature of PROPN
. 99% lemmas (5013) occur only with one value of Number
.
DET
7978 DET tokens (85% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Number[psor]=EMPTY (6502; 81%), Person=EMPTY (6502; 81%), Poss=EMPTY (5684; 71%).
DET
tokens may have the following values of Number
:
Dual
(155; 2% of non-emptyNumber
): oba, obeh, obe, obema, ti, svoja, ta, katerih, katerima, njuniPlur
(2131; 27% of non-emptyNumber
): vse, vseh, vsi, teh, katerih, te, svojih, svoje, nekatere, nekateriSing
(5692; 71% of non-emptyNumber
): to, tem, tega, ta, vse, svojo, svoje, vsak, katerem, svojEMPTY
(1374): več, nekaj, veliko, manj, dovolj, malo, toliko, pol, preveč, največ
Paradigm ta | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | ta, tega | te | |
Case=Acc|Gender=Fem | to | ti | te |
Case=Acc|Gender=Neut | to | ta | |
Case=Dat|Gender=Masc | temu | tem | |
Case=Dat|Gender=Fem | tej | tem | |
Case=Dat|Gender=Neut | temu | tem | |
Case=Gen|Gender=Masc | tega | teh | teh |
Case=Gen|Gender=Fem | te | teh | |
Case=Gen|Gender=Neut | tega | teh | |
Case=Ins|Gender=Masc | tem | tema | temi |
Case=Ins|Gender=Fem | to | temi | |
Case=Ins|Gender=Neut | tem | temi | |
Case=Loc|Gender=Masc | tem | teh | teh |
Case=Loc|Gender=Fem | tej | teh | |
Case=Loc|Gender=Neut | tem | teh | |
Case=Nom|Gender=Masc | ta | ta | ti |
Case=Nom|Gender=Fem | ta | ti | te |
Case=Nom|Gender=Neut | to | ta |
PRON
5026 PRON tokens (55% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (5026; 100%), PronType=Prs (3884; 77%), Person=3 (2772; 55%), Variant=Short (2535; 50%).
PRON
tokens may have the following values of Number
:
Dual
(115; 2% of non-emptyNumber
): ju, nama, jima, njima, njiju, naju, midva, onadva, vaju, vamaPlur
(1367; 27% of non-emptyNumber
): jih, nas, jim, nam, vam, njih, njimi, vas, vi, miSing
(3544; 71% of non-emptyNumber
): ga, jo, kar, mu, kaj, mi, ji, me, nekaj, kdoEMPTY
(4080): se, si, seboj, sebi, sebe, zase, sabo, nase, vase, medse
Paradigm on | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | njega | njiju, onadva | njih, nje |
Case=Acc|Gender=Masc|Variant=Short | ga | ju, jih | jih |
Case=Acc|Gender=Fem | njo | njiju | |
Case=Acc|Gender=Fem|Variant=Short | jo | ju | jih |
Case=Acc|Gender=Neut|Variant=Short | ga | ju | jih |
Case=Dat|Gender=Masc | njemu | njima | njim |
Case=Dat|Gender=Masc|Variant=Short | mu | jima | jim |
Case=Dat|Gender=Fem | njej | njim | |
Case=Dat|Gender=Fem|Variant=Short | ji | jima | jim |
Case=Dat|Gender=Neut|Variant=Short | mu | jim | |
Case=Gen|Gender=Masc | njega | njiju | njih |
Case=Gen|Gender=Masc|Variant=Short | ga | jih | |
Case=Gen|Gender=Fem | nje | njih | |
Case=Gen|Gender=Fem|Variant=Short | je | ju | jih |
Case=Gen|Gender=Neut | njega | njih | |
Case=Gen|Gender=Neut|Variant=Short | ga | jih | |
Case=Ins|Gender=Masc | njim | njima | njimi |
Case=Ins|Gender=Fem | njo | njima | njimi |
Case=Ins|Gender=Neut | njim | njimi | |
Case=Loc|Gender=Masc | njem | njiju | njih |
Case=Loc|Gender=Fem | njej | njima | njih |
Case=Loc|Gender=Neut | njem | njih | |
Case=Nom|Gender=Masc | on | onadva | oni |
Case=Nom|Gender=Fem | ona |
NUM
1468 NUM tokens (26% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (1468; 100%), NumType=Card (1462; 100%).
NUM
tokens may have the following values of Number
:
Dual
(278; 19% of non-emptyNumber
): dve, dva, dveh, dvemaPlur
(739; 50% of non-emptyNumber
): tri, štiri, pet, tisoč, treh, deset, štirih, sto, šest, sedemSing
(451; 31% of non-emptyNumber
): eno, ena, eden, enega, en, enem, eni, ene, enim, dvojeEMPTY
(4117): 2, 1, 10, 3, 6, 30, 1., 20, 4, 2000
Paradigm en | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | en, enega, Enga | |
Case=Acc|Gender=Fem | eno | |
Case=Acc|Gender=Neut | eno | |
Case=Dat|Gender=Masc | enemu | |
Case=Dat|Gender=Fem | eni | |
Case=Gen|Gender=Masc | enega, enga | enih |
Case=Gen|Gender=Fem | ene | |
Case=Gen|Gender=Neut | enega | |
Case=Ins|Gender=Masc | enim | |
Case=Ins|Gender=Fem | eno | |
Case=Ins|Gender=Neut | enim | |
Case=Loc|Gender=Masc | enem | |
Case=Loc|Gender=Fem | eni | enih |
Case=Loc|Gender=Neut | enem | |
Case=Nom|Gender=Masc | en | eni |
Case=Nom|Gender=Fem | ena | |
Case=Nom|Gender=Neut | eno |
Number
seems to be lexical feature of NUM
. 96% lemmas (43) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (21319; 100%),
VERB –[aux]–> AUX (9362; 88%),
NOUN –[nmod]–> NOUN (9311; 64%),
VERB –[nsubj]–> NOUN (6430; 93%),
VERB –[obl]–> NOUN (6111; 52%),
NOUN –[det]–> DET (5007; 88%),
NOUN –[conj]–> NOUN (3583; 79%),
ADJ –[cop]–> AUX (3290; 97%),
NOUN –[nmod]–> PROPN (2458; 82%),
NOUN –[acl]–> VERB (2097; 73%).