Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Number
This feature is universal.
It occurs with 4 different values: Dual, Plur, Ptan, Sing.
This is a layered feature with the following layers: Number, Number[psor].
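A layered feature is written in the FEATS column with the layer name in square brackets after the feature name. The following is a minimal sketch of how the two layers could be told apart; the helper name `split_layer` and the example annotation are hypothetical and not taken from the treebank.

```python
import re

def split_layer(feature_name):
    """Split 'Number[psor]' into ('Number', 'psor').

    A plain name such as 'Number' yields ('Number', None).
    """
    m = re.fullmatch(r"([A-Za-z0-9]+)(?:\[([a-z0-9]+)\])?", feature_name)
    if m is None:
        raise ValueError(f"not a feature name: {feature_name!r}")
    return m.group(1), m.group(2)

# Hypothetical FEATS string of a possessive determiner (illustration only).
feats = "Gender=Masc|Number=Sing|Number[psor]=Plur|Poss=Yes"
for pair in feats.split("|"):
    name, value = pair.split("=", 1)
    print(split_layer(name), value)
```

Keeping the layer separate from the base name makes it possible to count `Number` and `Number[psor]` independently, which is how the statistics on this page treat them.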
5903 tokens (53%) have a non-empty value of Number.
3744 types (86%) occur at least once with a non-empty value of Number.
2386 lemmas (78%) occur at least once with a non-empty value of Number.
The feature is used with 9 part-of-speech tags: NOUN (2528; 23% instances), ADJ (1409; 13% instances), VERB (692; 6% instances), PROPN (545; 5% instances), AUX (287; 3% instances), DET (275; 2% instances), PRON (134; 1% instances), NUM (32; 0% instances), ADV (1; 0% instances).
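Counts of this kind can be reproduced from the treebank's CoNLL-U files. The sketch below is only an illustration under stated assumptions: the three-token `SAMPLE` is hand-made rather than copied from the treebank, and the function names are hypothetical. It tallies the value of Number per UPOS tag and treats tokens without the feature as `EMPTY`.

```python
from collections import Counter, defaultdict

# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
1\tlěta\tlěto\tNOUN\t_\tCase=Gen|Number=Sing\t0\troot\t_\t_
2\tsu\tbyć\tAUX\t_\tMood=Ind|Number=Plur|Person=3\t1\tcop\t_\t_
3\tkm\tkm\tNOUN\t_\tAbbr=Yes\t1\tnmod\t_\t_
"""

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Gen|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def number_by_upos(conllu_text):
    """Count Number values per UPOS; tokens without Number count as 'EMPTY'."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        counts[upos][feats.get("Number", "EMPTY")] += 1
    return counts

counts = number_by_upos(SAMPLE)
for upos, c in sorted(counts.items()):
    non_empty = sum(n for value, n in c.items() if value != "EMPTY")
    print(upos, dict(c), f"{non_empty}/{sum(c.values())} non-empty")
```

To run it on real data, the contents of a CoNLL-U file would be passed to `number_by_upos` instead of `SAMPLE`.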
NOUN
2528 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Animacy=EMPTY (1389; 55%).
NOUN tokens may have the following values of Number:
Dual (27; 1% of non-empty Number): měsacaj, rěkomaj, Kralej, atomaj, atomow, genusaj, izotopaj, kmjenaj, likwidaj, lětomaj
Plur (809; 32% of non-empty Number): rěčow, kilometrow, nastawki, rostliny, lět, knihi, města, rěče, statow, wobrazy
Ptan (4; 0% of non-empty Number): droždźemi, duri, hody, wiki
Sing (1688; 67% of non-empty Number): l, př, město, rěč, woda, lěta, stolica, lěće, mócnarstwo, pismo
EMPTY (15): km, m, CEST, centrum, jan, thumb, čas
Paradigm lěto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | lěto | | |
Case=Gen | lěta | | lět, lětow |
Case=Loc | lěće, lětu | lětomaj | lětach |
Case=Nom | lěto | | |
ADJ
1409 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Animacy=EMPTY (1249; 89%), Voice=EMPTY (1220; 87%), VerbForm=EMPTY (1219; 87%), Degree=EMPTY (897; 64%).
ADJ tokens may have the following values of Number:
Dual (12; 1% of non-empty Number): dalšej, fotosynteizskej, přesunjenej, rozbiwanej, rozpušćenej, serbskej, sonantnej, wodźikoweju, wudospołnjatej, znatej
Plur (511; 36% of non-empty Number): druhich, druhe, ablawtowych, dalše, wjacore, prěnje, wažne, wikowanske, wotpowědne, wulke
Sing (886; 63% of non-empty Number): serbski, serbskeje, Serbskeho, najwjetše, prěni, wulki, wulku, klinowe, serbska, Ekscelentny
EMPTY (12): němsko, Awstro, Tibeto, dołho, duchowno, hornjo, krótko, online, politisko, syrisko
Paradigm serbski | Sing | Dual | Plur |
---|---|---|---|
Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc | serbskej | ||
Case=Acc|Degree=Pos|Gender=Masc | serbski | ||
Case=Acc|Degree=Pos|Gender=Neut | serbske | ||
Case=Acc|Gender=Fem | serbsku | ||
Case=Dat|Gender=Masc | serbskemu | ||
Case=Dat|Gender=Fem | serbskim | ||
Case=Gen|Degree=Pos|Gender=Fem | serbskeje | ||
Case=Gen|Gender=Masc | Serbskeho | serbskich | |
Case=Gen|Gender=Fem | serbskeje | ||
Case=Ins|Gender=Fem | serbskej, serbsku | ||
Case=Loc|Degree=Pos|Gender=Masc | Serbskim | ||
Case=Loc|Gender=Masc | Serbskim | ||
Case=Loc|Gender=Fem | serbskej | ||
Case=Nom|Degree=Pos|Gender=Masc | Serbski, SERBSKI | ||
Case=Nom|Degree=Pos|Gender=Fem | serbska | ||
Case=Nom|Gender=Fem | serbska | ||
Case=Nom|Gender=Neut | serbske |
VERB
692 VERB tokens (84% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (642; 93%), Mood=Ind (629; 91%), Person=3 (594; 86%), Tense=Pres (433; 63%).
VERB tokens may have the following values of Number:
Dual (12; 2% of non-empty Number): jewjetej, matej, móžetej, nabywaštej, nahrawałoj, přidźělitej, rozšěrištaj, spěchowaštej, słušatej, wotkryłoj
Plur (237; 34% of non-empty Number): su, běchu, maja, eksistuja, móžachu, móžeja, pokazuja, wužiwachu, wužiwaja, hodźa
Sing (443; 64% of non-empty Number): ma, móže, wobsahuje, móžeš, hlej, leži, rěči, dyrbi, wužiwa, hodźi
EMPTY (130): nastać, měć, pisać, přełožować, wobkedźbować, čitać, dać, definować, dopokazać, kliknyć
Paradigm měć | Sing | Dual | Plur |
---|---|---|---|
Animacy=Inan|Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | mał | | |
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | měł | | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin | njeměješe | | |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | měješe | | mějachu |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | ma, nima | matej | maja, nimaja |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|VerbType=Mod | ma, nima | | maja |
Mood=Ind|Tense=Pres|VerbForm=Fin | | | maja |
PROPN
545 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (281; 52%).
PROPN tokens may have the following values of Number:
Dual (2; 0% of non-empty Number): Łužicomaj
Plur (53; 10% of non-empty Number): Sumeričanow, Aramejčanow, Assyričenjo, Serbow, Assyričanow, Geuzen, Milčanow, Serbach, Łužičanow, Alpow
Ptan (4; 1% of non-empty Number): Drježdźanach, Drježdźany, Mułkecy, Wikach
Sing (486; 89% of non-empty Number): Mezopotamiskeje, Mezopotamiska, Mezopotamiskej, Wikimedia, Łužicy, Europje, Assur, Assyriska, Aššur, Babylon
EMPTY (51): Aššur, C, Angeles, Gasche, Los, Tamil, Adl, Beth, Bilād, CET
Paradigm Wikipedija | Sing | Plur |
---|---|---|
Case=Acc | Wikipediju | |
Case=Gen | Wikipedije | Wikipedijow |
Case=Loc | Wikipediji | |
Case=Nom | Wikipedija |
Number seems to be a lexical feature of PROPN. 99% of lemmas (324) occur with only one value of Number.
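The lexical-feature observation rests on counting how many lemmas are ever seen with more than one value. A minimal sketch of that count follows, assuming that only non-empty values are considered (the page does not state how EMPTY is handled); the function name and the observation list are hypothetical.

```python
from collections import defaultdict

def single_value_share(observations):
    """observations: iterable of (lemma, number_value) pairs for one UPOS tag.

    Returns (count, share) of lemmas seen with exactly one distinct value.
    """
    values = defaultdict(set)
    for lemma, number in observations:
        values[lemma].add(number)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)

# Hypothetical observations (illustration only, not treebank counts).
obs = [
    ("Łužica", "Sing"), ("Łužica", "Dual"),
    ("Drježdźany", "Ptan"),
    ("Babylon", "Sing"), ("Babylon", "Sing"),
]
single, share = single_value_share(obs)
print(single, f"{share:.0%}")
```

A share close to 100% suggests that the value is a property of the lemma rather than of the individual word form.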
AUX
287 AUX tokens (99% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (285; 99%), Person=3 (283; 99%), Mood=Ind (275; 96%), Voice=EMPTY (242; 84%), Tense=Pres (194; 68%).
AUX tokens may have the following values of Number:
Dual (9; 3% of non-empty Number): buštej, stej, běštej, staj
Plur (93; 32% of non-empty Number): su, buchu, njejsu, běchu, bychu, njebuchu, njesu
Sing (185; 64% of non-empty Number): je, bu, bě, by, njeje, sy, budu, budźe, był, była
EMPTY (2): być
Paradigm być | Sing | Dual | Plur |
---|---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | był | | |
Gender=Fem|Tense=Past|VerbForm=Part|Voice=Act | była | | |
Mood=Cnd|Person=3|VerbForm=Fin | by | | bychu |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | sy | | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin | | | njebuchu |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | njeje | | njejsu, njesu |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | budu, budźe | | |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | bě, bu | běštej | běchu, buchu |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass | bu | buštej | buchu |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | je | stej, staj | su |
Mood=Ind|Person=3|VerbForm=Fin|Voice=Pass | | | buchu |
DET
275 DET tokens (84% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Abbr=EMPTY (239; 87%), Number[psor]=EMPTY (230; 84%), Person=EMPTY (230; 84%), Poss=EMPTY (201; 73%), Animacy=EMPTY (185; 67%).
DET tokens may have the following values of Number:
Dual (3; 1% of non-empty Number): Wobě, jeju, kotrejž
Plur (106; 39% of non-empty Number): kotrež, tute, wšě, někotrych, swoje, kotrychž, někotre, tutych, wšěch, někotři
Sing (166; 60% of non-empty Number): n, kotryž, kotraž, tutón, tuta, swoju, kotrež, tute, tutej, tutu
EMPTY (52): jeho, jich, wjele, jeje, mnoho, mjenje, n, najwjace, tróšku, tójšto
Paradigm kotryž | Sing | Dual | Plur |
---|---|---|---|
Animacy=Anim|Case=Dat|Gender=Masc | kotrymž | ||
Animacy=Anim|Case=Nom|Gender=Masc | kotryž | kotřiž | |
Animacy=Inan|Case=Acc|Gender=Masc | kotryž | ||
Animacy=Inan|Case=Gen|Gender=Masc | kotrychž | ||
Animacy=Inan|Case=Loc|Gender=Masc | kotrychž | ||
Animacy=Inan|Case=Nom|Gender=Masc | kotryž, kotrež | kotrež | |
Case=Gen|Gender=Masc | kotrehož | ||
Case=Gen|Gender=Fem | kotrejež | ||
Case=Ins|Gender=Fem | kotrymiž | ||
Case=Loc|Gender=Masc | kotrymž | ||
Case=Loc|Gender=Fem | kotrejž | kotrychž | |
Case=Loc|Gender=Neut | kotrychž | ||
Case=Nom|Gender=Masc | kotryž | kotrež | |
Case=Nom|Gender=Fem | kotraž | kotrež | |
Case=Nom|Gender=Neut | kotrež | kotrejž | kotrež |
PRON
134 PRON tokens (40% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (134; 100%), Gender=Neut (83; 62%), Person=EMPTY (76; 57%).
PRON tokens may have the following values of Number:
Dual (1; 1% of non-empty Number): Wonej
Plur (19; 14% of non-empty Number): je, wone, kiž, Woni, nam, nich, nimi
Sing (114; 85% of non-empty Number): to, toho, tym, wona, wón, wono, čimž, jón, t, tomu
EMPTY (204): so, kiž, sej, sobu, sebi
Paradigm wón | Sing | Dual | Plur |
---|---|---|---|
Animacy=Anim|Case=Nom|Gender=Masc | Woni | ||
Animacy=Inan|Case=Acc|Gender=Masc | je | ||
Animacy=Nhum|Case=Acc|Gender=Masc | jeho | ||
Case=Acc|Gender=Masc | jón, jeho | ||
Case=Acc|Gender=Fem | ju, nju | je | |
Case=Acc | je | ||
Case=Dat|Gender=Fem | Jej, jeje, njej | ||
Case=Gen|Gender=Masc | nich | ||
Case=Gen|Gender=Fem | njeje | ||
Case=Ins|Gender=Neut | nimi | ||
Case=Loc|Gender=Masc | nim | ||
Case=Loc|Gender=Neut | nim | ||
Case=Nom|Gender=Masc | wón | ||
Case=Nom|Gender=Fem | wona | wone | |
Case=Nom|Gender=Neut | wono, wone | wone | |
Case=Nom | Wonej | ||
Gender=Masc | jón |
NUM
32 NUM tokens (8% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (31; 97%).
NUM tokens may have the following values of Number:
Dual (12; 38% of non-empty Number): dwaj, dwěmaj, dweju, dwě, woběmaj
Plur (4; 13% of non-empty Number): Mio, miliardow
Sing (16; 50% of non-empty Number): jedyn, jedna, jednu, jednym, jedneho, jedny
EMPTY (350): 2, 1, 6, 4, 3, 5, 7, I, 000, 10
ADV
1 ADV token (0% of all ADV tokens) has a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Degree=Pos (1; 100%), PronType=EMPTY (1; 100%).
ADV tokens may have the following values of Number:
Plur (1; 100% of non-empty Number): wuchodne
EMPTY (534): tež, tak, hišće, zwjetša, hač, něhdźe, hižo, tu, wjace, najprjedy
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (1083; 99%),
VERB –[nsubj]–> NOUN (348; 91%),
NOUN –[nmod]–> NOUN (309; 58%),
VERB –[obl]–> NOUN (235; 56%),
NOUN –[conj]–> NOUN (211; 89%),
NOUN –[det]–> DET (179; 83%),
ADJ –[cop]–> AUX (137; 96%),
NOUN –[nmod]–> PROPN (108; 64%),
PROPN –[conj]–> PROPN (82; 93%),
NOUN –[cop]–> AUX (79; 91%).
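Agreement figures like those listed above can be approximated by comparing the Number value of each node with that of its parent. The sketch below is an illustration only: it assumes that the percentage is taken over parent-child pairs in which both nodes carry Number, which this page does not state explicitly, and the four-token sentence and function names are hypothetical.

```python
from collections import Counter

def number_of(feats):
    """Return the value of Number from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Number="):  # does not match the Number[psor] layer
            return pair.split("=", 1)[1]
    return None

def agreement(sentence):
    """sentence: list of (id, upos, feats, head, deprel) tuples.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs that agree in Number, and all pairs where both nodes have Number.
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for tid, upos, feats, head, deprel in sentence:
        if head == 0:
            continue  # the root has no parent to agree with
        parent = by_id[head]
        p_num, c_num = number_of(parent[2]), number_of(feats)
        if p_num is None or c_num is None:
            continue
        key = (parent[1], deprel, upos)
        total[key] += 1
        if p_num == c_num:
            agree[key] += 1
    return agree, total

# Hypothetical sentence (illustration only, not treebank data).
sentence = [
    (1, "ADJ", "Case=Nom|Number=Plur", 2, "amod"),
    (2, "NOUN", "Case=Nom|Number=Plur", 3, "nsubj"),
    (3, "VERB", "Mood=Ind|Number=Plur|Person=3", 0, "root"),
    (4, "NOUN", "Case=Acc|Number=Sing", 3, "obj"),
]
agree, total = agreement(sentence)
for key in sorted(total):
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}", agree[key], total[key])
```

Summing the two counters over all sentences of a treebank would give, per relation, the number of agreeing pairs and the base for the percentage.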