Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Case
This feature is universal.
It occurs with 6 different values: Acc, Dat, Gen, Ins, Loc, Nom.
5103 tokens (46%) have a non-empty value of Case.
3273 types (76%) occur at least once with a non-empty value of Case.
2082 lemmas (68%) occur at least once with a non-empty value of Case.
The feature is used with 6 part-of-speech tags: NOUN (2524; 23% instances), ADJ (1406; 13% instances), PROPN (529; 5% instances), PRON (335; 3% instances), DET (275; 2% instances), NUM (34; 0% instances).
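Counts like the ones above can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: `parse_conllu` and `case_coverage` are hypothetical helper names, the four-token sample is a toy with illustrative annotations rather than a sentence from the treebank, and types are assumed to be case-sensitive word forms.

```python
from collections import Counter

def parse_conllu(text):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        feats = {}
        if cols[5] != "_":
            feats = dict(pair.split("=", 1) for pair in cols[5].split("|"))
        yield cols[1], cols[2], cols[3], feats

def case_coverage(text):
    """Return (with Case, total) counts plus breakdowns by UPOS and by value."""
    tokens = list(parse_conllu(text))
    with_case = [t for t in tokens if "Case" in t[3]]
    return {
        "tokens": (len(with_case), len(tokens)),
        "types": (len({t[0] for t in with_case}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_case}), len({t[1] for t in tokens})),
        "by_upos": Counter(t[2] for t in with_case),
        "by_value": Counter(t[3]["Case"] for t in with_case),
    }

# Toy sample (assumption: annotations are illustrative, not copied from the treebank).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "wulki", "wulki", "ADJ", "_", "Case=Nom|Number=Sing", "2", "amod", "_", "_"),
    ("2", "institut", "institut", "NOUN", "_", "Case=Nom|Number=Sing", "0", "root", "_", "_"),
    ("3", "w", "w", "ADP", "_", "_", "4", "case", "_", "_"),
    ("4", "Budyšinje", "Budyšin", "PROPN", "_", "Case=Loc|Number=Sing", "2", "nmod", "_", "_"),
])

print(case_coverage(SAMPLE)["tokens"])  # (3, 4)
```

Dividing the first number of each pair by the second gives the kind of percentage reported above for tokens, types, and lemmas.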
NOUN
2524 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (1684; 67%), Animacy=EMPTY (1388; 55%).
NOUN tokens may have the following values of Case:
Acc (495; 20% of non-empty Case): př, rěč, nastawki, wobrazy, přikład, čas, dataje, lisćinu, móc, mócnarstwo
Dat (64; 3% of non-empty Case): akademiji, dispoziciji, rostlinam, wotrjadam, Wopytowarjam, delće, dnjej, drohoćinkam, ekliptice, embryofytam
Gen (615; 24% of non-empty Case): rěčow, lěta, kilometrow, wody, kraja, lěttysaca, lět, časa, biblioteki, instituta
Ins (164; 6% of non-empty Case): l, pomocu, ablawtom, družinami, hamorom, krajemi, kralom, ličakami, mjenom, rostlinami
Loc (330; 13% of non-empty Case): lěće, času, rěči, běhu, dobje, formje, lětstotku, stronje, wodźe, zemi
Nom (856; 34% of non-empty Case): město, woda, stolica, rostliny, institut, pismo, rěč, stat, dołhosć, dźeń
EMPTY (19): km, m, CEST, centrum, jan, přir, raz, t, thumb, čas
Paradigm rěč | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Number=Sing | rěč | rěč, rěc | rěči | rěče | rěči, rěče | rěču |
Number=Plur | rěče | rěče | | rěčow | rěčach | |
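A paradigm table like the one above groups the attested forms of one lemma by Case (columns) and by the remaining features (rows). The sketch below shows one way to build such a grouping. It is an assumption about the layout, not the page generator's actual code: `paradigm` is a hypothetical helper, and the row label is simply built from every feature other than Case.

```python
from collections import defaultdict

def paradigm(tokens, lemma):
    """Group forms of `lemma` by (other features, Case).

    tokens: iterable of (form, lemma, feats) where feats is a dict.
    Returns {row_label: {case_value: sorted list of forms}}.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or "Case" not in feats:
            continue
        # Row label: all features except Case, in a stable order (assumption).
        row = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != "Case")
        table[row][feats["Case"]].add(form)
    return {row: {case: sorted(forms) for case, forms in cells.items()}
            for row, cells in table.items()}

# Singular forms taken from the rěč table above; "woda" is an unrelated lemma.
TOKENS = [
    ("rěč", "rěč", {"Case": "Nom", "Number": "Sing"}),
    ("rěči", "rěč", {"Case": "Dat", "Number": "Sing"}),
    ("rěču", "rěč", {"Case": "Ins", "Number": "Sing"}),
    ("woda", "woda", {"Case": "Nom", "Number": "Sing"}),
]

print(paradigm(TOKENS, "rěč")["Number=Sing"]["Ins"])  # ['rěču']
```

Cells that stay empty in the result correspond to case and feature combinations that are not attested for the lemma in the data.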
ADJ
1406 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Animacy=EMPTY (1246; 89%), Voice=EMPTY (1219; 87%), VerbForm=EMPTY (1218; 87%), Degree=EMPTY (894; 64%), Number=Sing (885; 63%).
ADJ tokens may have the following values of Case:
Acc (266; 19% of non-empty Case): wulku, druhe, wotpowědne, prěni, prěnje, wikowanske, wulke, Klemenowu, bohatu, cyłe
Dat (39; 3% of non-empty Case): němskej, Delnjej, Hornjej, Indusowej, Jednotliwym, Ludowemu, Persiskemu, Popłatkowemu, ablawtowym, definowanym
Gen (287; 20% of non-empty Case): druhich, serbskeje, Serbskeho, ablawtowych, wědomostnych, Třećeho, Zjednoćenych, delnjeho, mjenowanych, persiskeho
Ins (57; 4% of non-empty Case): druhim, druhimi, jednotliwymi, nowymi, přiběracu, samsnym, Baltiskim, Kapadociskej, Persiskim, Prěnju
Loc (146; 10% of non-empty Case): Serbskim, cyłym, sewjernej, babylonskej, chemiskich, druhej, historiskim, hornim, hornjej, južnej
Nom (611; 43% of non-empty Case): najwjetše, Serbski, wulki, klinowe, prěnje, serbska, wuznamne, Ekscelentny, dalše, druhe
EMPTY (15): němsko, Awstro, Tibeto, d, dołho, duchowno, hornjo, krótko, online, politisko
Paradigm serbski | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Animacy=Inan|Degree=Pos|Gender=Masc|Number=Dual | serbskej | |||||
Degree=Pos|Gender=Masc|Number=Sing | Serbski, SERBSKI | serbski | Serbskim | |||
Degree=Pos|Gender=Fem|Number=Sing | serbska | serbskeje | ||||
Degree=Pos|Gender=Neut|Number=Sing | serbske | |||||
Gender=Masc|Number=Sing | serbskemu | Serbskeho | Serbskim | |||
Gender=Masc|Number=Plur | serbskich | |||||
Gender=Fem|Number=Sing | serbska | serbsku | serbskeje | serbskej | serbskej, serbsku | |
Gender=Fem|Number=Plur | serbskim | |||||
Gender=Neut|Number=Plur | serbske |
PROPN
529 PROPN tokens (89% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (470; 89%), Gender=Masc (280; 53%).
PROPN tokens may have the following values of Case:
Acc (16; 3% of non-empty Case): Esperanto, Mezopotamisku, Aziju, Babylon, Babylonsku, Fenicisku, Institut, Israel, Mnichow, Palestinu
Dat (9; 2% of non-empty Case): Ešarrje, Francoskej, Hetitam, Leidenčanam, Mezopotamiskej, Serbam, Łužicy, Španiskej, Španičanam
Gen (127; 24% of non-empty Case): Mezopotamiskeje, Sumeričanow, Němskeje, Aramejčanow, Assyriskeje, Serbow, Syriskeje, Tigrisa, Łužicy, Akkada
Ins (34; 6% of non-empty Case): Babylonom, Eufratom, Iranom, Solawu, Wódru, Łobjom, Anatolskej, Andrapradešom, Assyriskej, Awstriskej
Loc (72; 14% of non-empty Case): Europje, Budyšinje, Mezopotamiskej, Africe, Americe, Babylonje, Berlinje, Indiskej, Litawskej, Nižozemskej
Nom (271; 51% of non-empty Case): Mezopotamiska, Assur, Assyriska, Aššur, Hammurabi, Jakub, Ur, Wikipedija, Assyričenjo, Babylon
EMPTY (67): Wikimedia, Aššur, C, Commons, Adl, Angeles, Gasche, Los, Tamil, Tlustulimu
Paradigm Mezopotamiska | Nom | Acc | Dat | Gen | Loc |
---|---|---|---|---|---|
Animacy=Inan | Mezopotamiskeje | ||||
Mezopotamiska | Mezopotamisku | Mezopotamiskej | Mezopotamiskeje, Mezopotamiskej | Mezopotamiskej |
PRON
335 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Person=EMPTY (278; 83%), PronType=Prs (256; 76%), Gender=EMPTY (215; 64%), Number=EMPTY (204; 61%), Reflex=Yes (199; 59%).
PRON tokens may have the following values of Case:
Acc (217; 65% of non-empty Case): so, to, je, jeho, jón, něšto, ju, ničo, nju, wšitko
Dat (12; 4% of non-empty Case): sej, tomu, nam, Jej, jeje, njej, sebi
Gen (21; 6% of non-empty Case): toho, nich, njeje
Ins (11; 3% of non-empty Case): tym, sobu, nimi
Loc (14; 4% of non-empty Case): tym, čimž, nim
Nom (60; 18% of non-empty Case): to, kiž, wona, wón, wone, wono, Woni, ty, štož, Wonej
EMPTY (3): t, jón
Paradigm to | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Abbr=Yes | t | |||||
PronType=Dem | to | to | tomu | toho | tym | tym |
DET
275 DET tokens (84% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Abbr=EMPTY (239; 87%), Number[psor]=EMPTY (230; 84%), Person=EMPTY (230; 84%), Poss=EMPTY (201; 73%), Animacy=EMPTY (185; 67%), Number=Sing (166; 60%).
DET tokens may have the following values of Case:
Acc (49; 18% of non-empty Case): swoje, swoju, tute, tutu, swój, kóžde, kóždy, wšě, wšěch, žane
Dat (4; 1% of non-empty Case): kotrymž, swojemu, wšemu, wšitkim
Gen (25; 9% of non-empty Case): tutych, tutoho, kotrychž, tych, kotrehož, kotrejež, kóždychžkuli, někajkeho, někotrych, swojeho
Ins (42; 15% of non-empty Case): n, swojimi, kotrymiž, swojej, tym
Loc (39; 14% of non-empty Case): někotrych, tutej, tutym, kotrejž, kotrychž, swojich, twojim, wšěch, kotrymž, kóždym
Nom (116; 42% of non-empty Case): kotrež, kotraž, kotryž, tute, tutón, tuta, někotre, wšě, někotři, Naš
EMPTY (52): jeho, jich, wjele, jeje, mnoho, mjenje, n, najwjace, tróšku, tójšto
Paradigm kotryž | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Animacy=Anim|Gender=Masc|Number=Sing | kotryž | |||||
Animacy=Anim|Gender=Masc|Number=Plur | kotřiž | kotrymž | ||||
Animacy=Inan|Gender=Masc|Number=Sing | kotryž, kotrež | kotryž | ||||
Animacy=Inan|Gender=Masc|Number=Plur | kotrež | kotrychž | kotrychž | |||
Gender=Masc|Number=Sing | kotryž | kotrehož | kotrymž | |||
Gender=Masc|Number=Plur | kotrež | |||||
Gender=Fem|Number=Sing | kotraž | kotrejež | kotrejž | |||
Gender=Fem|Number=Plur | kotrež | kotrychž | kotrymiž | |||
Gender=Neut|Number=Sing | kotrež | |||||
Gender=Neut|Number=Dual | kotrejž | |||||
Gender=Neut|Number=Plur | kotrež | kotrychž |
NUM
34 NUM tokens (9% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (33; 97%).
NUM tokens may have the following values of Case:
Acc (7; 21% of non-empty Case): dwaj, jednu, jedyn
Gen (8; 24% of non-empty Case): Mio, štyrjoch, dweju, jedneho, miliardow
Ins (1; 3% of non-empty Case): dwěmaj
Loc (5; 15% of non-empty Case): dwěmaj, jednym, woběmaj
Nom (13; 38% of non-empty Case): jedyn, dwaj, jedna, dwě, jedny
EMPTY (348): 2, 1, 6, 4, 3, 5, 7, I, 000, 10
Paradigm dwaj | Nom | Acc | Gen | Loc | Ins |
---|---|---|---|---|---|
Animacy=Inan|Gender=Masc | dwaj | dwaj | dweju | ||
Gender=Fem | dwaj, dwě | dwěmaj | |||
Gender=Neut | dwěmaj | ||||
dwaj |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (1080; 99%),
NOUN –[conj]–> NOUN (224; 94%),
NOUN –[det]–> DET (177; 82%),
PROPN –[conj]–> PROPN (85; 97%),
ADJ –[nsubj]–> NOUN (76; 90%),
ADJ –[conj]–> ADJ (63; 97%),
PROPN –[flat]–> PROPN (52; 79%),
PROPN –[amod]–> ADJ (43; 100%),
NOUN –[nsubj]–> NOUN (41; 87%),
NOUN –[appos]–> NOUN (33; 69%).
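Agreement figures of this kind can be approximated by walking each dependency edge and comparing the Case values at both ends. The sketch below is illustrative only: `case_agreement` is a hypothetical helper, the sentence is a toy with illustrative annotations, and the denominator is assumed to be the edges where both parent and child carry a non-empty Case, which this page does not state explicitly.

```python
from collections import Counter

def case_agreement(sentence):
    """Count edges whose parent and child agree in Case.

    sentence: list of dicts with keys id, head, upos, deprel, feats.
    Returns (agree, total), both keyed by (parent UPOS, deprel, child UPOS).
    Only edges where both nodes have a Case value are counted (assumption).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has no parent token
            continue
        parent_case = parent["feats"].get("Case")
        child_case = child["feats"].get("Case")
        if parent_case is None or child_case is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent_case == child_case:
            agree[key] += 1
    return agree, total

# Toy sentence (assumption: annotations are illustrative, not from the treebank).
SENTENCE = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Case": "Nom"}},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Case": "Nom"}},
    {"id": 3, "head": 4, "upos": "ADP", "deprel": "case", "feats": {}},
    {"id": 4, "head": 2, "upos": "PROPN", "deprel": "nmod", "feats": {"Case": "Loc"}},
]

agree, total = case_agreement(SENTENCE)
for key in total:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {total[key]}")
```

In the toy sentence the amod edge agrees (Nom and Nom) while the nmod edge does not (Nom and Loc), and the ADP token is skipped because it has no Case.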