Treebank Statistics: UD_Slovenian-SSJ: Features: Case
This feature is universal.
It occurs with 6 different values: Acc
, Dat
, Gen
, Ins
, Loc
, Nom
.
134799 tokens (50%) have a non-empty value of Case
.
41081 types (84%) occur at least once with a non-empty value of Case
.
19512 lemmas (77%) occur at least once with a non-empty value of Case
.
The feature is used with 7 part-of-speech tags: NOUN (56865; 21% instances), ADJ (28426; 11% instances), ADP (24235; 9% instances), PROPN (10239; 4% instances), DET (7978; 3% instances), PRON (5588; 2% instances), NUM (1468; 1% instances).
NOUN
56865 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Number=Sing (40401; 71%).
NOUN
tokens may have the following values of Case
:
Acc
(13185; 23% of non-emptyCase
): leto, primer, delo, dan, življenje, čas, način, mesto, vlogo, voljoDat
(1384; 2% of non-emptyCase
): ljudem, členu, delu, koncu, moškim, bogovom, otrokom, predsedniku, skupini, članomGen
(15799; 28% of non-emptyCase
): leta, let, dela, ljudi, tolarjev, milijonov, časa, dni, sveta, zakonaIns
(3720; 7% of non-emptyCase
): leti, pomočjo, delom, letom, vojno, časom, imenom, ljudmi, besedami, naslovomLoc
(8214; 14% of non-emptyCase
): letih, strani, času, svetu, delu, mestu, koncu, letu, podlagi, področjuNom
(14563; 26% of non-emptyCase
): predsednik, ljudje, del, večina, človek, država, čas, mesto, zakon, otroci
Paradigm leto | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Number=Sing | leto | leto | leta | letu | letom | |
Number=Dual | leti | let | letih | letoma | ||
Number=Plur | leta | leta | letom | let | letih | leti |
ADJ
28426 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Degree=Pos (25970; 91%), VerbForm=EMPTY (24781; 87%), Definite=EMPTY (24330; 86%), Number=Sing (19341; 68%).
ADJ
tokens may have the following values of Case
:
Acc
(5596; 20% of non-emptyCase
): druge, novo, nove, veliko, drugo, prvi, prvo, različne, prihodnje, splošnoDat
(596; 2% of non-emptyCase
): drugim, drugemu, novim, državnemu, mlajšim, drugi, evropskim, novemu, velikemu, desniGen
(6451; 23% of non-emptyCase
): drugih, različnih, novega, drugega, novih, evropske, slovenskih, slovenske, prvega, velikihIns
(1651; 6% of non-emptyCase
): drugim, drugimi, različnimi, veliko, kratkim, drugo, novimi, številnimi, velikim, evropskoLoc
(3405; 12% of non-emptyCase
): drugi, drugem, drugih, zadnjih, prvem, prvi, različnih, glavnem, novem, zadnjemNom
(10727; 38% of non-emptyCase
): mogoče, drugi, sam, prvi, prva, velika, sama, druga, potrebno, sami
Paradigm drug | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Definite=Def|Gender=Masc|Number=Sing | drugi | drugi | ||||
Definite=Ind|Gender=Masc|Number=Sing | drug | drug | ||||
Gender=Masc|Number=Sing | drugega | drugemu | drugega | drugem | drugim | |
Gender=Masc|Number=Dual | drugih | |||||
Gender=Masc|Number=Plur | drugi | druge | drugim | drugih | drugih | drugimi |
Gender=Fem|Number=Sing | druga | drugo | drugi | druge | drugi | drugo |
Gender=Fem|Number=Dual | drugi | drugih | ||||
Gender=Fem|Number=Plur | druge | druge | drugih | drugih | drugimi | |
Gender=Neut|Number=Sing | drugo | drugo | drugega | drugem | drugim | |
Gender=Neut|Number=Plur | druga | druga | drugih | drugih | drugimi |
ADP
24235 ADP tokens (100% of all ADP
tokens) have a non-empty value of Case
.
ADP
tokens may have the following values of Case
:
Acc
(5858; 24% of non-emptyCase
): za, na, v, čez, skozi, po, med, pod, zoper, predDat
(523; 2% of non-emptyCase
): k, proti, kljub, h, navkljub, blizu, Klub, nasprotiGen
(3663; 15% of non-emptyCase
): iz, od, do, zaradi, brez, z, poleg, s, izmed, gledeIns
(4248; 18% of non-emptyCase
): z, s, med, pred, pod, za, nadLoc
(9943; 41% of non-emptyCase
): v, na, po, pri, o, ob
Paradigm za | Acc | Gen | Ins |
---|---|---|---|
za | za | za |
PROPN
10239 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Number=Sing (9702; 95%), Gender=Masc (6668; 65%).
PROPN
tokens may have the following values of Case
:
Acc
(769; 8% of non-emptyCase
): Slovenijo, EU, Evropo, Ljubljano, Jugoslavijo, Slovence, Francijo, Nemčijo, Japonsko, RusijoDat
(256; 3% of non-emptyCase
): Sloveniji, Evropi, Janezu, Kopaču, Ljubljani, Mariji, Srbom, Bosku, Diegu, EulerjuGen
(1723; 17% of non-emptyCase
): Slovenije, EU, Evrope, RS, ESS, Ljubljane, Slovencev, Amerike, Kosova, ZemljeIns
(406; 4% of non-emptyCase
): ZDA, Madžarsko, Rusijo, Britanijo, Davidom, Francijo, Hrvaško, Irakom, Jugoslavijo, MarjanomLoc
(1315; 13% of non-emptyCase
): Sloveniji, Evropi, Ljubljani, ZDA, Ameriki, Italiji, Mariboru, Nemčiji, Rimu, ZemljiNom
(5770; 56% of non-emptyCase
): Slovenija, Ljubljana, Maribor, Janez, LJUBLJANA, New, Bojan, Jože, Slovenci, Boris
Paradigm Slovenija | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Slovenija, SLOVENIJA | Slovenijo | Sloveniji | Slovenije, SLOVENIJE | Sloveniji | Slovenijo |
DET
7978 DET tokens (85% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Number[psor]=EMPTY (6502; 81%), Person=EMPTY (6502; 81%), Number=Sing (5692; 71%), Poss=EMPTY (5684; 71%).
DET
tokens may have the following values of Case
:
Acc
(2003; 25% of non-emptyCase
): to, vse, svoje, svojo, svoj, ta, vsak, te, ves, njegovoDat
(305; 4% of non-emptyCase
): temu, vsem, vsemu, svoji, tistim, tej, svojim, njegovi, svojemu, temGen
(1475; 18% of non-emptyCase
): tega, vseh, te, teh, svojega, svoje, svojih, katerih, vsega, njegovegaIns
(538; 7% of non-emptyCase
): tem, svojo, katerimi, svojim, katerim, katero, svojimi, vsemi, temi, njegovimiLoc
(1278; 16% of non-emptyCase
): tem, katerem, katerih, kateri, tej, svoji, vseh, njegovem, svojem, tehNom
(2379; 30% of non-emptyCase
): to, ta, vsi, vse, njegov, njegova, vsak, tisti, nekateri, teEMPTY
(1374): več, nekaj, veliko, manj, dovolj, malo, toliko, pol, preveč, največ
Paradigm ta | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | ta | ta, tega | temu | tega | tem | tem |
Gender=Masc|Number=Dual | ta | teh | teh | tema | ||
Gender=Masc|Number=Plur | ti | te | tem | teh | teh | temi |
Gender=Fem|Number=Sing | ta | to | tej | te | tej | to |
Gender=Fem|Number=Dual | ti | ti | ||||
Gender=Fem|Number=Plur | te | te | tem | teh | teh | temi |
Gender=Neut|Number=Sing | to | to | temu | tega | tem | tem |
Gender=Neut|Number=Plur | ta | ta | tem | teh | teh | temi |
PRON
5588 PRON tokens (61% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Reflex=EMPTY (5026; 90%), PronType=Prs (4446; 80%), Number=Sing (3544; 63%), Variant=Short (2927; 52%).
PRON
tokens may have the following values of Case
:
Acc
(2356; 42% of non-emptyCase
): ga, jih, jo, kaj, me, kar, nas, nekaj, vas, juDat
(1514; 27% of non-emptyCase
): si, mu, mi, ji, jim, nam, vam, ti, njemu, sebiGen
(377; 7% of non-emptyCase
): jih, ga, njih, nas, česar, je, njega, sebe, nje, ničesarIns
(295; 5% of non-emptyCase
): njimi, njim, seboj, njo, nami, sabo, mano, njima, čimer, menojLoc
(252; 5% of non-emptyCase
): njem, njej, nas, njih, čemer, sebi, meni, čem, vas, marsičemNom
(794; 14% of non-emptyCase
): kar, kaj, kdo, jaz, nič, nekaj, nihče, nekdo, vi, onEMPTY
(3518): se
Paradigm on | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | on | njega | njemu | njega | njem | njim |
Gender=Masc|Number=Sing|Variant=Short | ga | mu | ga | |||
Gender=Masc|Number=Dual | onadva | njiju, onadva | njima | njiju | njiju | njima |
Gender=Masc|Number=Dual|Variant=Short | ju, jih | jima | ||||
Gender=Masc|Number=Plur | oni | njih, nje | njim | njih | njih | njimi |
Gender=Masc|Number=Plur|Variant=Short | jih | jim | jih | |||
Gender=Fem|Number=Sing | ona | njo | njej | nje | njej | njo |
Gender=Fem|Number=Sing|Variant=Short | jo | ji | je | |||
Gender=Fem|Number=Dual | njiju | njima | njima | |||
Gender=Fem|Number=Dual|Variant=Short | ju | jima | ju | |||
Gender=Fem|Number=Plur | njim | njih | njih | njimi | ||
Gender=Fem|Number=Plur|Variant=Short | jih | jim | jih | |||
Gender=Neut|Number=Sing | njega | njem | njim | |||
Gender=Neut|Number=Sing|Variant=Short | ga | mu | ga | |||
Gender=Neut|Number=Dual|Variant=Short | ju | |||||
Gender=Neut|Number=Plur | njih | njih | njimi | |||
Gender=Neut|Number=Plur|Variant=Short | jih | jim | jih |
NUM
1468 NUM tokens (26% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumForm=Word (1468; 100%), NumType=Card (1462; 100%), Number=Plur (739; 50%).
NUM
tokens may have the following values of Case
:
Acc
(595; 41% of non-emptyCase
): eno, dve, tri, dva, štiri, tisoč, pet, deset, en, enegaDat
(18; 1% of non-emptyCase
): eni, trem, enemu, petim, štirim, petdesetimGen
(175; 12% of non-emptyCase
): dveh, treh, enega, ene, petih, štirih, sedmih, desetih, osmih, petnajstihIns
(78; 5% of non-emptyCase
): dvema, tremi, petimi, enim, eno, desetimi, šestimi, štirimi, sedmimi, devetimiLoc
(216; 15% of non-emptyCase
): dveh, enem, eni, štirih, treh, petih, šestih, desetih, osmih, sedmihNom
(386; 26% of non-emptyCase
): ena, eden, dva, dve, trije, tri, en, pet, tisoč, enoEMPTY
(4117): 2, 1, 10, 3, 6, 30, 1., 20, 4, 2000
Paradigm en | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | en | en, enega, Enga | enemu | enega, enga | enem | enim |
Gender=Masc|Number=Plur | eni | enih | ||||
Gender=Fem|Number=Sing | ena | eno | eni | ene | eni | eno |
Gender=Fem|Number=Plur | enih | |||||
Gender=Neut|Number=Sing | eno | eno | enega | enem | enim |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[amod]–> ADJ (21257; 99%),
NOUN –[case]–> ADP (18927; 99%),
NOUN –[det]–> DET (4982; 88%),
NOUN –[conj]–> NOUN (4341; 96%),
PROPN –[case]–> ADP (1960; 99%),
ADJ –[nsubj]–> NOUN (1661; 99%),
PROPN –[flat:name]–> PROPN (1551; 100%),
ADJ –[conj]–> ADJ (1255; 96%),
DET –[case]–> ADP (1143; 99%),
PROPN –[conj]–> PROPN (805; 99%).