Treebank Statistics: UD_North_Sami-Giella: Features: Case
This feature is universal.
It occurs with 8 different values: Abe, Acc, Com, Ess, Gen, Ill, Loc, Nom.
10832 tokens (40%) have a non-empty value of Case.
5109 types (67%) occur at least once with a non-empty value of Case.
2932 lemmas (67%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (6374; 24% instances), PRON (2610; 10% instances), PROPN (875; 3% instances), ADJ (515; 2% instances), NUM (344; 1% instances), VERB (111; 0% instances), AUX (3; 0% instances).
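Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (FORM, LEMMA, UPOS and FEATS in columns 2, 3, 4 and 6); the function names and the three-token `SAMPLE` are hypothetical and invented for illustration, not taken from the treebank or from the tool that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(lines):
    """Yield (form, lemma, upos, feats) for each syntactic word."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def case_statistics(lines):
    """Return (token count, Case value counts, per-UPOS counts of tokens with Case)."""
    total = 0
    by_value = Counter()
    by_upos = Counter()
    for _form, _lemma, upos, feats in iter_tokens(lines):
        total += 1
        case = feats.get("Case")
        if case is not None:
            by_value[case] += 1
            by_upos[upos] += 1
    return total, by_value, by_upos


# Toy fragment with invented annotation (HEAD and DEPREL left unspecified).
SAMPLE = (
    "# text = toy example\n"
    "1\tolmmoš\tolmmoš\tNOUN\t_\tCase=Nom|Number=Sing\t_\t_\t_\t_\n"
    "2\tlea\tleat\tAUX\t_\t_\t_\t_\t_\t_\n"
    "3\tskuvllas\tskuvla\tNOUN\t_\tCase=Loc|Number=Sing\t_\t_\t_\t_\n"
)

total, by_value, by_upos = case_statistics(SAMPLE.splitlines())
with_case = sum(by_value.values())
print(f"{with_case} of {total} tokens have a non-empty value of Case")
```

Running `case_statistics` over the lines of a real treebank file (for example via `open(path, encoding="utf-8")`) should give figures comparable to those above, provided the same treebank release is used.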
NOUN
6374 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (4311; 68%).
NOUN tokens may have the following values of Case:
Acc (1286; 20% of non-empty Case): sámegiela, veahki, bierggu, biktasiid, mánáid, reivve, girjji, girjjiid, gáfe, barggu
Com (247; 4% of non-empty Case): biillain, mánáiguin, mánáin, vugiin, beatnagiiddisguin, beatnagiin, biillaiguin, bissuin, boazodoaluin, borramušain
Ess (167; 3% of non-empty Case): oahpaheaddjin, lassin, veahkkin, ovdamearkan, vuođđun, buohccedivššárin, nuorran, Eurohpameašttirin, bassin, buohccin
Gen (1301; 20% of non-empty Case): sámi, jagi, beaivvi, áigge, olbmo, sámegiela, máná, áiggi, skuvlla, sámiid
Ill (513; 8% of non-empty Case): mánáide, skuvlii, gávpogii, meahccái, sámegillii, internáhttii, mollii, bargui, heajaide, siidii
Loc (814; 13% of non-empty Case): skuvllas, internáhtas, guovllus, viesus, oasis, oktavuođas, olbmuin, barggus, goađis, gávpogis
Nom (2046; 32% of non-empty Case): olbmot, mánát, eadni, gánda, olmmoš, stállu, oahppit, mánná, nieida, oahpaheaddji
EMPTY (45): M., dearvvašvuođa-, A., Mr., giella-, skuvla-, Bivdo-, Gieldda-, IL, J.
| Paradigm olmmoš | Nom | Acc | Gen | Loc | Ess | Com | Ill |
|---|---|---|---|---|---|---|---|
| _ | | | | | olmmožin | | |
| Number=Sing | olmmoš | olbmo | olbmo | olbmos | | | olbmui |
| Number=Plur | olbmot | olbmuid | olbmuid | olbmuin | | olbmuiguin | olbmuide |
| Number=Plur\|Number[psor]=Plur\|Person[psor]=3 | | | | | | | olbmuideaset |
PRON
2610 PRON tokens (92% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (1624; 62%), PronType=Prs (1537; 59%).
PRON tokens may have the following values of Case:
Acc (356; 14% of non-empty Case): dan, maid, su, iežas, daid, dán, iežaset, du, mu, maidege
Com (53; 2% of non-empty Case): dainna, daiguin, dáinna, iežainis, nuppiin, suinna, duinna, iežainan, iežaineaskka, maiguin
Ess (4; 0% of non-empty Case): danin, dákkárin, iehčaneame
Gen (410; 16% of non-empty Case): mu, dan, dán, min, su, iežas, sin, du, daid, iežaset
Ill (170; 7% of non-empty Case): munnje, dasa, sutnje, dutnje, sidjiide, alccesis, dán, midjiide, dan, earáide
Loc (239; 9% of non-empty Case): mus, dus, das, mis, sis, sus, dán, dan, mas, dain
Nom (1378; 53% of non-empty Case): son, mun, mii, dat, sii, don, dát, soai, moai, geat
EMPTY (237): buot, juohke, eará, dakkár, muhtun, makkár, muhtin, unnán, dákkár, seamma
| Paradigm dat | Nom | Acc | Gen | Loc | Ess | Com | Ill |
|---|---|---|---|---|---|---|---|
| Number=Sing | dat | dan, dange | dan | das, dan, dasnai | | dainna | dasa, dan |
| Number=Plur | dat, Dathan | daid | daid | dain | | daiguin, daid | daidda |
| | | | | | danin | | |
PROPN
875 PROPN tokens (88% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (865; 99%).
PROPN tokens may have the following values of Case:
Acc (43; 5% of non-empty Case): Sarvva, Liná, Divvuma, Máhte, Sámedikki, Antonsen, Beckham, Buolbmága, Busi, Efraima
Com (14; 2% of non-empty Case): Sámedikkiin, Birehiin, Hanseniin, Iŋggáin, Juffáin, Máhte-Iŋggáin, Márehiin, Nilut_Cupain, Rihtáin, Riibmagállásiin
Ess (5; 1% of non-empty Case): Gállábárdnin, Jesusin, Mihkkalažžan, Márehažžan, Smierrun
Gen (223; 25% of non-empty Case): Norgga, Sámi, Finnmárkku, Kárášjoga, Romssa, Sámedikki, Ipmila, Guovdageainnu, Deanu, Ruoŧa
Ill (63; 7% of non-empty Case): Kárášjohkii, Sápmái, Ellii, Finnmárkkuopmodahkii, Gáivutnii, Hámmárfestii, Trosterudii, Aarbortii, Abbai, Arnii
Loc (147; 17% of non-empty Case): Kárášjogas, Guovdageainnus, Finnmárkkus, Deanus, Gáivuonas, Norggas, Romssas, Máhtes, Olmmáivákkis, Oslos
Nom (380; 43% of non-empty Case): Gállá, Máret, Máhtte, Liná, Ánde, Sámediggi, Ánne, Biret, Ipmil, Finnmárkkuopmodat
EMPTY (117): Nils, Stuorra, Aslak, Biret, Jack, Lene, Per, Anders, Johan, Margrethe
| Paradigm Sámediggi | Nom | Acc | Gen | Loc | Com | Ill |
|---|---|---|---|---|---|---|
| | Sámediggi | Sámedikki | Sámedikki, Sámedikke | Sámedikkis | Sámedikkiin | Sámediggái |
ADJ
515 ADJ tokens (37% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=EMPTY (437; 85%), Number=Sing (329; 64%).
ADJ tokens may have the following values of Case:
Acc (18; 3% of non-empty Case): buori, buriid, ollu, Goalmmáda, baháid, buhtismeahttumiid, doloža, guoskevačča, sullasačča, suohttasiiddiska
Com (3; 1% of non-empty Case): buriin
Ess (59; 11% of non-empty Case): duhtavažžan, nubbin, seavdnjadin, nuorran, ruoksadin, bassin, bivnnuhin, boarisin, buhtisin, buhtismeahttumin
Gen (20; 4% of non-empty Case): nuppi, jagáš, buoremusaid, buori, 7-jahkásačča, buriid, doloža, parlamentáralaččaid, ráhkkásis
Ill (1; 0% of non-empty Case): sullásaččaide
Loc (8; 2% of non-empty Case): nuppi, Nuorabuin, Nuoramusain, doložis, ráhkkásisttán
Nom (406; 79% of non-empty Case): buorre, váttis, vejolaš, veara, buorit, boaris, dehálaš, suohtas, divrras, duohta
EMPTY (861): ollu, ođđa, vuosttaš, stuora, stuorra, eanaš, olles, buoremus, amas, dálá
| Paradigm buorre | Nom | Acc | Gen | Ess | Com |
|---|---|---|---|---|---|
| _ | | | | buorrin | |
| Degree=Cmp | | | | buorebun | |
| Degree=Cmp\|Number=Sing | buoret | | | | |
| Degree=Sup | | | | buoremussan | |
| Degree=Sup\|Number=Sing | buoremus | | | | |
| Degree=Sup\|Number=Plur | | | buoremusaid | | |
| Number=Sing | buorre | buori | buori | | buriin |
| Number=Plur | buorit | buriid | buriid | | |
NUM
344 NUM tokens (99% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (344; 100%), Number=Sing (327; 95%).
NUM tokens may have the following values of Case:
Acc (79; 23% of non-empty Case): guokte, moadde, golbma, máŋga, ovtta, vihtta, Galliid, guhtta, njeallje, 1300
Com (15; 4% of non-empty Case): ovttain, golmmain, guvttiin, viđain, galliin, golmmaiguin, čuđiin
Ess (2; 1% of non-empty Case): guoktin, oktan
Gen (61; 18% of non-empty Case): golmma, viđa, máŋgga, ovtta, 12, guovtti, 1.8.2001, moatti, 05.01.00, 12.03.2010
Ill (23; 7% of non-empty Case): golmma, beannot, guovtti, čuohtái, moatti, máŋgga, njealji, ovtta, golmmaide
Loc (47; 14% of non-empty Case): ovtta, guovtti, 1982:s, 1995:s, golmmain, máŋgga, 1834:s, 1877:s, 1898:s, 1899:s
Nom (117; 34% of non-empty Case): okta, guokte, golbma, máŋga, njeallje, vihtta, moadde, 1971, 2005, 50
EMPTY (4): guoktenuppelot, vihttalot
Paradigm guokte | Nom | Acc | Gen | Loc | Ess | Com | Ill |
---|---|---|---|---|---|---|---|
Number=Sing | guokte | guokte | guovtti, guovtte | guovtti | guvttiin | guovtti | |
Number=Plur | guovttit | guvttiid | |||||
guoktin |
VERB
111 VERB tokens (3% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (111; 100%), Mood=EMPTY (111; 100%), Number=EMPTY (111; 100%), Person=EMPTY (111; 100%), Tense=EMPTY (111; 100%), VerbForm=Ger (111; 100%).
VERB tokens may have the following values of Case:
Abe (11; 10% of non-empty Case): beroškeahttá, eahpitkeahttá, logakeahttá, bážikeahttá, dieđikeahttá, mávssekeahtes
Ess (64; 58% of non-empty Case): boahtimin, fárremin, leamen, čierastallame, bargame, bargamin, bassaladdame, bassame, boahtime, oađđimin
Gen (24; 22% of non-empty Case): vácci, čuoigga, gudnejahttin, ráhkistan, Mearkkašan, Suga, bora, fuopmášan, namahan, njága
Loc (12; 11% of non-empty Case): goargŋumis, juhkamis, bargamis, borgguheames, botkemis, deaivvadeamis, gođđimis, guldaleames, jáhkkimis, vuostáváldimis
EMPTY (4199): lea, leat, lei, ledje, bođii, boahtá, manai, vuolgit, ožžon, dieđe
| Paradigm bargat | Loc | Ess |
|---|---|---|
| | bargamis | bargame, bargamin |
Case seems to be a lexical feature of VERB. 93% of lemmas (67) occur with only one value of Case.
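The share of lemmas that occur with only one value of Case can be sketched as follows. This is a minimal illustration, assuming tokens are already available as `(lemma, upos, case)` triples with `case` set to `None` when the feature is absent; the function name and the toy triples are hypothetical, and the exact definition used to produce the figure above is not stated on this page.

```python
from collections import defaultdict


def single_case_share(tokens, upos):
    """Count lemmas of the given UPOS that occur with exactly one Case value.

    Returns (lemmas with one value, lemmas with at least one value).
    Lemmas that never carry Case are ignored (an assumption of this sketch).
    """
    values = defaultdict(set)
    for lemma, tag, case in tokens:
        if tag == upos and case is not None:
            values[lemma].add(case)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy triples invented for illustration; not taken from the treebank.
TOY_TOKENS = [
    ("bargat", "VERB", "Loc"),
    ("bargat", "VERB", "Ess"),
    ("boahtit", "VERB", "Ess"),
    ("boahtit", "VERB", "Ess"),
    ("leat", "VERB", None),
    ("olmmoš", "NOUN", "Nom"),
]

single, with_case = single_case_share(TOY_TOKENS, "VERB")
print(f"{single} of {with_case} VERB lemmas occur with only one value of Case")
```

A high share suggests that the value is fixed per lemma rather than chosen by inflection, which is the sense in which the page calls the feature lexical.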
AUX
3 AUX tokens (0% of all AUX
tokens) have a non-empty value of Case
.
The most frequent other feature values with which AUX
and Case
co-occurred: Mood=EMPTY (3; 100%), Number=EMPTY (3; 100%), Person=EMPTY (3; 100%), Polarity=EMPTY (3; 100%), Tense=EMPTY (3; 100%), VerbForm=Ger (3; 100%).
AUX
tokens may have the following values of Case
:
Ess
(3; 100% of non-emptyCase
): leamen, áigume, áiguminEMPTY
(1991): lea, leat, ii, lei, eai, ledje, galgá, lean, sáhttá, in
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[conj]–> NOUN (323; 96%),
NOUN –[det]–> PRON (168; 93%),
ADJ –[nsubj]–> NOUN (118; 98%),
ADJ –[nsubj]–> PRON (76; 100%),
NOUN –[nsubj]–> PRON (74; 74%),
NOUN –[nsubj]–> NOUN (65; 76%),
PROPN –[conj]–> PROPN (42; 84%),
NOUN –[amod]–> NUM (31; 97%),
ADJ –[conj]–> ADJ (13; 93%),
NOUN –[appos]–> NOUN (13; 76%).
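Agreement counts of this kind can be sketched as below. The sketch assumes each sentence is a list of token dicts with `id`, `upos`, `head`, `deprel` and an optional `case`, and it assumes the percentage is taken over parent-child pairs in which both nodes have a non-empty Case; that denominator is a guess, since the page does not define it. All names and the toy sentence are hypothetical.

```python
from collections import Counter


def case_agreement(sentences):
    """Count parent-child pairs that agree in Case, keyed by relation type.

    Returns (agree, both): agree[key] counts pairs with equal Case values,
    both[key] counts pairs where parent and child both have a Case value.
    """
    agree = Counter()
    both = Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root token (head 0) or head outside the sentence
            parent_case = parent.get("case")
            child_case = child.get("case")
            if parent_case is None or child_case is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if parent_case == child_case:
                agree[key] += 1
    return agree, both


def format_relation(key, agree, both):
    """Render one relation in the same style as the list above."""
    parent_upos, deprel, child_upos = key
    percent = round(100 * agree[key] / both[key])
    return f"{parent_upos} –[{deprel}]–> {child_upos} ({agree[key]}; {percent}%)"


# Toy sentence with invented annotation, for illustration only.
TOY_SENTENCES = [[
    {"id": 1, "upos": "NOUN", "head": 0, "deprel": "root", "case": "Nom"},
    {"id": 2, "upos": "NOUN", "head": 1, "deprel": "conj", "case": "Nom"},
    {"id": 3, "upos": "PRON", "head": 1, "deprel": "det", "case": "Gen"},
    {"id": 4, "upos": "ADV", "head": 1, "deprel": "advmod"},
]]

agree, both = case_agreement(TOY_SENTENCES)
for key, _count in agree.most_common(10):
    print(format_relation(key, agree, both))
```

Sorting by the agreement count, as `most_common` does here, matches the ordering of the list above, where relations are ranked by the first number in parentheses.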