Statistics of NUM in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Slovenian-SST: POS Tags: `NUM`

There are 78 NUM lemmas (1%), 123 NUM types (1%) and 1048 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent NUM lemmas: en, dva, trije, pet, tisoč, dvajset, štirje, deset, trideset, sedem

The 10 most frequent NUM types: dva, ena, en, tri, tisoč, pet, eno, dve, dvajset, trideset

The 10 most frequent ambiguous lemmas: en (NUM 225, DET 140), pet (NUM 53, ADJ 2), drug (ADJ 191, NUM 1), dvajseti (ADJ 4, NUM 1)

The 10 most frequent ambiguous types: ena (NUM 61, DET 22), en (NUM 54, DET 44), eno (NUM 45, DET 40), eni (NUM 21, DET 5), sto (NUM 21, X 1), enega (NUM 18, DET 9), ene (ADV 13, NUM 13, DET 9), enem (NUM 6, DET 3), enim (NUM 4, DET 2), tridesetih (NUM 3, ADJ 1)

ena
- NUM 61: in ne samo , da je bila ena , so prišle dve .
- DET 22: takšna ena , eee , z blata hiša .
en
- NUM 54: triindvajseti april je že , samo še en teden pa bomo že kurili kresove .
- DET 44: če dovolite , glejte , mi dovolite , a mi dovolite , da dam en predlog ?
eno
- NUM 45: saj je eno in isto .
- DET 40: in to so mešali kar eno olje pa ene take č- stare barve .
eni
- NUM 21: jah , zdaj , kakor kateri , ne , eni imajo dosti .
- DET 5: aja , oni je bil tu v Zrečah nekaj pri eni teti ali kako ?
sto
- NUM 21: on ima približno sto ljudi v veleposlaništvih po Evropi .
- X 1: aja , zdaj sem jaz na kratko že opisal zgodbo tega filma , e , pravzaprav v vsaki , v vsakem filmu je ena zgodba , mogoče opišem zgodbo , e , Kraljeva vrnitev , kjer na prestol že več sto leti ni sedel pravi kralj , e , na prestolu vladajo sicer ljudje kot neki namestniki kralja , kralj , ki je po krvni liniji kralj , pa se boji tega , e , svojega nasledstva , ker je njegov prednik Ilzidur , e , bil , e , pravzaprav podvržen , e , slabosti , e , gospodarja prstana , se pravi tega prstana .
enega
- NUM 18: e , prej si nisem izbral , e , enega glavnega kraja .
- DET 9: vidite , zdaj bova prišla pa do enega pojma .
ene
- ADV 13: in mi je vsako jutro že za zajtrk skuhala po ene pet jajc pa masonek .
- NUM 13: eee , eem , plačujem pa tudi čisto običajne , ja , tako da ene in druge .
- DET 9: in to so mešali kar eno olje pa ene take č- stare barve .
enem
- NUM 6: ja , ker jaz se spomnim , da sem pa šel na enem izmed teh pohodov na Kamniško sedlo iz Kamniške Bistrice gor .
- DET 3: no , skozi zgodbo spremljamo dekle po imenu Mima , ki je nastopala kot pevka in plesalka v enem triu , kjer je bilo zelo važno vzdrževati neko tako javno podobo neke take popolne , nedolžne deklice .
enim
- NUM 4: in smo pred enim mescem dobili pa psičko vipet- , eem , vipetko , mmm , ki ji je srednji otrok dal ime Luna .
- DET 2: s temi tisoč petsto markami sem se peljal z enim kolegom , [name:personal] [name:surname] , iz Maribora v Nemčijo , on je kupil dejansko kamion , jaz sem pa iskal enega Opel , Opel karavan , brez šip .
tridesetih
- NUM 3: e , v tridesetih letih od sprejetja Konvencije o otrokovih pravicah je le-ta , e , pomembno vplivala in pomagala položaju otrok po svetu .
- ADJ 1: je pa tudi posebna moda , pač glih ta moda iz dvajsetih , tridesetih let prejšnjega stoletja , pa te polkadot pač tele pike po oblekah ali pa resice .

Morphology

The form / lemma ratio of NUM is 1.576923 (the average of all parts of speech is 1.751794).

The 1st highest number of forms (10) was observed with the lemma “en”: een, en, ena, ene, enega, enem, eni, enih, enim, eno.

The 2nd highest number of forms (4) was observed with the lemma “dva”: dva, dve, dveh, dvema.

The 3rd highest number of forms (4) was observed with the lemma “trije”: treh, tremi, tri, trije.

NUM occurs with 5 features: Case (1048; 100% instances), NumType (1048; 100% instances), Number (1048; 100% instances), NumForm (1047; 100% instances), Gender (496; 47% instances)

NUM occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Word, NumType=Card, NumType=Ord, NumType=Sets, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 53 feature combinations. The most frequent feature combination is Case=Acc|Number=Plur|NumForm=Word|NumType=Card (317 tokens). Examples: pet, dvajset, tisoč, trideset, deset, petnajst, sto, petdeset, sedem, tristo

Relations

NUM nodes are attached to their parents using 19 different relations: nummod (572; 55% instances), flat (144; 14% instances), obl (72; 7% instances), conj (65; 6% instances), nsubj (48; 5% instances), root (48; 5% instances), obj (28; 3% instances), parataxis (22; 2% instances), reparandum (14; 1% instances), nmod (7; 1% instances), appos (6; 1% instances), orphan (5; 0% instances), amod (4; 0% instances), fixed (4; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), discourse:filler (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (585; 56% instances), NUM (195; 19% instances), VERB (157; 15% instances), (48; 5% instances), ADJ (27; 3% instances), DET (10; 1% instances), PRON (8; 1% instances), PROPN (8; 1% instances), X (6; 1% instances), ADV (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)

630 (60%) NUM nodes are leaves.

241 (23%) NUM nodes have one child.

105 (10%) NUM nodes have two children.

72 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 27 different relations: flat (142; 19% instances), punct (142; 19% instances), advmod (123; 16% instances), conj (73; 10% instances), case (63; 8% instances), cc (26; 3% instances), discourse (22; 3% instances), nmod (22; 3% instances), cop (20; 3% instances), det (16; 2% instances), parataxis (16; 2% instances), nsubj (15; 2% instances), reparandum (13; 2% instances), amod (12; 2% instances), orphan (9; 1% instances), discourse:filler (8; 1% instances), acl (7; 1% instances), mark (6; 1% instances), fixed (5; 1% instances), aux (3; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), cc:preconj (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:restart (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: NUM (195; 26% instances), PUNCT (142; 19% instances), PART (75; 10% instances), ADP (70; 9% instances), ADV (53; 7% instances), DET (52; 7% instances), NOUN (37; 5% instances), ADJ (24; 3% instances), AUX (23; 3% instances), VERB (21; 3% instances), CCONJ (20; 3% instances), X (13; 2% instances), INTJ (12; 2% instances), SCONJ (6; 1% instances), PRON (4; 1% instances), PROPN (3; 0% instances)

Treebank Statistics: UD_Slovenian-SST: POS Tags: NUM

Morphology

Relations

Treebank Statistics: UD_Slovenian-SST: POS Tags: `NUM`