
This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: ADJ

There are 1463 ADJ lemmas (19%), 2802 ADJ types (20%) and 5272 ADJ tokens (5%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 8 in number of tokens.

The 10 most frequent ADJ lemmas: drug, dober, sam, velik, prvi, lep, nov, glaven, različen, zadnji

The 10 most frequent ADJ types: drugi, dobro, prvi, drugo, zanimivo, dober, sam, sami, lepa, pomembno

The 10 most frequent ambiguous lemmas: drug (ADJ 191, NUM 1), dober (ADJ 142, ADV 2), lep (ADJ 70, ADV 2), slovenski (ADJ 45, PROPN 1), cel (ADJ 42, PART 1), dolg (ADJ 23, NOUN 5), visok (ADJ 22, ADV 1), pravi (ADJ 19, VERB 1), fajn (ADJ 16, ADV 13), super (ADV 14, ADJ 13)

The 10 most frequent ambiguous types: dobro (ADV 73, ADJ 41, NOUN 1), drugo (ADJ 35, NOUN 1), zanimivo (ADJ 32, ADV 3), pomembno (ADJ 27, ADV 1), druga (ADJ 22, NOUN 1), celo (PART 29, ADJ 17), prvo (ADJ 16, ADV 4), fajn (ADJ 15, ADV 13), novo (ADJ 13, ADV 2), potrebno (ADJ 13, ADV 2)

Morphology

The form / lemma ratio of ADJ is 1.915243 (the average of all parts of speech is 1.751794).
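The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given above (2802 and 1463). The following is a minimal sketch of that arithmetic. The function name `form_lemma_ratio` is hypothetical, and the assumption that "forms" here equals the type count is inferred from the numbers, not stated by the page.

```python
# Counts copied from the summary line of this page.
ADJ_LEMMAS = 1463
ADJ_TYPES = 2802


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (hypothetical helper)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(ADJ_TYPES, ADJ_LEMMAS)
print(f"{ratio:.6f}")  # prints 1.915243
```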

The highest number of forms (22) was observed with the lemma “velik”: največja, največje, največji, največjih, največjo, velik, velika, velike, velikega, velikem, veliki, velikih, velikim, velikimi, veliko, večja, večje, večjem, večji, večjih, večjim, večjo.

The 2nd highest number of forms (21) was observed with the lemma “dober”: boljša, boljše, boljšega, boljšem, boljši, boljših, boljšo, dober, dobra, dobre, dobrem, dobri, dobrih, dobrim, dobrimi, dobro, najboljša, najboljše, najboljšega, najboljši, najboljših.

The 3rd highest number of forms (15) was observed with the lemma “majhen”: majhen, majhna, majhne, majhnega, majhni, majhnih, majhnim, majhno, manjša, manjše, manjši, manjših, manjšo, najmanjša, najmanjši.

ADJ occurs with 10 features: Case (5272; 100% instances), Gender (5272; 100% instances), Number (5272; 100% instances), Degree (4864; 92% instances), Definite (847; 16% instances), VerbForm (663; 13% instances), NumType (217; 4% instances), Poss (64; 1% instances), PronType (28; 1% instances), Typo (1; 0% instances)

ADJ occurs with 23 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Mult, NumType=Ord, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, PronType=Prs, Typo=Yes, VerbForm=Part

ADJ occurs with 218 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Fem|Number=Sing (483 tokens). Examples: lepa, sama, velika, dobra, glavna, rdeča, edina, stara, nova, zanimiva

Relations

ADJ nodes are attached to their parents using 25 different relations: amod (3378; 64% instances), root (471; 9% instances), conj (345; 7% instances), parataxis (185; 4% instances), obl (146; 3% instances), acl (134; 3% instances), advcl (91; 2% instances), xcomp (87; 2% instances), nsubj (79; 1% instances), ccomp (70; 1% instances), obj (62; 1% instances), reparandum (43; 1% instances), nmod (31; 1% instances), appos (27; 1% instances), advmod (26; 0% instances), parataxis:restart (22; 0% instances), fixed (16; 0% instances), orphan (15; 0% instances), csubj (12; 0% instances), discourse (11; 0% instances), vocative (10; 0% instances), flat (7; 0% instances), iobj (2; 0% instances), dislocated (1; 0% instances), parataxis:discourse (1; 0% instances)

Parents of ADJ nodes fall into 13 categories (12 parts of speech plus the artificial root, which has no tag): NOUN (3492; 66% instances), VERB (725; 14% instances), root, no parent (471; 9% instances), ADJ (321; 6% instances), DET (78; 1% instances), PROPN (72; 1% instances), PRON (41; 1% instances), ADV (31; 1% instances), NUM (24; 0% instances), PART (8; 0% instances), AUX (4; 0% instances), X (3; 0% instances), SCONJ (2; 0% instances)
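Counts of this kind can be reproduced from a CoNLL-U file by tallying, for every ADJ token, its DEPREL column and the UPOS of its head. The sketch below is illustrative only: the sample sentence is invented (not taken from the treebank), the function names are hypothetical, and labelling tokens with head 0 as `ROOT` is an assumption made here for readability.

```python
from collections import Counter

# Toy CoNLL-U sentence, invented for illustration (not from UD_Slovenian-SST).
SAMPLE = """\
# text = Lepa hiša je velika .
1\tLepa\tlep\tADJ\t_\t_\t2\tamod\t_\t_
2\thiša\thiša\tNOUN\t_\t_\t4\tnsubj\t_\t_
3\tje\tbiti\tAUX\t_\t_\t4\tcop\t_\t_
4\tvelika\tvelik\tADJ\t_\t_\t0\troot\t_\t_
5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_
"""


def read_sentences(text):
    """Yield sentences as lists of CoNLL-U column lists."""
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue  # comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges and empty nodes
        sentence.append(cols)
    if sentence:
        yield sentence


def adj_attachment_stats(sentences):
    """Count relations and parent UPOS tags of ADJ tokens."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        upos_by_id = {int(cols[0]): cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != "ADJ":
                continue
            relations[cols[7]] += 1
            # Head 0 has no token, so it gets the assumed label "ROOT".
            parent_pos[upos_by_id.get(int(cols[6]), "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = adj_attachment_stats(read_sentences(SAMPLE))
print(relations)
print(parent_pos)
```

On the full treebank, the number of ADJ tokens whose parent is the artificial root should equal the number attached with the root relation (471 in both lists above).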

3256 (62%) ADJ nodes are leaves.

719 (14%) ADJ nodes have one child.

228 (4%) ADJ nodes have two children.

1069 (20%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 16.
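The four child-count figures above add up to the 5272 ADJ tokens (3256 + 719 + 228 + 1069). A distribution like this can be derived by counting how often each token ID appears as a head. The sketch below uses an invented five-token sentence given as (id, UPOS, head) triples; the function name `adj_child_degrees` and the bucket labels are hypothetical.

```python
from collections import Counter

# Toy sentence as (id, upos, head) triples, invented for illustration.
TOKENS = [
    (1, "ADJ", 2),
    (2, "NOUN", 4),
    (3, "AUX", 4),
    (4, "ADJ", 0),
    (5, "PUNCT", 4),
]


def adj_child_degrees(tokens):
    """Map each ADJ token ID to its number of direct children."""
    children_per_head = Counter(head for _, _, head in tokens)
    return {tid: children_per_head[tid] for tid, upos, _ in tokens if upos == "ADJ"}


degrees = adj_child_degrees(TOKENS)
# Group into the same buckets the page reports: 0, 1, 2, and 3 or more.
buckets = Counter("3+" if d >= 3 else str(d) for d in degrees.values())
max_degree = max(degrees.values())

print(degrees)
print(buckets)
print(max_degree)

# Sanity check on the figures reported on this page.
assert 3256 + 719 + 228 + 1069 == 5272
```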

Children of ADJ nodes are attached using 36 different relations: punct (1221; 17% instances), advmod (1173; 17% instances), cop (947; 13% instances), nsubj (484; 7% instances), obl (365; 5% instances), conj (358; 5% instances), cc (331; 5% instances), mark (306; 4% instances), discourse:filler (288; 4% instances), discourse (234; 3% instances), parataxis (198; 3% instances), aux (183; 3% instances), reparandum (169; 2% instances), case (159; 2% instances), advcl (115; 2% instances), det (80; 1% instances), csubj (73; 1% instances), obj (69; 1% instances), parataxis:discourse (61; 1% instances), nmod (58; 1% instances), orphan (31; 0% instances), ccomp (24; 0% instances), acl (17; 0% instances), vocative (15; 0% instances), amod (13; 0% instances), appos (13; 0% instances), xcomp (12; 0% instances), dislocated (11; 0% instances), nummod (10; 0% instances), flat (6; 0% instances), expl (5; 0% instances), cc:preconj (3; 0% instances), iobj (3; 0% instances), fixed (2; 0% instances), parataxis:restart (2; 0% instances), goeswith (1; 0% instances)

Children of ADJ nodes belong to 16 different parts of speech: PUNCT (1221; 17% instances), AUX (1164; 17% instances), ADV (817; 12% instances), NOUN (707; 10% instances), VERB (500; 7% instances), PART (465; 7% instances), CCONJ (430; 6% instances), DET (326; 5% instances), ADJ (321; 5% instances), INTJ (312; 4% instances), SCONJ (307; 4% instances), ADP (166; 2% instances), PRON (152; 2% instances), X (79; 1% instances), PROPN (46; 1% instances), NUM (27; 0% instances)