home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Upper_Sorbian-UFAL: POS Tags: PRON

There are 10 PRON lemmas (0%), 35 PRON types (1%) and 338 PRON tokens (3%). Out of 16 observed tags, the rank of PRON is: 13 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: so, to, wón, kiž, štož, ja, něšto, ty, ništo, wšitko

The 10 most frequent PRON types: so, to, toho, tym, wona, wón, kiž, je, wone, wono

The 10 most frequent ambiguous lemmas: to (PRON 62, ADV 2)

The 10 most frequent ambiguous types: to (PRON 16, ADV 2), tym (PRON 16, DET 2), je (AUX 102, PRON 6, VERB 5), t (PRON 3, ADV 1, NOUN 1), jeho (DET 7, PRON 2), jeje (DET 3, PRON 1), njeje (AUX 8, PRON 1, VERB 1)

Morphology

The form / lemma ratio of PRON is 3.500000 (the average of all parts of speech is 1.418889).

The 1st highest number of forms (18) was observed with the lemma “wón”: Jej, Wonej, Woni, je, jeho, jeje, ju, jón, nich, nim, nimi, njej, njeje, nju, wona, wone, wono, wón.

The 2nd highest number of forms (5) was observed with the lemma “to”: t, to, toho, tomu, tym.

The 3rd highest number of forms (4) was observed with the lemma “so”: sebi, sej, so, sobu.

PRON occurs with 8 features: Case (335; 99% instances), PronType (333; 99% instances), Reflex (199; 59% instances), Number (134; 40% instances), Gender (123; 36% instances), Person (58; 17% instances), Animacy (5; 1% instances), Abbr (3; 1% instances)

PRON occurs with 26 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Animacy=Nhum, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

PRON occurs with 43 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes (193 tokens). Examples: so

Relations

PRON nodes are attached to their parents using 14 different relations: expl:pass (99; 29% instances), expl:pv (79; 23% instances), nsubj (53; 16% instances), obl (43; 13% instances), obj (36; 11% instances), nmod (16; 5% instances), iobj (3; 1% instances), amod (2; 1% instances), fixed (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), cc (1; 0% instances), conj (1; 0% instances), root (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: VERB (287; 85% instances), ADJ (29; 9% instances), NOUN (15; 4% instances), CCONJ (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances), (1; 0% instances)

268 (79%) PRON nodes are leaves.

55 (16%) PRON nodes have one child.

12 (4%) PRON nodes have two children.

3 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 6.

Children of PRON nodes are attached using 10 different relations: case (59; 64% instances), acl (9; 10% instances), punct (8; 9% instances), nmod (4; 4% instances), cop (3; 3% instances), fixed (3; 3% instances), nsubj (3; 3% instances), cc (1; 1% instances), conj (1; 1% instances), mark (1; 1% instances)

Children of PRON nodes belong to 10 different parts of speech: ADP (59; 64% instances), VERB (9; 10% instances), NOUN (8; 9% instances), PUNCT (8; 9% instances), AUX (3; 3% instances), ADJ (1; 1% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)