home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PUD: POS Tags: PRON

There are 17 PRON lemmas (0%), 65 PRON types (1%) and 576 PRON tokens (3%). Out of 15 observed tags, the rank of PRON is: 11 in number of lemmas, 8 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: se, on, já, což, jenž, co, něco, nic, kdo, někdo

The 10 most frequent PRON types: se, si, co, což, ho, nich, ní, jí, mu, mě

The 10 most frequent ambiguous lemmas: on (PRON 101, ADP 1), jenž (PRON 19, DET 8), co (PRON 18, SCONJ 17, ADV 1), I (NUM 5, PRON 1), ten (DET 180, ADV 2, PRON 1), you (NOUN 1, PRON 1)

The 10 most frequent ambiguous types: se (PRON 304, ADP 14), co (SCONJ 17, PRON 14, ADV 1), je (AUX 152, PRON 5), You (NOUN 1, PRON 1), jehož (DET 3, PRON 1)

Morphology

The form / lemma ratio of PRON is 3.823529 (the average of all parts of speech is 1.426331).

The 1st highest number of forms (20) was observed with the lemma “on”: ho, je, ji, jich, jim, jí, jím, mu, nich, nimi, ní, ním, ně, něho, něj, něm, němu, on, ona, oni.

The 2nd highest number of forms (8) was observed with the lemma “jenž”: jehož, jenž, jež, nichž, nimž, niž, níž, něhož.

The 3rd highest number of forms (5) was observed with the lemma “já”: já, mi, mě, nám, nás.

PRON occurs with 10 features: PronType (576; 100% instances), Case (574; 100% instances), Variant (379; 66% instances), Reflex (354; 61% instances), Number (150; 26% instances), Person (128; 22% instances), Gender (103; 18% instances), PrepCase (64; 11% instances), Animacy (35; 6% instances), Foreign (2; 0% instances)

PRON occurs with 28 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Variant=Short

PRON occurs with 77 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (304 tokens). Examples: se

Relations

PRON nodes are attached to their parents using 19 different relations: expl:pv (240; 42% instances), obj (78; 14% instances), nsubj (53; 9% instances), obl (52; 9% instances), obl:arg (48; 8% instances), expl:pass (45; 8% instances), nmod (34; 6% instances), iobj (4; 1% instances), root (4; 1% instances), conj (3; 1% instances), det (3; 1% instances), fixed (2; 0% instances), flat (2; 0% instances), nsubj:pass (2; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), obl:agent (1; 0% instances)

Parents of PRON nodes belong to 11 different parts of speech: VERB (497; 86% instances), ADJ (32; 6% instances), NOUN (28; 5% instances), (4; 1% instances), ADV (3; 1% instances), DET (3; 1% instances), NUM (3; 1% instances), ADP (2; 0% instances), AUX (2; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances)

476 (83%) PRON nodes are leaves.

84 (15%) PRON nodes have one child.

6 (1%) PRON nodes have two children.

10 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 5.

Children of PRON nodes are attached using 17 different relations: case (75; 56% instances), amod (11; 8% instances), cop (8; 6% instances), punct (8; 6% instances), nsubj (6; 4% instances), nmod (5; 4% instances), acl (3; 2% instances), acl:relcl (3; 2% instances), cc (3; 2% instances), advmod:emph (2; 1% instances), det:numgov (2; 1% instances), fixed (2; 1% instances), mark (2; 1% instances), obl (2; 1% instances), conj (1; 1% instances), det (1; 1% instances), obl:arg (1; 1% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (73; 54% instances), ADJ (12; 9% instances), NOUN (12; 9% instances), AUX (8; 6% instances), PUNCT (8; 6% instances), DET (7; 5% instances), CCONJ (4; 3% instances), ADV (3; 2% instances), SCONJ (3; 2% instances), PROPN (2; 1% instances), NUM (1; 1% instances), PRON (1; 1% instances), VERB (1; 1% instances)