home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: POS Tags: PRON

There are 28 PRON lemmas (0%), 65 PRON types (0%) and 94847 PRON tokens (3%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: der, sich, es, sie, man, er, wir, was, wer, ich

The 10 most frequent PRON types: sich, es, die, sie, man, das, er, der, wir, was

The 10 most frequent ambiguous lemmas: der (DET 359943, PRON 28684, X 2), es (PRON 13851, PROPN 1), sie (PRON 8059, X 16), man (PRON 6680, X 2), wir (PRON 4106, X 1), was (PRON 2181, X 5), nichts (PRON 795, ADV 2), etwas (ADV 568, PRON 423), du (PRON 54, X 5), ihr (DET 7652, PRON 22)

The 10 most frequent ambiguous types: es (PRON 11325, PROPN 1), die (DET 77836, PRON 12604, X 2), sie (PRON 5415, X 16), man (PRON 5900, X 2), das (DET 25405, PRON 4235, SCONJ 5, X 1), der (DET 91439, PRON 4856, X 2), wir (PRON 1680, X 1), was (PRON 1450, X 5), dem (DET 66367, PRON 1681, X 1), nichts (PRON 750, ADV 2)

Morphology

The form / lemma ratio of PRON is 2.321429 (the average of all parts of speech is 2.529726).

The 1st highest number of forms (11) was observed with the lemma “der”: d., da, das, dem, den, denen, der, deren, derer, dessen, die.

The 2nd highest number of forms (4) was observed with the lemma “es”: ’s, es, ihm, s.

The 3rd highest number of forms (4) was observed with the lemma “wer”: wem, wen, wer, wessen.

PRON occurs with 10 features: PronType (94847; 100% instances), Case (93577; 99% instances), Number (73108; 77% instances), Person (54207; 57% instances), Gender (44113; 47% instances), Reflex (21144; 22% instances), Polite (70; 0% instances), Abbr (3; 0% instances), Foreign (1; 0% instances), Typo (1; 0% instances)

PRON occurs with 26 feature-value pairs: Abbr=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, PronType=Dem, PronType=Dem,Rel, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 87 feature combinations. The most frequent feature combination is Case=Acc|Person=3|PronType=Prs|Reflex=Yes (17728 tokens). Examples: sich

Relations

PRON nodes are attached to their parents using 21 different relations: nsubj (49866; 53% instances), obj (14660; 15% instances), expl:pv (10469; 11% instances), obl (5345; 6% instances), nsubj:pass (4696; 5% instances), obl:arg (3644; 4% instances), expl (3123; 3% instances), nmod (1919; 2% instances), xcomp (332; 0% instances), conj (229; 0% instances), det (200; 0% instances), root (164; 0% instances), appos (77; 0% instances), acl (58; 0% instances), ccomp (32; 0% instances), parataxis (14; 0% instances), advcl (6; 0% instances), csubj (6; 0% instances), reparandum (4; 0% instances), amod (2; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (82385; 87% instances), NOUN (4477; 5% instances), ADJ (3739; 4% instances), AUX (3258; 3% instances), DET (266; 0% instances), ADV (209; 0% instances), (164; 0% instances), PRON (108; 0% instances), NUM (91; 0% instances), PROPN (88; 0% instances), X (58; 0% instances), SCONJ (3; 0% instances), PART (1; 0% instances)

86970 (92%) PRON nodes are leaves.

6735 (7%) PRON nodes have one child.

630 (1%) PRON nodes have two children.

512 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 9.

Children of PRON nodes are attached using 26 different relations: case (6105; 61% instances), advmod (1068; 11% instances), nmod (796; 8% instances), punct (603; 6% instances), acl (364; 4% instances), cc (231; 2% instances), cop (230; 2% instances), nsubj (223; 2% instances), conj (151; 1% instances), appos (139; 1% instances), obl (45; 0% instances), det (27; 0% instances), aux (26; 0% instances), parataxis (20; 0% instances), ccomp (19; 0% instances), mark (14; 0% instances), advcl (4; 0% instances), flat (4; 0% instances), reparandum (4; 0% instances), xcomp (4; 0% instances), csubj (3; 0% instances), flat:name (3; 0% instances), amod (1; 0% instances), expl (1; 0% instances), nmod:poss (1; 0% instances), orphan (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (5964; 59% instances), ADV (934; 9% instances), NOUN (920; 9% instances), PUNCT (603; 6% instances), CCONJ (376; 4% instances), VERB (358; 4% instances), AUX (285; 3% instances), PROPN (162; 2% instances), ADJ (133; 1% instances), DET (130; 1% instances), PRON (108; 1% instances), PART (51; 1% instances), X (35; 0% instances), NUM (21; 0% instances), SCONJ (7; 0% instances)