home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-ESL: POS Tags: PROPN

There are 1 PROPN lemmas (6%), 1 PROPN types (6%) and 1795 PROPN tokens (2%). Out of 17 observed tags, the rank of PROPN is: 12 in number of lemmas, 12 in number of types and 13 in number of tokens.

The 10 most frequent PROPN lemmas: _

The 10 most frequent PROPN types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

The 10 most frequent ambiguous types: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 27 different relations: obl (367; 20% instances), compound (315; 18% instances), nmod (271; 15% instances), nsubj (197; 11% instances), flat (167; 9% instances), obj (98; 5% instances), conj (93; 5% instances), appos (67; 4% instances), root (51; 3% instances), nmod:poss (36; 2% instances), amod (29; 2% instances), vocative (13; 1% instances), nsubj:pass (11; 1% instances), obl:tmod (11; 1% instances), xcomp (11; 1% instances), acl:relcl (9; 1% instances), advcl (8; 0% instances), case (8; 0% instances), ccomp (8; 0% instances), det (5; 0% instances), parataxis (5; 0% instances), csubj:pass (4; 0% instances), csubj (3; 0% instances), obl:npmod (3; 0% instances), iobj (2; 0% instances), list (2; 0% instances), nmod:npmod (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: VERB (618; 34% instances), PROPN (565; 31% instances), NOUN (465; 26% instances), ADJ (62; 3% instances), (51; 3% instances), NUM (15; 1% instances), ADV (9; 1% instances), PRON (4; 0% instances), ADP (2; 0% instances), AUX (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

705 (39%) PROPN nodes are leaves.

554 (31%) PROPN nodes have one child.

240 (13%) PROPN nodes have two children.

296 (16%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 12.

Children of PROPN nodes are attached using 29 different relations: case (737; 32% instances), compound (266; 12% instances), punct (233; 10% instances), det (187; 8% instances), flat (167; 7% instances), conj (108; 5% instances), cop (88; 4% instances), cc (83; 4% instances), nsubj (80; 3% instances), amod (67; 3% instances), advmod (60; 3% instances), appos (40; 2% instances), acl:relcl (35; 2% instances), nmod (34; 1% instances), mark (24; 1% instances), nummod (21; 1% instances), obl (17; 1% instances), advcl (16; 1% instances), aux (13; 1% instances), nmod:poss (10; 0% instances), acl (4; 0% instances), parataxis (4; 0% instances), obl:tmod (3; 0% instances), list (2; 0% instances), cc:preconj (1; 0% instances), det:predet (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), nmod:npmod (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (686; 30% instances), PROPN (565; 25% instances), PUNCT (233; 10% instances), DET (199; 9% instances), NOUN (131; 6% instances), AUX (101; 4% instances), CCONJ (82; 4% instances), ADV (59; 3% instances), VERB (56; 2% instances), PART (52; 2% instances), ADJ (48; 2% instances), NUM (38; 2% instances), PRON (35; 2% instances), SCONJ (18; 1% instances), SYM (1; 0% instances)