
This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-LinES: POS Tags: PROPN

There are 756 PROPN lemmas (7%), 839 PROPN types (6%) and 2860 PROPN tokens (3%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 11 in number of tokens.

The 10 most frequent PROPN lemmas: Harry, Quinn, Stillman, Auster, Bray, Ron, Weasley, Access, Mweta, Dobby

The 10 most frequent PROPN types: Harry, Quinn, Stillman, Bray, Auster, Access, Microsoft, Weasley, Ron, Dobby

The most frequent ambiguous lemmas (only 4 are listed): förenad (PROPN 3, ADJ 2), data (NOUN 50, PROPN 2), Drake (NOUN 1, PROPN 1), link (NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: gud (NOUN 2, PROPN 1), Andra (PROPN 5, ADJ 4, PRON 1), Förenta (PROPN 3, ADJ 1), Le (DET 5, PROPN 3), Web (NOUN 6, PROPN 3), A (PROPN 2, NOUN 1), Data (PROPN 2, NOUN 1), Kategori (NOUN 3, PROPN 2), van (ADJ 6, PROPN 2), År (PROPN 2, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 1.109788 (the average of all parts of speech is 1.414579).
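The ratio above is consistent with dividing the number of PROPN types (839) by the number of PROPN lemmas (756), both reported earlier on this page. The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and it is an assumption that the page generator computes the ratio exactly this way.

```python
# Counts reported on this page for PROPN in UD_Swedish-LinES.
propn_types = 839   # distinct word forms tagged PROPN
propn_lemmas = 756  # distinct lemmas tagged PROPN


def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (hypothetical helper)."""
    if n_lemmas == 0:
        raise ValueError("no lemmas observed")
    return n_types / n_lemmas


ratio = form_lemma_ratio(propn_types, propn_lemmas)
print(f"{ratio:.6f}")  # prints 1.109788
```

A ratio close to 1 means that most proper nouns appear in a single form; the extra forms are mostly genitives (Brays) and all-caps variants (BRAY).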

The three lemmas listed with the most forms each have 3 forms: “Bray” (BRAY, Bray, Brays), “Lockman” (LOCKMAN, Lockman, Lockmans) and “Voldemort” (Vol, Voldemort, Voldemorts).

PROPN occurs with 6 features: Case (2848; 100% instances), Number (518; 18% instances), Gender (25; 1% instances), Definite (8; 0% instances), Foreign (2; 0% instances), Degree (1; 0% instances)

PROPN occurs with 10 feature-value pairs: Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Pos, Foreign=Yes, Gender=Com, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 16 feature combinations. The most frequent feature combination is Case=Nom (2084 tokens). Examples: Harry, Quinn, Stillman, Bray, Auster, Access, Microsoft, Ron, Weasley, Mweta

Relations

PROPN nodes are attached to their parents using 23 different relations: nsubj (905; 32% instances), nmod (404; 14% instances), flat (376; 13% instances), obl (366; 13% instances), nmod:poss (236; 8% instances), obj (179; 6% instances), conj (163; 6% instances), root (48; 2% instances), vocative (37; 1% instances), appos (35; 1% instances), nsubj:pass (27; 1% instances), compound (19; 1% instances), xcomp (16; 1% instances), obl:agent (11; 0% instances), parataxis (10; 0% instances), dislocated (8; 0% instances), iobj (7; 0% instances), ccomp (6; 0% instances), discourse (3; 0% instances), acl:cleft (1; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), orphan (1; 0% instances)
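A relation breakdown like the one above could be reproduced from a CoNLL-U file roughly as follows. This is a sketch, not the script that generated this page: the function name `deprel_counts` is hypothetical, and the sample sentence is hand-made for illustration, not taken from the treebank.

```python
from collections import Counter

# Hand-made CoNLL-U fragment for illustration only (NOT from UD_Swedish-LinES).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = Harry såg Ron .",
    "1\tHarry\tHarry\tPROPN\t_\tCase=Nom\t2\tnsubj\t_\t_",
    "2\tsåg\tse\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tRon\tRon\tPROPN\t_\tCase=Nom\t2\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def deprel_counts(conllu: str, upos: str = "PROPN") -> Counter:
    """Count the DEPREL values of all tokens with the given UPOS tag."""
    counts = Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts


counts = deprel_counts(SAMPLE)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

To run it on a real treebank, read the file contents into a string and pass that string in place of `SAMPLE`.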

Parents of PROPN nodes belong to 10 different categories (9 parts of speech plus the artificial root, which has no tag): VERB (1442; 50% instances), NOUN (691; 24% instances), PROPN (548; 19% instances), ADJ (75; 3% instances), artificial root, no tag (48; 2% instances), PRON (27; 1% instances), ADV (16; 1% instances), AUX (8; 0% instances), NUM (3; 0% instances), INTJ (2; 0% instances)

1390 (49%) PROPN nodes are leaves.

876 (31%) PROPN nodes have one child.

377 (13%) PROPN nodes have two children.

217 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.
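The child-degree figures above can be derived by counting, for each PROPN token, how many tokens in the same sentence name it as their HEAD. The sketch below shows one way to do that; `child_degrees` is a hypothetical helper and the sample sentence is hand-made, not taken from the treebank. The last lines check that the four buckets reported on this page add up to the 2860 PROPN tokens.

```python
from collections import Counter

# Hand-made CoNLL-U fragment for illustration only (NOT from UD_Swedish-LinES).
SAMPLE = "\n".join([
    "1\tHarry\tHarry\tPROPN\t_\t_\t3\tnsubj\t_\t_",
    "2\tPotter\tPotter\tPROPN\t_\t_\t1\tflat\t_\t_",
    "3\tsov\tsova\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
])


def child_degrees(conllu: str, upos: str = "PROPN") -> list:
    """Return the number of children of every token tagged `upos`."""
    degrees = []
    for block in conllu.strip().split("\n\n"):  # one block per sentence
        rows = [ln.split("\t") for ln in block.splitlines()
                if ln and not ln.startswith("#")]
        # Keep regular tokens only (skip ranges like 1-2 and empty nodes like 1.1).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD id -> number of children
        degrees.extend(n_children[r[0]] for r in rows if r[3] == upos)
    return degrees


degrees = child_degrees(SAMPLE)
buckets = Counter(min(d, 3) for d in degrees)  # key 3 means "three or more"
print(dict(buckets), "max degree:", max(degrees))

# Sanity check on the figures reported on this page.
reported = {"leaves": 1390, "one child": 876, "two children": 377, "three or more": 217}
reported_total = sum(reported.values())
print(reported_total)  # prints 2860
```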

Children of PROPN nodes are attached using 28 different relations: case (705; 29% instances), flat (386; 16% instances), nmod (353; 15% instances), punct (265; 11% instances), conj (192; 8% instances), cc (123; 5% instances), amod (66; 3% instances), acl:relcl (55; 2% instances), advmod (42; 2% instances), det (39; 2% instances), appos (38; 2% instances), cop (32; 1% instances), nsubj (22; 1% instances), nummod (22; 1% instances), nmod:poss (14; 1% instances), parataxis (11; 0% instances), discourse (9; 0% instances), mark (9; 0% instances), acl (8; 0% instances), acl:cleft (8; 0% instances), expl (8; 0% instances), aux (6; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), obl (3; 0% instances), vocative (2; 0% instances), csubj (1; 0% instances), orphan (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: ADP (705; 29% instances), PROPN (548; 23% instances), NOUN (423; 17% instances), PUNCT (265; 11% instances), CCONJ (124; 5% instances), VERB (81; 3% instances), ADJ (75; 3% instances), ADV (40; 2% instances), DET (39; 2% instances), AUX (38; 2% instances), PRON (38; 2% instances), NUM (24; 1% instances), INTJ (9; 0% instances), SCONJ (8; 0% instances), PART (6; 0% instances), X (3; 0% instances)