home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: PROPN

There are 3968 PROPN lemmas (28%), 3968 PROPN types (19%) and 10905 PROPN tokens (6%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent PROPN types: Brasil, Paulo, São, EUA, Rio, Temer, JBS, Folha, Estado, Doria

The 10 most frequent ambiguous lemmas: the (PROPN 4, X 1), in (PROPN 18, X 3), a (ADP 2153, PROPN 3, NOUN 2), PM (PROPN 6, NOUN 1), and (PROPN 4, X 1), to (PROPN 4, X 1), 5 (NUM 25, PROPN 3), like (NOUN 2, PROPN 1), 55 (NUM 5, PROPN 2), se (PRON 731, SCONJ 275, PROPN 1)

The 10 most frequent ambiguous types: São (PROPN 135, AUX 35, VERB 4), Justiça (PROPN 48, NOUN 1), Polícia (PROPN 30, NOUN 2), Copa (PROPN 27, NOUN 4), the (PROPN 4, X 1), Norte (PROPN 22, NOUN 1), Brasileiro (PROPN 20, NOUN 1), in (PROPN 18, X 3), União (PROPN 19, NOUN 2), Exército (PROPN 17, NOUN 2)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.496159).

The 1st highest number of forms (1) was observed with the lemma “#Tamojunto”: #Tamojunto.

The 2nd highest number of forms (1) was observed with the lemma “1-Azul”: 1-Azul.

The 3rd highest number of forms (1) was observed with the lemma “157”: 157.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 22 different relations: flat:name (3355; 31% instances), nmod (3037; 28% instances), nsubj (1828; 17% instances), obl (989; 9% instances), conj (584; 5% instances), obj (319; 3% instances), appos (296; 3% instances), obl:agent (139; 1% instances), parataxis (118; 1% instances), root (82; 1% instances), nsubj:pass (61; 1% instances), advcl (34; 0% instances), xcomp (20; 0% instances), list (9; 0% instances), vocative (9; 0% instances), ccomp:speech (6; 0% instances), dislocated (5; 0% instances), acl:relcl (4; 0% instances), orphan (4; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), csubj (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: PROPN (4232; 39% instances), NOUN (3183; 29% instances), VERB (3065; 28% instances), ADJ (127; 1% instances), (82; 1% instances), PRON (79; 1% instances), ADV (74; 1% instances), NUM (30; 0% instances), SYM (15; 0% instances), X (14; 0% instances), AUX (3; 0% instances), INTJ (1; 0% instances)

3901 (36%) PROPN nodes are leaves.

2203 (20%) PROPN nodes have one child.

2681 (25%) PROPN nodes have two children.

2120 (19%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 25 different relations: case (4002; 26% instances), det (3393; 22% instances), flat:name (3356; 22% instances), punct (2162; 14% instances), conj (616; 4% instances), cc (451; 3% instances), appos (390; 3% instances), nmod (337; 2% instances), parataxis (181; 1% instances), acl:relcl (162; 1% instances), amod (143; 1% instances), cop (89; 1% instances), nsubj (64; 0% instances), acl (60; 0% instances), advmod (57; 0% instances), mark (41; 0% instances), list (17; 0% instances), nummod (16; 0% instances), orphan (14; 0% instances), advcl (11; 0% instances), ccomp (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), obj (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (4232; 27% instances), ADP (4013; 26% instances), DET (3393; 22% instances), PUNCT (2162; 14% instances), NOUN (554; 4% instances), CCONJ (448; 3% instances), VERB (212; 1% instances), NUM (160; 1% instances), ADJ (154; 1% instances), AUX (89; 1% instances), ADV (71; 0% instances), PRON (33; 0% instances), SCONJ (20; 0% instances), SYM (19; 0% instances), X (7; 0% instances)