Treebank Statistics: UD_Portuguese-Bosque: POS Tags: PROPN
There are 7150 PROPN
lemmas (35%), 7185 PROPN
types (25%) and 18757 PROPN
tokens (8%).
Out of 17 observed tags, the rank of PROPN
is: 1 in number of lemmas, 2 in number of types and 6 in number of tokens.
The 10 most frequent PROPN
lemmas: São, Paulo, Portugal, Brasil, José, Porto, Governo, Lisboa, João, Nacional
The 10 most frequent PROPN
types: Paulo, São, Portugal, Brasil, José, Porto, Governo, Lisboa, João, Nacional
The 10 most frequent ambiguous lemmas: Nacional (PROPN 79, ADJ 1), Câmara (PROPN 46, NOUN 1), the (PROPN 11, X 2), ministério (NOUN 25, PROPN 1), o (DET 27837, PRON 587, ADP 6, PROPN 5), polícia (NOUN 57, PROPN 1), a (ADP 3179, SCONJ 942, DET 22, PRON 19, PROPN 6, NOUN 4, ADV 2), Municipal (PROPN 22, ADJ 1), e (CCONJ 4257, PROPN 20, ADP 2, SCONJ 2, NOUN 1), educação (NOUN 28, PROPN 1)
The 10 most frequent ambiguous types: São (PROPN 141, AUX 30), Governo (PROPN 84, NOUN 7), Lisboa (PROPN 83, NOUN 1), Nacional (PROPN 78, ADJ 10, NOUN 1), Estados (PROPN 69, NOUN 13), Folha (PROPN 57, NOUN 13), Banco (PROPN 47, NOUN 3), Câmara (PROPN 46, NOUN 33), Nova (PROPN 43, ADJ 2), the (PROPN 11, X 1)
- São
- Governo
- Lisboa
- PROPN 83: Em Lisboa , só S. Pedro pode pregar uma partida
- NOUN 1: Optou por a pedagogia em os contactos que teve com os sinistrados e nem sequer esqueceu a sua experiência como autarca , recordando inúmeras vezes os tempos em que foi presidente de a Câmara de Lisboa para explicar as suas teorias – mais propriamente alguma « insatisfação natural » – sobre realojamentos .
- Nacional
- Estados
- Folha
- Banco
- Câmara
- Nova
- the
Morphology
The form / lemma ratio of PROPN
is 1.004895 (the average of all parts of speech is 1.423840).
The 1st highest number of forms (3) was observed with the lemma “Júnior”: JR., Jr., Júnior.
The 2nd highest number of forms (3) was observed with the lemma “São”: S., SÃO, São.
The 3rd highest number of forms (2) was observed with the lemma “Ana”: Ana, Anas.
PROPN
occurs with 6 features: Number (18752; 100% instances), Gender (11532; 61% instances), ExtPos (4103; 22% instances), Abbr (79; 0% instances), PronType (26; 0% instances), Typo (2; 0% instances)
PROPN
occurs with 9 feature-value pairs: Abbr=Yes
, ExtPos=NOUN
, ExtPos=PROPN
, Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
, PronType=Art
, Typo=Yes
PROPN
occurs with 30 feature combinations.
The most frequent feature combination is Number=Sing
(6995 tokens).
Examples: Paulo, Portugal, Nacional, São, Porto, Unidos, José, Brasil, Lisboa, Silva
Relations
PROPN
nodes are attached to their parents using 24 different relations: flat:name (5504; 29% instances), nmod (5096; 27% instances), nsubj (2290; 12% instances), appos (1697; 9% instances), obl (1357; 7% instances), conj (1231; 7% instances), obj (531; 3% instances), root (333; 2% instances), obl:agent (242; 1% instances), parataxis (209; 1% instances), nsubj:pass (86; 0% instances), iobj (64; 0% instances), xcomp (32; 0% instances), amod (28; 0% instances), advcl (15; 0% instances), compound (10; 0% instances), acl:relcl (9; 0% instances), flat (7; 0% instances), list (6; 0% instances), vocative (4; 0% instances), ccomp (2; 0% instances), csubj (2; 0% instances), acl (1; 0% instances), orphan (1; 0% instances)
Parents of PROPN
nodes belong to 13 different parts of speech: PROPN (8916; 48% instances), NOUN (4880; 26% instances), VERB (4119; 22% instances), (333; 2% instances), ADJ (243; 1% instances), ADV (93; 0% instances), PRON (74; 0% instances), NUM (54; 0% instances), DET (18; 0% instances), SYM (10; 0% instances), X (10; 0% instances), ADP (5; 0% instances), INTJ (2; 0% instances)
6603 (35%) PROPN
nodes are leaves.
3440 (18%) PROPN
nodes have one child.
4391 (23%) PROPN
nodes have two children.
4323 (23%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 21.
Children of PROPN
nodes are attached using 27 different relations: case (7130; 25% instances), flat:name (5610; 20% instances), det (5481; 19% instances), punct (4147; 14% instances), nmod (2135; 7% instances), conj (1247; 4% instances), appos (824; 3% instances), cc (697; 2% instances), parataxis (340; 1% instances), acl:relcl (326; 1% instances), amod (258; 1% instances), acl (171; 1% instances), advmod (95; 0% instances), cop (93; 0% instances), nsubj (81; 0% instances), nummod (44; 0% instances), mark (24; 0% instances), compound (12; 0% instances), obl (11; 0% instances), list (10; 0% instances), advcl (9; 0% instances), obj (9; 0% instances), flat (5; 0% instances), orphan (4; 0% instances), xcomp (3; 0% instances), csubj (2; 0% instances), ccomp (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PROPN (8916; 31% instances), ADP (7143; 25% instances), DET (5521; 19% instances), PUNCT (4147; 14% instances), NOUN (1027; 4% instances), CCONJ (759; 3% instances), VERB (539; 2% instances), ADJ (260; 1% instances), NUM (170; 1% instances), ADV (119; 0% instances), AUX (93; 0% instances), PRON (37; 0% instances), SCONJ (21; 0% instances), SYM (14; 0% instances), X (3; 0% instances)