Treebank Statistics: UD_Portuguese-Bosque: POS Tags: NOUN
There are 6782 NOUN
lemmas (33%), 8628 NOUN
types (30%) and 41386 NOUN
tokens (18%).
Out of 17 observed tags, the rank of NOUN
is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: ano, dia, país, presidente, vez, empresa, pessoa, estado, tempo, parte
The 10 most frequent NOUN
types: anos, presidente, ano, dia, país, pessoas, estado, parte, tempo, contos
The 10 most frequent ambiguous lemmas: dia (NOUN 305, PROPN 1), vez (NOUN 194, ADP 12, PROPN 5), tempo (NOUN 154, ADJ 2), parte (NOUN 153, ADV 2), caso (NOUN 145, SCONJ 23), acordo (NOUN 113, ADJ 1), forma (NOUN 106, ADV 1), cidade (NOUN 105, PROPN 1), número (NOUN 93, PROPN 1), candidato (NOUN 83, ADJ 1)
The 10 most frequent ambiguous types: dia (NOUN 187, PROPN 1), estado (NOUN 34, AUX 3, VERB 2), parte (NOUN 129, VERB 2), tempo (NOUN 132, ADJ 2), vez (NOUN 120, ADP 12, PROPN 5), caso (NOUN 107, SCONJ 17), acordo (NOUN 102, ADJ 1), trabalho (NOUN 101, VERB 1), forma (NOUN 92, ADV 1), cidade (NOUN 81, PROPN 1)
- dia
- estado
- NOUN 34: Folha – O estado de sítio seria um golpe ?
- AUX 3: As conversações de ontem , presididas por Mario Raffaelli , o mesmo que dirige o processo de paz moçambicano , começaram sem eles , embora a própria Arménia tenha estado presente .
- VERB 2: Enquanto isso , os universitários de o Norte têm estado a reunir se e a estudar cada um de os princípios apresentados por o titular de a pasta de a Educação .
- parte
- tempo
- vez
- caso
- acordo
- NOUN 102: Não houve acordo para uma trégua durante a Copa .
- ADJ 1: O ex-ministro de a Fazenda Ernane Galvêas ( governo Figueiredo ) disse que sua participação em o episódio limitou se a um « de acordo » em o processo que teria vindo pronto de o Banco Nacional de Habitação , subordinado em a época a o Ministério de o Interior .
- trabalho
- forma
- cidade
- NOUN 81: Onde a « cidade de o homem , não de o lobo mas irmão » ?
- PROPN 1: Em o segundo centro populacional de Moçambique , a cidade de a Beira , os portugueses residentes conseguiram , por iniciativa própria , criar uma escola que lecciona o ensino primário e que para o ano vai iniciar se como escola preparatória .
Morphology
The form / lemma ratio of NOUN
is 1.272191 (the average of all parts of speech is 1.423840).
The 1st highest number of forms (4) was observed with the lemma “bom”: bom, bons, melhores, ótimo.
The 2nd highest number of forms (4) was observed with the lemma “homem”: homem, homens, mulher, mulheres.
The 3rd highest number of forms (4) was observed with the lemma “número”: nº, nº., número, números.
NOUN
occurs with 8 features: Number (41339; 100% instances), Gender (41220; 100% instances), ExtPos (272; 1% instances), Abbr (147; 0% instances), Typo (12; 0% instances), NumType (3; 0% instances), Foreign (1; 0% instances), Voice (1; 0% instances)
NOUN
occurs with 11 feature-value pairs: Abbr=Yes
, ExtPos=NOUN
, ExtPos=PROPN
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, NumType=Ord
, Number=Plur
, Number=Sing
, Typo=Yes
, Voice=Pass
NOUN
occurs with 29 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing
(15297 tokens).
Examples: presidente, ano, dia, país, estado, tempo, grupo, governo, caso, acordo
Relations
NOUN
nodes are attached to their parents using 28 different relations: nmod (12776; 31% instances), obj (7736; 19% instances), obl (7491; 18% instances), nsubj (5433; 13% instances), conj (2626; 6% instances), root (1378; 3% instances), appos (1037; 3% instances), nsubj:pass (731; 2% instances), obl:agent (471; 1% instances), xcomp (355; 1% instances), iobj (344; 1% instances), fixed (299; 1% instances), parataxis (230; 1% instances), ccomp (127; 0% instances), compound (118; 0% instances), acl:relcl (70; 0% instances), advcl (67; 0% instances), amod (28; 0% instances), csubj (21; 0% instances), acl (13; 0% instances), vocative (10; 0% instances), flat:name (7; 0% instances), list (6; 0% instances), dislocated (5; 0% instances), orphan (4; 0% instances), discourse (1; 0% instances), flat (1; 0% instances), flat:foreign (1; 0% instances)
Parents of NOUN
nodes belong to 17 different parts of speech: VERB (20522; 50% instances), NOUN (15092; 36% instances), (1378; 3% instances), ADJ (1258; 3% instances), PROPN (1027; 2% instances), ADV (634; 2% instances), NUM (529; 1% instances), PRON (390; 1% instances), ADP (329; 1% instances), SYM (126; 0% instances), DET (64; 0% instances), X (15; 0% instances), SCONJ (14; 0% instances), AUX (4; 0% instances), PART (2; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
1344 (3%) NOUN
nodes are leaves.
8775 (21%) NOUN
nodes have one child.
13513 (33%) NOUN
nodes have two children.
17754 (43%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 13.
Children of NOUN
nodes are attached using 37 different relations: det (28390; 28% instances), case (22984; 23% instances), nmod (15418; 15% instances), amod (9046; 9% instances), punct (8160; 8% instances), conj (2656; 3% instances), acl (2415; 2% instances), appos (2031; 2% instances), nummod (2016; 2% instances), acl:relcl (1992; 2% instances), cc (1963; 2% instances), cop (1148; 1% instances), advmod (1113; 1% instances), nsubj (734; 1% instances), mark (293; 0% instances), parataxis (235; 0% instances), compound (194; 0% instances), obl (170; 0% instances), advcl (136; 0% instances), csubj (46; 0% instances), xcomp (32; 0% instances), flat:name (24; 0% instances), aux (21; 0% instances), ccomp (13; 0% instances), list (9; 0% instances), obj (7; 0% instances), discourse (6; 0% instances), flat (6; 0% instances), orphan (5; 0% instances), fixed (4; 0% instances), iobj (4; 0% instances), det:poss (3; 0% instances), dislocated (2; 0% instances), vocative (2; 0% instances), aux:pass (1; 0% instances), flat:foreign (1; 0% instances), reparandum (1; 0% instances)
Children of NOUN
nodes belong to 17 different parts of speech: DET (28374; 28% instances), ADP (23106; 23% instances), NOUN (15092; 15% instances), ADJ (9236; 9% instances), PUNCT (8160; 8% instances), PROPN (4880; 5% instances), VERB (4656; 5% instances), NUM (2457; 2% instances), CCONJ (1909; 2% instances), ADV (1249; 1% instances), AUX (1174; 1% instances), PRON (529; 1% instances), SCONJ (290; 0% instances), SYM (124; 0% instances), X (40; 0% instances), PART (3; 0% instances), INTJ (2; 0% instances)