Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: X
There are 344 X lemmas (4%), 363 X types (3%) and 1754 X tokens (2%).
Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 11 in number of tokens.
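The lemma, type and token counts above can be reproduced from a CoNLL-U file by grouping word lines by their UPOS column. The following is a minimal sketch, not the script that generated this page: `SAMPLE` is a made-up fragment, `pos_stats` is a hypothetical helper, and comparing forms case-sensitively is an assumption.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment (tab-separated, 10 columns); not real treebank data.
SAMPLE = """\
# text = RT #vale5 sobe
1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\t#vale5\t#vale5\tX\t_\t_\t3\tnsubj\t_\t_
3\tsobe\tsubir\tVERB\t_\t_\t0\troot\t_\t_

# text = RT #VALE5 cai
1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\t#VALE5\t#vale5\tPROPN\t_\t_\t3\tnsubj\t_\t_
3\tcai\tcair\tVERB\t_\t_\t0\troot\t_\t_
"""

def pos_stats(text):
    """Return {UPOS: (distinct lemmas, distinct forms, tokens)}."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in text.splitlines():
        cols = line.split("\t")
        # Keep ordinary word lines only; this skips comments, blank lines
        # and multiword-token ranges such as "1-2".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

stats = pos_stats(SAMPLE)
print(stats["X"])  # (lemmas, types, tokens) for the X tag
```

Sorting the per-tag values in descending order would give the ranks quoted above.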
The 10 most frequent X lemmas: RT, #vale5, #infomoney, #petr4, $LIGT3, _, #ibov, rsrsr, #bovespa, #BR
The 10 most frequent X types: RT, #vale5, #infomoney, #petr4, $LIGT3, #ibov, rsrsr, #bovespa, #BR, #PETR3
The 10 most frequent ambiguous lemmas: RT (X 314, PROPN 5), #vale5 (X 178, PROPN 63), #petr4 (PROPN 115, X 24), $LIGT3 (X 41, PROPN 5), _ (X 31, PUNCT 13, NOUN 4, ADJ 1, ADV 1), #ibov (X 8, PROPN 5), #bovespa (X 4, PROPN 3), #PETR3 (X 20, PROPN 19), $PETR4 (X 17, PROPN 8), #BBDC4 (PROPN 11, X 10)
The 10 most frequent ambiguous types: RT (X 314, PROPN 5), #vale5 (X 178, PROPN 63), #petr4 (PROPN 115, X 24), $LIGT3 (X 41, PROPN 5), #ibov (X 8, PROPN 5), #bovespa (X 4, PROPN 3), #PETR3 (X 20, PROPN 19), $PETR4 (X 17, PROPN 8), aprov (X 16, NOUN 3), Webcast (X 12, PROPN 4)
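A lemma counts as ambiguous here when it occurs with X and with at least one other tag. A minimal sketch of that grouping follows, using a made-up CoNLL-U fragment; `ambiguous_lemmas` is a hypothetical helper, not part of any UD tool.

```python
from collections import Counter, defaultdict

# Hypothetical CoNLL-U fragment; not real treebank data.
SAMPLE = """\
1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\t#vale5\t#vale5\tX\t_\t_\t3\tnsubj\t_\t_
3\tsobe\tsubir\tVERB\t_\t_\t0\troot\t_\t_

1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\t#VALE5\t#vale5\tPROPN\t_\t_\t3\tnsubj\t_\t_
3\tcai\tcair\tVERB\t_\t_\t0\troot\t_\t_
"""

def ambiguous_lemmas(text, upos="X"):
    """Return {lemma: {tag: count}} for lemmas seen with `upos` and another tag."""
    by_lemma = defaultdict(Counter)
    for line in text.splitlines():
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # comment, blank line or multiword-token range
        by_lemma[cols[2]][cols[3]] += 1
    return {
        lemma: dict(tags)
        for lemma, tags in by_lemma.items()
        if upos in tags and len(tags) > 1
    }

ambiguous = ambiguous_lemmas(SAMPLE)
print(ambiguous)
```

Keying the dictionary on the form column (`cols[1]`) instead of the lemma column would give the ambiguous types.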
Morphology
The form / lemma ratio of X is 1.055233 (the average of all parts of speech is 1.238049).
The highest number of forms (18) was observed with the lemma “_”: 6, 64, BROKER, Bancotario, abertura, bonificação, diretor, feira, final, lenga, market, niquel, onda, petr4, provento, sal, sena, side.
The second-highest number of forms (2) was observed with the lemma “#Petrobras”: #Petrobras, #Petrobrás.
The third-highest number of forms (2) was observed with the lemma “cai”: cai, cain.
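The reported ratio is consistent with dividing the number of X types by the number of X lemmas. The short check below uses only figures quoted on this page; reading the ratio this way is an assumption about how the page was generated.

```python
# Figures quoted on this page for the X tag.
x_types = 363
x_lemmas = 344

ratio = x_types / x_lemmas
print(f"{ratio:.6f}")

# The three multi-form lemmas listed above have 18, 2 and 2 forms.
# If every other lemma has exactly one form, the extra forms account
# for the whole gap between types and lemmas.
extra_forms = (18 - 1) + (2 - 1) + (2 - 1)
assert x_lemmas + extra_forms == x_types
```

A ratio this close to 1 fits a tag dominated by hashtags, tickers and other tokens that do not inflect.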
X occurs with 1 feature: Foreign (118; 7% instances)
X occurs with 1 feature-value pair: Foreign=Yes
X occurs with 2 feature combinations.
The most frequent feature combination is _, i.e. no features (1636 tokens).
Examples: RT, #vale5, #infomoney, #petr4, $LIGT3, #ibov, rsrsr, #bovespa, #BR, #PETR3
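The feature counts above come from the FEATS column of each X token, where `_` means that no features are set. A minimal sketch under that reading follows; the fragment in `SAMPLE` is made up and `feature_stats` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment; not real treebank data.
SAMPLE = """\
1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\tmarket\tmarket\tX\t_\tForeign=Yes\t3\tnmod\t_\t_
3\tsobe\tsubir\tVERB\t_\tMood=Ind|Number=Sing\t0\troot\t_\t_
"""

def feature_stats(text, upos="X"):
    features = Counter()  # feature name -> tokens carrying it
    pairs = Counter()     # "Feature=Value" -> tokens
    combos = Counter()    # full FEATS string -> tokens
    for line in text.splitlines():
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = cols[5]
        combos[feats] += 1
        if feats == "_":
            continue  # empty feature set: counts as a combination only
        for pair in feats.split("|"):
            pairs[pair] += 1
            features[pair.split("=", 1)[0]] += 1
    return features, pairs, combos

features, pairs, combos = feature_stats(SAMPLE)
print(dict(features), dict(pairs), dict(combos))
```

On this page the two combinations are `_` (1636 tokens) and `Foreign=Yes` (118 tokens), which together cover all 1754 X tokens.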
Relations
X nodes are attached to their parents using 23 different relations: parataxis (1341; 76% instances), discourse (181; 10% instances), nmod (81; 5% instances), flat:foreign (34; 2% instances), goeswith (26; 1% instances), dep (22; 1% instances), obj (14; 1% instances), obl (13; 1% instances), nsubj (7; 0% instances), root (7; 0% instances), conj (4; 0% instances), flat:name (4; 0% instances), vocative (4; 0% instances), case (3; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), fixed (2; 0% instances), list (2; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), obl:agent (1; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)
Parents of X nodes belong to 12 different parts of speech, counting the untagged artificial root as one of them: VERB (1181; 67% instances), NOUN (356; 20% instances), PROPN (77; 4% instances), X (48; 3% instances), ADJ (32; 2% instances), ADV (18; 1% instances), PRON (11; 1% instances), SYM (11; 1% instances), NUM (9; 1% instances), no tag, i.e. the root (7; 0% instances), AUX (2; 0% instances), INTJ (2; 0% instances)
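Both distributions above can be read off the HEAD and DEPREL columns: the relation is taken from the X token itself, and the parent's tag is looked up through the HEAD index within the same sentence. The sketch below assumes plain CoNLL-U input; `sentences` and `attachment_stats` are hypothetical helpers and `SAMPLE` is made up. A token whose HEAD is 0 hangs from the artificial root, which has no tag, so it is counted under an empty string.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment; not real treebank data.
SAMPLE = """\
# text = RT #vale5 sobe
1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_
2\t#vale5\t#vale5\tPROPN\t_\t_\t3\tnsubj\t_\t_
3\tsobe\tsubir\tVERB\t_\t_\t0\troot\t_\t_

# text = rsrsr
1\trsrsr\trsrsr\tX\t_\t_\t0\troot\t_\t_
"""

def sentences(text):
    """Yield each sentence as a list of 10-column word rows."""
    rows = []
    for line in text.splitlines():
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)
        elif not line.strip() and rows:
            yield rows  # a blank line closes the sentence
            rows = []
    if rows:
        yield rows

def attachment_stats(text, upos="X"):
    relations = Counter()   # DEPREL of each `upos` token
    parent_pos = Counter()  # UPOS of its head ("" when HEAD is 0)
    for rows in sentences(text):
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] != upos:
                continue
            relations[r[7]] += 1
            parent_pos[pos_by_id.get(r[6], "")] += 1
    return relations, parent_pos

relations, parent_pos = attachment_stats(SAMPLE)
print(dict(relations), dict(parent_pos))
```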
1144 (65%) X nodes are leaves.
194 (11%) X nodes have one child.
332 (19%) X nodes have two children.
84 (5%) X nodes have three or more children.
The highest child degree of an X node is 10.
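The child degree of a node is the number of tokens in the same sentence whose HEAD points to it. A minimal sketch of the histogram follows, again on a made-up fragment; `sentences` and `child_degree_histogram` are hypothetical helpers.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment; not real treebank data.
SAMPLE = """\
1\t$LIGT3\t$LIGT3\tX\t_\t_\t0\troot\t_\t_
2\tem\tem\tADP\t_\t_\t3\tcase\t_\t_
3\talta\talta\tNOUN\t_\t_\t1\tnmod\t_\t_
4\t!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_

1\tRT\tRT\tX\t_\t_\t2\tparataxis\t_\t_
2\tsobe\tsubir\tVERB\t_\t_\t0\troot\t_\t_
"""

def sentences(text):
    """Yield each sentence as a list of 10-column word rows."""
    rows = []
    for line in text.splitlines():
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)
        elif not line.strip() and rows:
            yield rows
            rows = []
    if rows:
        yield rows

def child_degree_histogram(text, upos="X"):
    """Return {number of children: number of `upos` nodes}."""
    hist = Counter()
    for rows in sentences(text):
        n_children = Counter(r[6] for r in rows)  # HEAD id -> child count
        for r in rows:
            if r[3] == upos:
                hist[n_children[r[0]]] += 1  # missing key counts as 0 children
    return hist

hist = child_degree_histogram(SAMPLE)
print(dict(hist), max(hist))

# The four bins reported above cover every X token on this page.
assert 1144 + 194 + 332 + 84 == 1754
```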
Children of X nodes are attached using 24 different relations: punct (593; 50% instances), nmod (370; 31% instances), conj (40; 3% instances), flat:foreign (34; 3% instances), case (31; 3% instances), det (30; 3% instances), appos (27; 2% instances), parataxis (15; 1% instances), vocative (10; 1% instances), nsubj (7; 1% instances), advmod (6; 1% instances), amod (6; 1% instances), flat:name (4; 0% instances), cop (3; 0% instances), discourse (3; 0% instances), nummod (3; 0% instances), obl (3; 0% instances), cc (2; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), list (2; 0% instances), mark (2; 0% instances), acl (1; 0% instances), obj (1; 0% instances)
Children of X nodes belong to 16 different parts of speech: PUNCT (593; 50% instances), PROPN (379; 32% instances), NOUN (60; 5% instances), X (48; 4% instances), ADP (31; 3% instances), DET (30; 3% instances), NUM (21; 2% instances), SYM (7; 1% instances), ADJ (6; 1% instances), ADV (6; 1% instances), PRON (4; 0% instances), VERB (4; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances)