Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: PUNCT
There are 39 PUNCT
lemmas (0%), 38 PUNCT
types (0%) and 13126 PUNCT
tokens (16%).
Out of 16 observed tags, the rank of PUNCT
is: 11 in number of lemmas, 14 in number of types and 1 in number of tokens.
The 10 most frequent PUNCT
lemmas: ,, ., -, :, !, …, (, ), ?, ‘
The 10 most frequent PUNCT
types: ,, ., -, :, !, …, (, ), ?, ‘
The 10 most frequent ambiguous lemmas: - (PUNCT 1787, SYM 329), ’ (PUNCT 258, SYM 15), / (PUNCT 54, SYM 1), > (PUNCT 20, SYM 4), _ (X 31, PUNCT 13, NOUN 4, ADJ 1, ADV 1), * (PUNCT 12, SYM 1), = (PUNCT 10, SYM 1), => (PUNCT 7, SYM 3), + (SYM 253, PUNCT 1), =) (SYM 20, PUNCT 1)
The 10 most frequent ambiguous types: - (PUNCT 1787, SYM 329), ’ (PUNCT 258, SYM 15), / (PUNCT 54, SYM 1), > (PUNCT 20, SYM 4), * (PUNCT 12, SYM 1), = (AUX 17, PUNCT 11, SYM 1), l (PUNCT 8, NOUN 2), => (PUNCT 7, SYM 3), + (SYM 253, ADV 10, PUNCT 1)
- -
- ’
- /
- >
- *
- =
- l
- =>
- +
- SYM 253: Petr4 15,68 + 2,35 % Cerveró falando sem parar … #CPIdaPTbras
- ADV 10: ITUB4 ja negociou + de 300 M ? Ta certo meu sistema aqui ? @ferrisss @dfittarelli @JPedro_Sullivan
- PUNCT 1: > > > Cemig ( CMIG4 ) : Julgamento de o mandado sobre Jaguara é suspenso > > > ( + ) Cemig ( CMIG4 ) : Julgamento de o mandado … http://t.co/M9A5yw7dU0
Morphology
The form / lemma ratio of PUNCT
is 0.974359 (the average of all parts of speech is 1.238049).
The 1st highest number of forms (2) was observed with the lemma “_”: ), _.
The 2nd highest number of forms (2) was observed with the lemma “ | ”: l, | </em>. |
The 3rd highest number of forms (1) was observed with the lemma “!”: !.
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (13126; 100% instances)
Parents of PUNCT
nodes belong to 14 different parts of speech: PROPN (3937; 30% instances), VERB (3571; 27% instances), NOUN (3227; 25% instances), SYM (600; 5% instances), X (593; 5% instances), ADJ (383; 3% instances), NUM (327; 2% instances), ADV (216; 2% instances), PRON (133; 1% instances), INTJ (79; 1% instances), AUX (29; 0% instances), CCONJ (17; 0% instances), ADP (8; 0% instances), PUNCT (6; 0% instances)
13122 (100%) PUNCT
nodes are leaves.
2 (0%) PUNCT
nodes have one child.
2 (0%) PUNCT
nodes have two children.
The highest child degree of a PUNCT
node is 2.
Children of PUNCT
nodes are attached using 1 different relations: punct (6; 100% instances)
Children of PUNCT
nodes belong to 1 different parts of speech: PUNCT (6; 100% instances)