This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: PUNCT

There are 33 PUNCT lemmas (0%), 55 PUNCT types (0%) and 41422 PUNCT tokens (13%). Out of 16 observed tags, the rank of PUNCT is: 10 in number of lemmas, 13 in number of types and 4 in number of tokens.
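As a rough illustration of how counts like these can be obtained, the sketch below tallies distinct lemmas, distinct types (surface forms) and tokens for one UPOS tag in CoNLL-U text. The function name `pos_counts`, the helper `row` and the sample sentence are hypothetical, and the forms are compared exactly as written; the script that generated this page may normalise them differently.

```python
def pos_counts(conllu_text, upos="PUNCT"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only ordinary word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos, head, deprel):
    """Build one CoNLL-U word line (hypothetical helper for the sample below)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Hand-made sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Sim, claro....",
    row(1, "Sim", "sim", "ADV", 0, "root"),
    row(2, ",", ",", "PUNCT", 3, "punct"),
    row(3, "claro", "claro", "ADJ", 1, "conj"),
    row(4, "...", "_", "PUNCT", 1, "punct"),
    row(5, ".", "_", "PUNCT", 1, "punct"),
])

print(pos_counts(SAMPLE))
```

In the sample, two different forms share the lemma `_`, which is why the lemma count can be lower than the type count.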

The 10 most frequent PUNCT lemmas: ,, ., “, ), (, -, _, :, /, ?

The 10 most frequent PUNCT types: ,, ., “, ), (, -, –, :, ‘, /

The 10 most frequent ambiguous lemmas: - (PUNCT 1808, PRON 2), _ (PROPN 26803, ADP 7821, PRON 6131, DET 3765, NOUN 3010, NUM 2377, AUX 1984, CCONJ 1516, PUNCT 1272, VERB 1077, SYM 904, ADJ 597, PART 561, X 379, ADV 191, SCONJ 3), ² (PUNCT 24, ADJ 11), ’s (PUNCT 16, PROPN 4, PART 3), + (PUNCT 8, PROPN 1), @ (PROPN 1, PUNCT 1)

The 10 most frequent ambiguous types: . (PUNCT 11433, NUM 1), (PUNCT 2895, NOUN 5), - (PUNCT 1808, PRON 2), (PUNCT 210, NOUN 7, PART 1), º (PUNCT 89, NOUN 27, ADJ 7, PROPN 6), ª (PUNCT 86, ADJ 13, NOUN 4, PROPN 3), ² (PUNCT 52, ADJ 11, NOUN 1), ’s (PUNCT 16, PART 5, PROPN 5), x (PUNCT 13, ADP 3, NOUN 1, X 1), ° (PUNCT 12, NOUN 11)
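The two ambiguity lists can be read as follows: a lemma (or type) is ambiguous if it occurs with PUNCT and with at least one other tag. A minimal sketch of that idea, assuming the input is a list of (key, UPOS) pairs where the key is either a lemma or a form; the function name `ambiguous` and the sample pairs are illustrative only.

```python
from collections import Counter, defaultdict


def ambiguous(pairs, upos="PUNCT"):
    """Map each key seen with `upos` and at least one other tag to its tag counts."""
    tags = defaultdict(Counter)
    for key, tag in pairs:
        tags[key][tag] += 1
    return {key: counts for key, counts in tags.items()
            if upos in counts and len(counts) > 1}


# Hand-made pairs, not taken from the treebank.
PAIRS = [("-", "PUNCT"), ("-", "PUNCT"), ("-", "PRON"),
         (",", "PUNCT"), ("x", "ADP")]

for key, counts in ambiguous(PAIRS).items():
    print(key, counts.most_common())
```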

Morphology

The form / lemma ratio of PUNCT is 1.666667 (the average of all parts of speech is 2.236183).
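The reported ratio agrees with the number of PUNCT types divided by the number of PUNCT lemmas given earlier on this page (55 / 33). The sketch below reproduces that arithmetic and shows one way to compute the same quantity from (form, lemma) pairs; `form_lemma_ratio` and the sample pairs are hypothetical names and data, and the interpretation of the ratio is inferred from the figures rather than from the generating script.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group the distinct forms observed under each lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    n_types = len({form for form, _ in pairs})
    n_lemmas = len({lemma for _, lemma in pairs})
    return n_types / n_lemmas


# Figures from this page: 55 PUNCT types and 33 PUNCT lemmas.
print(f"{55 / 33:.6f}")

# Hand-made pairs: the form "." appears under two lemmas but counts as one type.
PAIRS = [(".", "."), (".", "_"), ("…", "_"), ("!", "!")]
print(form_lemma_ratio(PAIRS))
```

Because one form can appear under several lemmas, summing the per-lemma form counts may exceed the number of distinct types.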

The 1st highest number of forms (38) was observed with the lemma “_”: #, ‘, ‘’, **, –, ., .., …, …., ……, ;, =, ==, A.D., AC, I., [, \, ], ^, _, a.C., d.C., x, {, |, }, §, ª, «, °, ², ³, ¹, º, », ¿, •.

The 2nd highest number of forms (1) was observed with the lemma “!”: !.

The 3rd highest number of forms (1) was observed with the lemma “””: .

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 relation: punct (41422; 100% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (18778; 45% instances), NOUN (10035; 24% instances), PROPN (7499; 18% instances), NUM (1628; 4% instances), ADV (1217; 3% instances), ADJ (951; 2% instances), PRON (625; 2% instances), PART (282; 1% instances), SYM (205; 0% instances), X (91; 0% instances), CCONJ (46; 0% instances), ADP (40; 0% instances), DET (15; 0% instances), PUNCT (6; 0% instances), AUX (4; 0% instances)
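A distribution like this can be tallied by looking up, for each PUNCT node, the UPOS tag of the word its HEAD column points to. The following is a minimal sketch under the assumption that the input is CoNLL-U text with sentences separated by blank lines; `parent_pos`, `share` and the sample sentences are hypothetical.

```python
from collections import Counter


def parent_pos(conllu_text, upos="PUNCT"):
    """Count the UPOS tags of the heads of all nodes tagged `upos`."""
    counts = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()]
        # Keep ordinary word lines only (10 columns, integer ID).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            # HEAD "0" is the artificial root and has no UPOS, so it is skipped.
            if r[3] == upos and r[6] in pos_by_id:
                counts[pos_by_id[r[6]]] += 1
    return counts


def share(count, total):
    """Percentage rounded to a whole number, as on this page."""
    return round(100 * count / total)


def row(idx, form, upos, head, deprel):
    """Build one CoNLL-U word line (lemma set equal to the form for brevity)."""
    return "\t".join([str(idx), form, form, upos, "_", "_", str(head), deprel, "_", "_"])


# Two hand-made sentences, not taken from the treebank.
SAMPLE = "\n\n".join([
    "\n".join([row(1, "Vamos", "VERB", 0, "root"),
               row(2, "!", "PUNCT", 1, "punct")]),
    "\n".join([row(1, "Lisboa", "PROPN", 0, "root"),
               row(2, "(", "PUNCT", 3, "punct"),
               row(3, "Portugal", "PROPN", 1, "appos"),
               row(4, ")", "PUNCT", 3, "punct")]),
])

print(parent_pos(SAMPLE).most_common())
print(share(18778, 41422))  # VERB parents out of all PUNCT tokens
```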

41418 (100%) PUNCT nodes are leaves.

2 (0%) PUNCT nodes have one child.

2 (0%) PUNCT nodes have two children.

The highest child degree of a PUNCT node is 2.

Children of PUNCT nodes are attached using 1 relation: punct (6; 100% instances)

Children of PUNCT nodes belong to 1 part of speech: PUNCT (6; 100% instances)
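The child-degree figures in this section are mutually consistent: 41418 + 2 + 2 nodes add up to the 41422 PUNCT tokens, and 2 × 1 + 2 × 2 gives the 6 children reported. The sketch below checks that arithmetic and shows one way to build such a histogram from a mapping of node to head; the names `degree_histogram`, `REPORTED_DEGREES` and the sample tree are illustrative assumptions.

```python
from collections import Counter

# Figures reported on this page for PUNCT nodes: number of children -> number of nodes.
REPORTED_DEGREES = {0: 41418, 1: 2, 2: 2}


def total_nodes(degrees):
    """Total number of nodes covered by the histogram."""
    return sum(degrees.values())


def total_children(degrees):
    """Total number of children implied by the histogram."""
    return sum(k * n for k, n in degrees.items())


def degree_histogram(head_of, nodes):
    """head_of maps node id -> head id; returns child-count -> number of nodes."""
    n_children = Counter(head_of.values())
    return Counter(n_children[node] for node in nodes)


print(total_nodes(REPORTED_DEGREES), total_children(REPORTED_DEGREES))

# Hand-made tree: node 1 is the root (head 0), nodes 3 and 4 depend on 1, node 2 on 3.
HEAD_OF = {1: 0, 2: 3, 3: 1, 4: 1}
print(degree_histogram(HEAD_OF, [1, 2, 3, 4]))
```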