
This page pertains to UD version 2.

Treebank Statistics: UD_English-CTeTex: POS Tags: PUNCT

There is 1 PUNCT lemma (6%), 22 PUNCT types (1%) and 1455 PUNCT tokens (16%). Out of 17 observed tags, the rank of PUNCT is: 13 in number of lemmas, 9 in number of types and 2 in number of tokens.
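Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy sentence are hypothetical, and it assumes that a "type" is a distinct, case-sensitive word form and that multiword-token ranges and empty nodes are excluded from the counts.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

# Toy sentence (not taken from the treebank); every lemma is "_".
ROWS = [
    ("1", "Yes", "_", "INTJ", "_", "_", "0", "root", "_", "_"),
    ("2", ",", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ("3", "fine", "_", "ADJ", "_", "_", "1", "parataxis", "_", "_"),
    ("4", ",", "_", "PUNCT", "_", "_", "3", "punct", "_", "_"),
    ("5", ".", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

counts = upos_counts(SAMPLE)
print(counts["PUNCT"])  # (1, 2, 3): one lemma, two types, three tokens
```

Because every lemma in the toy data is "_", each tag ends up with exactly one lemma, which mirrors the single PUNCT lemma reported above.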

The 10 most frequent PUNCT lemmas: _

The 10 most frequent PUNCT types: ,, ., ), -, (, :, [, ], •, “

The 10 most frequent ambiguous lemmas: _ (NOUN 2649, PUNCT 1455, DET 936, ADP 781, VERB 721, ADJ 647, AUX 492, NUM 317, PROPN 293, CCONJ 267, ADV 185, PART 165, SCONJ 163, SYM 98, PRON 83, X 17, INTJ 4)

The 10 most frequent ambiguous types: - (PUNCT 143, SYM 7), / (SYM 39, PUNCT 2), (PUNCT 2, PART 1)

Morphology

The form / lemma ratio of PUNCT is 22.000000 (the average of all parts of speech is 125.235294).
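As a worked check of the PUNCT figure, the ratio is the number of distinct forms divided by the number of distinct lemmas, using the counts reported on this page (22 types, 1 lemma). The helper name `form_lemma_ratio` is hypothetical, and the sketch assumes that "forms" here means the same as the types counted above.

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Distinct word forms per distinct lemma for one part of speech."""
    return n_forms / n_lemmas

# Counts reported for PUNCT on this page: 22 types, 1 lemma.
punct_ratio = form_lemma_ratio(22, 1)
print(f"{punct_ratio:.6f}")  # prints 22.000000
```

With a single lemma ("_"), the ratio is simply the number of forms.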

The highest number of forms (22) was observed with the lemma “_”: ”, ‘, (, ), ,, -, ., …, /, :, ;, ?, [, ], o, ·, –, ‘, ’, “, ”, •.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 1 relation: punct (1455; 100% instances)

Parents of PUNCT nodes belong to 16 different parts of speech: NOUN (558; 38% instances), VERB (393; 27% instances), ADJ (156; 11% instances), PROPN (132; 9% instances), NUM (112; 8% instances), ADV (25; 2% instances), ADP (23; 2% instances), X (17; 1% instances), SYM (15; 1% instances), PRON (11; 1% instances), AUX (4; 0% instances), PUNCT (4; 0% instances), CCONJ (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
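A parent distribution like the one above can be computed by following each PUNCT token's HEAD column to the UPOS of the token it points to. This is a minimal sketch under stated assumptions: the function name `parent_upos_counts` and the toy sentence are hypothetical, sentences are assumed to be separated by blank lines, and tokens attached to the artificial root (HEAD 0) are left out of the counts.

```python
from collections import Counter

def parent_upos_counts(conllu_text, upos="PUNCT"):
    """Count the UPOS tags of the parents of all nodes tagged `upos`."""
    counts = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        tags = {}   # token ID -> UPOS
        heads = {}  # token ID -> HEAD
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            # Skip multiword-token ranges and empty nodes.
            if "-" in cols[0] or "." in cols[0]:
                continue
            tags[cols[0]] = cols[3]
            heads[cols[0]] = cols[6]
        for tok_id, tag in tags.items():
            # HEAD "0" (the root) is not a real token, so it is skipped here.
            if tag == upos and heads[tok_id] in tags:
                counts[tags[heads[tok_id]]] += 1
    return counts

# Toy sentence (not taken from the treebank): "Results ( 2019 ) ."
ROWS = [
    ("1", "Results", "_", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("2", "(", "_", "PUNCT", "_", "_", "3", "punct", "_", "_"),
    ("3", "2019", "_", "NUM", "_", "_", "1", "nmod", "_", "_"),
    ("4", ")", "_", "PUNCT", "_", "_", "3", "punct", "_", "_"),
    ("5", ".", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

parents = parent_upos_counts(SAMPLE)
print(parents.most_common())  # [('NUM', 2), ('NOUN', 1)]
```

In the toy sentence both brackets attach to the number they enclose and the final period attaches to the root noun, which gives two NUM parents and one NOUN parent.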

1451 (100%) PUNCT nodes are leaves.

4 (0%) PUNCT nodes have one child.

The highest child degree of a PUNCT node is 1.

Children of PUNCT nodes are attached using 1 relation: punct (4; 100% instances)

Children of PUNCT nodes belong to 1 part of speech: PUNCT (4; 100% instances)
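The leaf and child-degree figures above can be derived by counting, for each PUNCT token, how many tokens in the same sentence name it as their HEAD. The following sketch is illustrative only: `child_degrees` and the toy sentence are hypothetical, and sentences are again assumed to be separated by blank lines.

```python
from collections import Counter

def child_degrees(conllu_text, upos="PUNCT"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        tags = {}   # token ID -> UPOS
        heads = {}  # token ID -> HEAD
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            # Skip multiword-token ranges and empty nodes.
            if "-" in cols[0] or "." in cols[0]:
                continue
            tags[cols[0]] = cols[3]
            heads[cols[0]] = cols[6]
        # How often each token ID appears as somebody's HEAD.
        n_children = Counter(heads.values())
        degrees.extend(n_children[t] for t, tag in tags.items() if tag == upos)
    return degrees

# Toy sentence (not taken from the treebank): the ")" hangs off the ".".
ROWS = [
    ("1", "Done", "_", "ADJ", "_", "_", "0", "root", "_", "_"),
    ("2", ".", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ("3", ")", "_", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

degrees = child_degrees(SAMPLE)
leaves = sum(1 for d in degrees if d == 0)
print(leaves, max(degrees))  # 1 1: one leaf, highest child degree 1
```

A PUNCT node with a PUNCT child, as in the toy data, corresponds to the 4 non-leaf cases reported for this treebank.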