Treebank Statistics: UD_English-CTeTex: POS Tags: PUNCT
There are 1 PUNCT
lemmas (6%), 22 PUNCT
types (1%) and 1455 PUNCT
tokens (16%).
Out of 17 observed tags, the rank of PUNCT
is: 13 in number of lemmas, 9 in number of types and 2 in number of tokens.
The 10 most frequent PUNCT
lemmas: _
The 10 most frequent PUNCT
types: ,, ., ), -, (, :, [, ], •, “
The 10 most frequent ambiguous lemmas: _ (NOUN 2649, PUNCT 1455, DET 936, ADP 781, VERB 721, ADJ 647, AUX 492, NUM 317, PROPN 293, CCONJ 267, ADV 185, PART 165, SCONJ 163, SYM 98, PRON 83, X 17, INTJ 4)
The 10 most frequent ambiguous types: - (PUNCT 143, SYM 7), / (SYM 39, PUNCT 2), ’ (PUNCT 2, PART 1)
- -
- /
- SYM 39: This will most likely be in the form of one or more frames per UDP / IP packet
- PUNCT 2: NPAC SMS shall allow Service Providers to add / delete the NPA-NXX and / or LRN data via the NPAC SMS to Local SMS interface and SOA to NPAC SMS interface provided the changes do not cause mass updates to the Subscription Versions .
- ’
- PUNCT 2: The Cab radio shall support a ‘ shunting mode ’ of operation that provides a link assurance tone to reassure users of the integrity of the communication link ( see section 14 ) . ( M )
- PART 1: NPAC SMS shall support the Service Providers ’ acknowledgment via 2 secure electronic forms , Email or FTP using encryption mechanisms .
Morphology
The form / lemma ratio of PUNCT
is 22.000000 (the average of all parts of speech is 125.235294).
The 1st highest number of forms (22) was observed with the lemma “_”: ”, ‘, (, ), ,, -, ., …, /, :, ;, ?, [, ], o, ·, –, ‘, ’, “, ”, •.
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (1455; 100% instances)
Parents of PUNCT
nodes belong to 16 different parts of speech: NOUN (558; 38% instances), VERB (393; 27% instances), ADJ (156; 11% instances), PROPN (132; 9% instances), NUM (112; 8% instances), ADV (25; 2% instances), ADP (23; 2% instances), X (17; 1% instances), SYM (15; 1% instances), PRON (11; 1% instances), AUX (4; 0% instances), PUNCT (4; 0% instances), CCONJ (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
1451 (100%) PUNCT
nodes are leaves.
4 (0%) PUNCT
nodes have one child.
The highest child degree of a PUNCT
node is 1.
Children of PUNCT
nodes are attached using 1 different relations: punct (4; 100% instances)
Children of PUNCT
nodes belong to 1 different parts of speech: PUNCT (4; 100% instances)