home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-VIT: POS Tags: PUNCT

There are 22 PUNCT lemmas (0%), 22 PUNCT types (0%) and 31585 PUNCT tokens (11%). Out of 17 observed tags, the rank of PUNCT is: 14 in number of lemmas, 15 in number of types and 4 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., “, ), -, (, :, ?, ;, «/em>

The 10 most frequent PUNCT types: ,, ., “, ), -, (, :, ?, ;, «/em>

The 10 most frequent ambiguous lemmas: - (PUNCT 1239, SYM 5), / (SYM 8, PUNCT 6), e (CCONJ 5950, NOUN 35, PUNCT 5, ADP 3, PROPN 2, ADV 1, SCONJ 1, X 1), come (ADP 382, CCONJ 142, SCONJ 68, PRON 27, ADV 6, INTJ 1, PUNCT 1, VERB 1), il (DET 36091, PRON 5, PUNCT 1, X 1), ma (CCONJ 766, ADV 10, PUNCT 1), o (CCONJ 677, NOUN 9, INTJ 1, PUNCT 1), p (NOUN 11, PUNCT 1)

The 10 most frequent ambiguous types: , (PUNCT 13145, ADJ 1), - (PUNCT 1239, SYM 5), / (SYM 8, PUNCT 6), e (CCONJ 5340, NOUN 33, PUNCT 6, ADP 3, PROPN 2, ADV 1, SCONJ 1, X 1), Ma (CCONJ 295, PUNCT 1), come (ADP 370, CCONJ 119, SCONJ 63, PRON 13, ADV 6, INTJ 1, PUNCT 1), le (DET 3916, PRON 52, NOUN 1, PUNCT 1), o (CCONJ 666, NOUN 10, INTJ 1, PUNCT 1), p (NOUN 12, PUNCT 1)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 1.501662).

The 1st highest number of forms (2) was observed with the lemma “!”: !, e.

The 2nd highest number of forms (2) was observed with the lemma “.”: ”, ..

The 3rd highest number of forms (1) was observed with the lemma “””: .

PUNCT occurs with 4 features: Clitic (1; 0% instances), Gender (1; 0% instances), Number (1; 0% instances), PronType (1; 0% instances)

PUNCT occurs with 4 feature-value pairs: Clitic=Yes, Gender=Fem, Number=Plur, PronType=Prs

PUNCT occurs with 3 feature combinations. The most frequent feature combination is _ (31583 tokens). Examples: ,, ., “, ), -, (, :, ?, ;, «/em>

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (31585; 100% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (14846; 47% instances), NOUN (10568; 33% instances), PROPN (2580; 8% instances), ADJ (1711; 5% instances), NUM (567; 2% instances), PRON (527; 2% instances), ADV (340; 1% instances), SYM (237; 1% instances), X (97; 0% instances), INTJ (52; 0% instances), CCONJ (33; 0% instances), ADP (16; 0% instances), DET (6; 0% instances), SCONJ (4; 0% instances), AUX (1; 0% instances)

31585 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.