home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PUD: POS Tags: CCONJ

There are 2 CCONJ lemmas (0%), 8 CCONJ types (0%) and 578 CCONJ tokens (2%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 13 in number of types and 11 in number of tokens.

The 10 most frequent CCONJ lemmas: e, _

The 10 most frequent CCONJ types: e, mas, ou, porém, Entretanto, como, contudo, portanto

The 10 most frequent ambiguous lemmas: _ (VERB 1051, PRON 891, ADP 675, NUM 467, AUX 416, NOUN 311, DET 290, CCONJ 131, SCONJ 98, ADJ 94, SYM 40, ADV 14, X 5, INTJ 1)

The 10 most frequent ambiguous types: como (ADP 116, ADV 5, CCONJ 4, SCONJ 4), portanto (ADP 1, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 4.000000 (the average of all parts of speech is 1.570488).

The 1st highest number of forms (8) was observed with the lemma “_”: E, Entretanto, como, contudo, mas, ou, portanto, porém.

The 2nd highest number of forms (1) was observed with the lemma “e”: e.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 3 different relations: cc (534; 92% instances), discourse (42; 7% instances), cc:preconj (2; 0% instances)

Parents of CCONJ nodes belong to 10 different parts of speech: NOUN (228; 39% instances), VERB (221; 38% instances), PROPN (52; 9% instances), ADJ (40; 7% instances), NUM (13; 2% instances), ADV (11; 2% instances), ADP (6; 1% instances), PRON (3; 1% instances), SYM (3; 1% instances), DET (1; 0% instances)

517 (89%) CCONJ nodes are leaves.

58 (10%) CCONJ nodes have one child.

3 (1%) CCONJ nodes have two children.

The highest child degree of a CCONJ node is 2.

Children of CCONJ nodes are attached using 2 different relations: punct (63; 98% instances), fixed (1; 2% instances)

Children of CCONJ nodes belong to 2 different parts of speech: PUNCT (63; 98% instances), ADV (1; 2% instances)