home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Basque-BDT: POS Tags: CCONJ

There are 47 CCONJ lemmas (0%), 51 CCONJ types (0%) and 6187 CCONJ tokens (5%). Out of 17 observed tags, the rank of CCONJ is: 9 in number of lemmas, 11 in number of types and 7 in number of tokens.

The 10 most frequent CCONJ lemmas: eta, ere, baina, edo, berriz, arren, ordea, gainera, beraz, bestalde

The 10 most frequent CCONJ types: eta, ere, baina, edo, berriz, arren, ordea, gainera, beraz, bestalde

The 10 most frequent ambiguous lemmas: berriz (CCONJ 100, ADV 28), arren (CCONJ 89, INTJ 1, NOUN 1, SCONJ 1), gainera (CCONJ 80, ADV 16, ADP 9), zein (DET 46, CCONJ 35, ADV 1), orduan (ADV 29, CCONJ 16), alta (CCONJ 14, NOUN 1), ezik (CCONJ 13, ADV 1), ostera (CCONJ 12, ADV 4), alegia (CCONJ 11, NOUN 4), hots (NOUN 8, CCONJ 7)

The 10 most frequent ambiguous types: berriz (CCONJ 100, ADV 27, ADJ 1), arren (CCONJ 89, INTJ 1, NOUN 1, SCONJ 1), gainera (CCONJ 37, ADP 9, ADV 6, NOUN 2), beraz (CCONJ 29, DET 1), aldiz (CCONJ 39, NOUN 39), zein (CCONJ 35, DET 32), orduan (ADV 21, NOUN 7, CCONJ 6), alta (CCONJ 1, NOUN 1), ezik (CCONJ 13, ADV 1), bada (AUX 33, VERB 20, CCONJ 10)

Morphology

The form / lemma ratio of CCONJ is 1.085106 (the average of all parts of speech is 2.170132).

The 1st highest number of forms (3) was observed with the lemma “eta”: ea, era, eta.

The 2nd highest number of forms (2) was observed with the lemma “baina”: baina, bainan.

The 3rd highest number of forms (2) was observed with the lemma “bestela”: Bertzela, bestela.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 18 different relations: cc (4641; 75% instances), advmod (1045; 17% instances), mark (148; 2% instances), fixed (133; 2% instances), advcl (81; 1% instances), obl (42; 1% instances), conj (35; 1% instances), dep (25; 0% instances), nmod (11; 0% instances), compound (10; 0% instances), discourse (6; 0% instances), flat (3; 0% instances), root (2; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)

Parents of CCONJ nodes belong to 16 different parts of speech: VERB (3649; 59% instances), NOUN (1225; 20% instances), PROPN (518; 8% instances), ADJ (325; 5% instances), ADV (183; 3% instances), NUM (106; 2% instances), AUX (91; 1% instances), DET (43; 1% instances), CCONJ (28; 0% instances), PRON (9; 0% instances), ADP (4; 0% instances), (2; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)

5267 (85%) CCONJ nodes are leaves.

850 (14%) CCONJ nodes have one child.

45 (1%) CCONJ nodes have two children.

25 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 6.

Children of CCONJ nodes are attached using 20 different relations: punct (910; 89% instances), cc (24; 2% instances), compound (16; 2% instances), advcl (13; 1% instances), nsubj (11; 1% instances), nmod (9; 1% instances), conj (7; 1% instances), obj (7; 1% instances), advmod (5; 0% instances), dep (5; 0% instances), fixed (5; 0% instances), flat (3; 0% instances), xcomp (3; 0% instances), obl (2; 0% instances), amod (1; 0% instances), aux (1; 0% instances), cop (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances)

Children of CCONJ nodes belong to 11 different parts of speech: PUNCT (910; 89% instances), VERB (32; 3% instances), CCONJ (28; 3% instances), NOUN (23; 2% instances), PART (11; 1% instances), ADV (7; 1% instances), PROPN (6; 1% instances), ADJ (4; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), NUM (1; 0% instances)