
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: CCONJ

There are 32 CCONJ lemmas (0%), 43 CCONJ types (0%) and 8763 CCONJ tokens (3%). Out of 16 observed tags, the rank of CCONJ is: 13 in number of lemmas, 13 in number of types and 11 in number of tokens.
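The lemma, type and token counts above can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names `iter_words` and `pos_counts` are hypothetical, the sample sentence is a toy, and whether word forms are case-folded before counting types is an assumption exposed as the `lowercase` parameter.

```python
def iter_words(conllu_text):
    """Yield (form, lemma, upos) for every syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank lines separate sentences; '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos, lowercase=False):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in iter_words(conllu_text):
        if tag != upos:
            continue
        lemmas.add(lemma)
        types.add(form.lower() if lowercase else form)  # case folding is an assumption
        tokens += 1
    return len(lemmas), len(types), tokens


# Toy sentence (hypothetical data, not taken from the treebank).
ROWS = [
    ("1", "Jan", "Jan", "PROPN", "_", "_", "6", "nsubj", "_", "_"),
    ("2", "en", "en", "CCONJ", "_", "_", "3", "cc", "_", "_"),
    ("3", "Piet", "Piet", "PROPN", "_", "_", "1", "conj", "_", "_"),
    ("4", "én", "en", "CCONJ", "_", "_", "5", "cc", "_", "_"),
    ("5", "Kees", "Kees", "PROPN", "_", "_", "1", "conj", "_", "_"),
    ("6", "slapen", "slapen", "VERB", "_", "_", "0", "root", "_", "_"),
    ("7", "maar", "maar", "CCONJ", "_", "_", "9", "cc", "_", "_"),
    ("8", "Anna", "Anna", "PROPN", "_", "_", "9", "nsubj", "_", "_"),
    ("9", "werkt", "werken", "VERB", "_", "_", "6", "conj", "_", "_"),
]
SAMPLE = "# sent_id = toy-1\n" + "\n".join("\t".join(row) for row in ROWS) + "\n"

n_lemmas, n_types, n_tokens = pos_counts(SAMPLE, "CCONJ")
print(n_lemmas, n_types, n_tokens)
```

In the toy sentence the forms "en" and "én" share the lemma "en", so the type count exceeds the lemma count, which is the same effect that gives 43 types for 32 lemmas on this page.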

The 10 most frequent CCONJ lemmas: en, maar, of, zowel, want, tot, ofwel, doch, noch, respectievelijk

The 10 most frequent CCONJ types: en, maar, of, zowel, want, tot, ofwel, doch, noch, én

The 10 most frequent ambiguous lemmas: en (CCONJ 7250, PROPN 56, X 1), maar (CCONJ 720, ADV 103, DET 1), of (CCONJ 472, X 74, SCONJ 34, PROPN 22), zowel (CCONJ 78, ADV 1), want (CCONJ 63, X 2), tot (ADP 1037, CCONJ 29), respectievelijk (CCONJ 15, ADJ 6), enzovoorts (CCONJ 9, X 1), niet (ADV 954, CCONJ 9, DET 6), à (X 10, CCONJ 9, PROPN 2, ADP 1)

The 10 most frequent ambiguous types: en (CCONJ 7160, PROPN 56, DET 1, NUM 1, X 1), maar (CCONJ 616, ADV 101, DET 1), of (CCONJ 462, X 74, SCONJ 31, PROPN 22), zowel (CCONJ 66, ADV 1), want (CCONJ 59, X 2), tot (ADP 987, CCONJ 29), respectievelijk (CCONJ 13, ADJ 3), niet (ADV 940, CCONJ 8, DET 5), à (X 10, CCONJ 8, PROPN 2, ADP 1), dus (ADV 94, CCONJ 7)

Morphology

The form / lemma ratio of CCONJ is 1.343750 (the average of all parts of speech is 1.223407).
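The reported ratio is consistent with dividing the number of CCONJ types by the number of CCONJ lemmas given at the top of this page. The short check below assumes that this is how the ratio is defined; the page itself does not state the formula.

```python
# Counts taken from the summary line of this page.
cconj_types = 43
cconj_lemmas = 32

# Assumed definition: distinct word forms per distinct lemma.
ratio = cconj_types / cconj_lemmas
print(f"{ratio:.6f}")  # 1.343750
```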

The highest number of forms (5) was observed with the lemma “en”: eb, een, en, èn, én.

The second-highest number of forms (3) was observed with the lemma “enzovoorts”: enz, enz., enzovoorts.

The third-highest number of forms (2) was observed with the lemma “etcetera”: etc., etcetera.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 9 different relations: cc (8188; 93% of instances), mark (219; 2% of instances), flat (129; 1% of instances), fixed (123; 1% of instances), cc:preconj (100; 1% of instances), advcl (1; 0% of instances), amod (1; 0% of instances), nsubj (1; 0% of instances), parataxis (1; 0% of instances).

Parents of CCONJ nodes belong to 13 different parts of speech: VERB (3064; 35% of instances), NOUN (2974; 34% of instances), PROPN (1372; 16% of instances), ADJ (712; 8% of instances), NUM (220; 3% of instances), X (96; 1% of instances), ADV (95; 1% of instances), PRON (85; 1% of instances), ADP (63; 1% of instances), SYM (47; 1% of instances), DET (29; 0% of instances), AUX (3; 0% of instances), CCONJ (3; 0% of instances).
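The relation and parent-tag breakdowns above are frequency counts over dependency edges. A minimal sketch of how such counts could be gathered is shown here; it is not the tool that produced this page. Sentences are assumed to be already parsed into lists of `(id, upos, head, deprel)` tuples, and the names `relation_and_parent_counts` and `with_percent` are hypothetical.

```python
from collections import Counter


def relation_and_parent_counts(sentences, upos):
    """Count incoming relations and parent UPOS tags for all nodes tagged `upos`.

    Each sentence is a list of (id, upos, head, deprel) tuples with integer
    ids and heads; head 0 marks the root, which has no parent word.
    """
    relations, parents = Counter(), Counter()
    for sentence in sentences:
        upos_by_id = {word_id: tag for word_id, tag, _, _ in sentence}
        for _, tag, head, deprel in sentence:
            if tag != upos:
                continue
            relations[deprel] += 1
            if head != 0:
                parents[upos_by_id[head]] += 1
    return relations, parents


def with_percent(counter):
    """Return (key, count, rounded percentage) triples, most frequent first."""
    total = sum(counter.values())
    return [(key, n, round(100 * n / total)) for key, n in counter.most_common()]


# Toy sentence (hypothetical data): "Jan en Piet slapen".
toy = [[(1, "PROPN", 4, "nsubj"), (2, "CCONJ", 3, "cc"),
        (3, "PROPN", 1, "conj"), (4, "VERB", 0, "root")]]
relations, parents = relation_and_parent_counts(toy, "CCONJ")
print(with_percent(relations), with_percent(parents))

# Applying the percentage helper to the five most frequent relation counts
# reported on this page, using the total number of CCONJ tokens as the base.
page_counts = {"cc": 8188, "mark": 219, "flat": 129, "fixed": 123, "cc:preconj": 100}
page_percent = {rel: round(100 * n / 8763) for rel, n in page_counts.items()}
print(page_percent)
```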

8712 (99%) CCONJ nodes are leaves.

46 (1%) CCONJ nodes have one child.

5 (0%) CCONJ nodes have two children.

The highest child degree of a CCONJ node is 2.
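The leaf and child-degree figures above amount to a histogram of how many dependents each CCONJ node has. The sketch below shows one way to compute such a histogram, assuming sentences are available as lists of `(id, upos, head, deprel)` tuples; the function name `child_degree_histogram` and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos):
    """Map number of children -> number of nodes tagged `upos` with that many children."""
    histogram = Counter()
    for sentence in sentences:
        # Count how often each word id appears as the head of another word.
        children_of = Counter(head for _, _, head, _ in sentence)
        for word_id, tag, _, _ in sentence:
            if tag == upos:
                histogram[children_of[word_id]] += 1
    return histogram


# Toy data (hypothetical): one CCONJ leaf and one CCONJ with a punctuation child.
toy = [
    [(1, "PROPN", 4, "nsubj"), (2, "CCONJ", 3, "cc"),
     (3, "PROPN", 1, "conj"), (4, "VERB", 0, "root")],
    [(1, "CCONJ", 3, "cc"), (2, "PUNCT", 1, "punct"), (3, "VERB", 0, "root")],
]
histogram = child_degree_histogram(toy, "CCONJ")
print(dict(histogram))

# Consistency check on the figures reported on this page:
# leaves + one-child nodes + two-child nodes should equal all CCONJ tokens.
page_total = 8712 + 46 + 5
print(page_total)  # 8763
```

The same arithmetic gives the number of child edges, 46 + 2 × 5 = 56, which matches the sum of the child relation counts listed on this page.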

Children of CCONJ nodes are attached using 4 different relations: punct (36; 64% of instances), fixed (18; 32% of instances), conj (1; 2% of instances), parataxis (1; 2% of instances).

Children of CCONJ nodes belong to 7 different parts of speech: PUNCT (36; 64% of instances), ADV (9; 16% of instances), ADJ (3; 5% of instances), CCONJ (3; 5% of instances), SYM (3; 5% of instances), NOUN (1; 2% of instances), PRON (1; 2% of instances).