
This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: CCONJ

There are 33 CCONJ lemmas (0%), 35 CCONJ types (0%) and 9000 CCONJ tokens (3%). Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 10 in number of tokens.
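Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that "types" means distinct case-sensitive word forms and that multiword-token ranges and empty nodes are skipped; the function name `pos_counts` and the toy rows in `SAMPLE` are made up for illustration and are not taken from the treebank.

```python
def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) counts for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and an integer ID
        # (this drops ranges such as "1-2" and empty nodes such as "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:          # column 4 is UPOS
            tokens += 1
            types.add(cols[1])       # column 2 is FORM (case-sensitive)
            lemmas.add(cols[2])      # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Toy rows for illustration only (hypothetical, not from UD_Belarusian-HSE).
ROWS = [
    ["1", "Таксама", "таксама", "CCONJ", "_", "_", "2", "cc", "_", "_"],
    ["2", "кот", "кот", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "і", "і", "CCONJ", "_", "_", "4", "cc", "_", "_"],
    ["4", "сабака", "сабака", "NOUN", "_", "_", "2", "conj", "_", "_"],
    ["5", "й", "і", "CCONJ", "_", "_", "6", "cc", "_", "_"],
    ["6", "мыш", "мыш", "NOUN", "_", "_", "2", "conj", "_", "_"],
]
SAMPLE = "# text = toy example\n" + "\n".join("\t".join(r) for r in ROWS) + "\n"

print(pos_counts(SAMPLE, "CCONJ"))
```

In the toy sample the forms і and й share the lemma і, so the number of types exceeds the number of lemmas.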

The 10 most frequent CCONJ lemmas: і, а, але, ці, ды, або, таксама, ні, аднак, альбо

The 10 most frequent CCONJ types: і, а, але, ці, ды, або, й, ні, Таксама, аднак

The 10 most frequent ambiguous lemmas: і (CCONJ 6196, PART 454, ADJ 7, X 6), а (CCONJ 1177, ADP 190, X 11, NOUN 2, INTJ 1), але (CCONJ 659, NOUN 1), ці (CCONJ 279, SCONJ 123, PART 50, ADJ 1, X 1), таксама (ADV 337, CCONJ 57, PART 8), ні (CCONJ 55, PART 45), аднак (CCONJ 53, ADV 5), то (SCONJ 111, CCONJ 14, PRON 11, PART 4), и (CCONJ 9, X 1), ані (CCONJ 8, PART 3, PRON 1)

The 10 most frequent ambiguous types: і (CCONJ 5739, PART 441, X 2), а (CCONJ 731, ADP 173, X 9, NOUN 1), але (CCONJ 418, NOUN 1), ці (CCONJ 198, SCONJ 119, PART 18, ADJ 1, X 1), й (CCONJ 59, PART 2), ні (CCONJ 50, PART 33), Таксама (CCONJ 54, ADV 12), аднак (CCONJ 24, ADV 3), i (CCONJ 44, PART 2), то (SCONJ 106, CCONJ 9, PRON 9, PART 4)

Morphology

The form / lemma ratio of CCONJ is 1.060606 (the average of all parts of speech is 1.756638).
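The ratio above is consistent with dividing the number of CCONJ types by the number of CCONJ lemmas reported on this page (35 and 33). A short check, assuming that this is how the figure is derived:

```python
# Figures reported on this page for CCONJ.
cconj_types = 35
cconj_lemmas = 33

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
ratio = cconj_types / cconj_lemmas
print(f"{ratio:.6f}")
```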

The 1st highest number of forms (3) was observed with the lemma “і”: i, й, і.

The 2nd highest number of forms (2) was observed with the lemma “таксама”: ТАКСАМА, Таксама.

The 3rd highest number of forms (2) was observed with the lemma “ці”: цi, ці.

CCONJ occurs with 5 features: Polarity (64; 1% instances), Animacy (1; 0% instances), Case (1; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)

CCONJ occurs with 5 feature-value pairs: Animacy=Inan, Case=Gen, Gender=Fem, Number=Sing, Polarity=Neg

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (8935 tokens). Examples: і, а, але, ці, ды, або, й, Таксама, аднак, альбо

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (8940; 99% instances), fixed (33; 0% instances), conj (12; 0% instances), advmod (5; 0% instances), case (2; 0% instances), root (2; 0% instances), dep (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)
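The percentages above follow from the raw counts. The sketch below recomputes them, assuming each share is the count divided by the total number of CCONJ tokens and rounded to the nearest integer; the variable names are illustrative.

```python
# Relation counts for CCONJ nodes, copied from the list above.
RELATION_COUNTS = {
    "cc": 8940, "fixed": 33, "conj": 12, "advmod": 5, "case": 2, "root": 2,
    "dep": 1, "flat": 1, "iobj": 1, "mark": 1, "parataxis": 1, "reparandum": 1,
}

total = sum(RELATION_COUNTS.values())

# Assumption: shares are rounded to whole percent.
shares = {rel: round(100 * n / total) for rel, n in RELATION_COUNTS.items()}

for rel, n in RELATION_COUNTS.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The counts sum to the 9000 CCONJ tokens reported at the top of the page, so every CCONJ node is covered by exactly one incoming relation.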

Parents of CCONJ nodes belong to 17 different parts of speech: NOUN (3436; 38% instances), VERB (3408; 38% instances), ADJ (736; 8% instances), PROPN (630; 7% instances), ADV (263; 3% instances), DET (138; 2% instances), PRON (132; 1% instances), X (106; 1% instances), PART (50; 1% instances), NUM (43; 0% instances), CCONJ (22; 0% instances), SYM (14; 0% instances), AUX (7; 0% instances), SCONJ (6; 0% instances), ADP (4; 0% instances), INTJ (3; 0% instances), the artificial root, which has no part of speech (2; 0% instances)

8746 (97%) CCONJ nodes are leaves.

236 (3%) CCONJ nodes have one child.

17 (0%) CCONJ nodes have two children.

1 (0%) CCONJ node has three or more children.

The highest child degree of a CCONJ node is 6.
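The child-degree figures above can be derived from the HEAD column of each sentence. The following is a minimal sketch, assuming each sentence is given as a list of `(id, upos, head)` tuples; the function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(rows, upos):
    """Count nodes of the given UPOS by number of children (3 = three or more)."""
    # Each HEAD value names a parent, so counting HEADs gives children per node.
    n_children = Counter(head for _, _, head in rows)
    hist = Counter()
    for node_id, tag, _ in rows:
        if tag == upos:
            # Counter returns 0 for nodes that never appear as a head (leaves).
            hist[min(n_children[node_id], 3)] += 1
    return dict(hist)


# Toy sentence for illustration only: node 1 (CCONJ) has one child (node 2),
# node 4 (CCONJ) is a leaf.
TOY = [
    (1, "CCONJ", 3),
    (2, "ADV", 1),
    (3, "VERB", 0),
    (4, "CCONJ", 5),
    (5, "VERB", 3),
]

print(child_degree_histogram(TOY, "CCONJ"))
```

Summing such histograms over all sentences would give the four buckets reported above (8746, 236, 17 and 1), which together account for all 9000 CCONJ tokens.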

Children of CCONJ nodes are attached using 7 different relations: fixed (225; 82% instances), punct (34; 12% instances), conj (11; 4% instances), advmod (3; 1% instances), flat (1; 0% instances), nmod (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 7 different parts of speech: ADV (167; 61% instances), PART (40; 14% instances), PUNCT (34; 12% instances), CCONJ (22; 8% instances), SCONJ (6; 2% instances), PRON (4; 1% instances), NOUN (3; 1% instances)