home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: CCONJ

There are 22 CCONJ lemmas (0%), 27 CCONJ types (0%) and 12398 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 16 in number of lemmas, 16 in number of types and 9 in number of tokens.

The 10 most frequent CCONJ lemmas: un, bet, vai, gan, taču, arī, ne, kā, tomēr, nevis

The 10 most frequent CCONJ types: un, bet, vai, gan, taču, arī, ne, kā, tomēr, nevis

The 10 most frequent ambiguous lemmas: un (CCONJ 8175, X 1), vai (CCONJ 543, SCONJ 496, PART 277, INTJ 2), gan (CCONJ 469, PART 261, SCONJ 1), taču (CCONJ 431, PART 63), arī (PART 1741, CCONJ 278, SCONJ 1), ne (PART 272, CCONJ 252, SCONJ 1), (SCONJ 866, ADV 579, CCONJ 233, PART 85), tomēr (CCONJ 210, PART 86), tātad (CCONJ 50, PART 8), (PRON 951, ADV 398, DET 354, SCONJ 46, PART 32, CCONJ 11)

The 10 most frequent ambiguous types: un (CCONJ 7746, ADP 1, X 1), vai (CCONJ 515, SCONJ 492, PART 112, INTJ 1), gan (CCONJ 451, PART 257, SCONJ 1), taču (CCONJ 266, PART 58), arī (PART 1592, CCONJ 276, SCONJ 1), ne (PART 249, CCONJ 246, DET 1, SCONJ 1), (SCONJ 802, ADV 413, CCONJ 225, PART 85, PRON 26, DET 7), tomēr (CCONJ 103, PART 80), tātad (CCONJ 17, PART 4), (PRON 395, ADV 323, DET 162, PART 20, SCONJ 13, CCONJ 10)

Morphology

The form / lemma ratio of CCONJ is 1.227273 (the average of all parts of speech is 2.328168).

The 1st highest number of forms (2) was observed with the lemma “arī”: ari, arī.

The 2nd highest number of forms (2) was observed with the lemma “kā”: ka, kā.

The 3rd highest number of forms (2) was observed with the lemma “nevis”: nevis, nevīs.

CCONJ occurs with 2 features: Polarity (252; 2% instances), Typo (6; 0% instances)

CCONJ occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (12140 tokens). Examples: un, bet, vai, gan, taču, arī, kā, tomēr, nevis, jeb

Relations

CCONJ nodes are attached to their parents using 12 different relations: cc (11970; 97% instances), fixed (286; 2% instances), mark (72; 1% instances), case (26; 0% instances), dep (17; 0% instances), conj (9; 0% instances), discourse (7; 0% instances), root (5; 0% instances), nsubj (3; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (5951; 48% instances), NOUN (4106; 33% instances), ADJ (804; 6% instances), PROPN (533; 4% instances), ADV (346; 3% instances), CCONJ (243; 2% instances), PRON (189; 2% instances), NUM (75; 1% instances), SCONJ (43; 0% instances), X (34; 0% instances), PART (25; 0% instances), SYM (20; 0% instances), AUX (12; 0% instances), ADP (6; 0% instances), INTJ (5; 0% instances), (5; 0% instances), DET (1; 0% instances)

11861 (96%) CCONJ nodes are leaves.

526 (4%) CCONJ nodes have one child.

7 (0%) CCONJ nodes have two children.

4 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 9 different relations: fixed (505; 91% instances), punct (32; 6% instances), conj (7; 1% instances), discourse (3; 1% instances), flat:name (3; 1% instances), nmod (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 9 different parts of speech: PART (268; 48% instances), CCONJ (243; 44% instances), PUNCT (32; 6% instances), NOUN (4; 1% instances), SCONJ (4; 1% instances), ADV (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances)