home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: CCONJ

There are 24 CCONJ lemmas (0%), 29 CCONJ types (0%) and 13280 CCONJ tokens (4%). Out of 17 observed tags, the rank of CCONJ is: 15 in number of lemmas, 16 in number of types and 9 in number of tokens.

The 10 most frequent CCONJ lemmas: un, bet, vai, gan, taču, arī, ne, kā, tomēr, nevis

The 10 most frequent CCONJ types: un, bet, vai, gan, taču, arī, ne, kā, tomēr, nevis

The 10 most frequent ambiguous lemmas: un (CCONJ 8732, X 1), vai (CCONJ 598, SCONJ 500, PART 301, INTJ 2), gan (CCONJ 487, PART 282, SCONJ 1), taču (CCONJ 456, PART 76), arī (PART 1840, CCONJ 282), ne (PART 288, CCONJ 280), (SCONJ 958, ADV 603, CCONJ 247, PART 94), tomēr (CCONJ 218, PART 93), tātad (CCONJ 50, PART 8, ADV 1), toties (CCONJ 22, PART 1)

The 10 most frequent ambiguous types: un (CCONJ 8281, ADP 1, X 1), vai (CCONJ 566, SCONJ 496, PART 119, INTJ 1), gan (CCONJ 469, PART 278, SCONJ 1), taču (CCONJ 279, PART 69), arī (PART 1685, CCONJ 280), ne (CCONJ 272, PART 262, PRON 1), (SCONJ 889, ADV 431, CCONJ 239, PART 94, PRON 34), tomēr (CCONJ 108, PART 87), tātad (CCONJ 17, PART 4, ADV 1), toties (CCONJ 11, PART 1)

Morphology

The form / lemma ratio of CCONJ is 1.208333 (the average of all parts of speech is 2.339090).

The 1st highest number of forms (2) was observed with the lemma “arī”: ari, arī.

The 2nd highest number of forms (2) was observed with the lemma “kā”: ka, kā.

The 3rd highest number of forms (2) was observed with the lemma “nevis”: nevis, nevīs.

CCONJ occurs with 2 features: Polarity (280; 2% instances), Typo (6; 0% instances)

CCONJ occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

CCONJ occurs with 3 feature combinations. The most frequent feature combination is _ (12994 tokens). Examples: un, bet, vai, gan, taču, arī, kā, tomēr, nevis, jeb

Relations

CCONJ nodes are attached to their parents using 11 different relations: cc (12840; 97% instances), fixed (295; 2% instances), mark (102; 1% instances), dep (16; 0% instances), conj (9; 0% instances), discourse (7; 0% instances), root (5; 0% instances), nsubj (3; 0% instances), flat:name (1; 0% instances), iobj (1; 0% instances), reparandum (1; 0% instances)

Parents of CCONJ nodes belong to 17 different parts of speech: VERB (6360; 48% instances), NOUN (4413; 33% instances), ADJ (874; 7% instances), PROPN (560; 4% instances), ADV (368; 3% instances), CCONJ (249; 2% instances), DET (149; 1% instances), NUM (85; 1% instances), PRON (59; 0% instances), SCONJ (45; 0% instances), X (38; 0% instances), PART (27; 0% instances), SYM (23; 0% instances), AUX (14; 0% instances), ADP (6; 0% instances), INTJ (5; 0% instances), (5; 0% instances)

12703 (96%) CCONJ nodes are leaves.

566 (4%) CCONJ nodes have one child.

7 (0%) CCONJ nodes have two children.

4 (0%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 5.

Children of CCONJ nodes are attached using 9 different relations: fixed (543; 91% instances), punct (34; 6% instances), conj (7; 1% instances), discourse (3; 1% instances), flat:name (3; 1% instances), nmod (2; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances), reparandum (1; 0% instances)

Children of CCONJ nodes belong to 9 different parts of speech: PART (300; 50% instances), CCONJ (249; 42% instances), PUNCT (34; 6% instances), NOUN (4; 1% instances), SCONJ (4; 1% instances), ADV (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances), PROPN (1; 0% instances)