home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Javanese-CSUI: POS Tags: CCONJ

There are 1 CCONJ lemmas (6%), 9 CCONJ types (0%) and 306 CCONJ tokens (2%). Out of 17 observed tags, the rank of CCONJ is: 5 in number of lemmas, 16 in number of types and 13 in number of tokens.

The 10 most frequent CCONJ lemmas: _

The 10 most frequent CCONJ types: lan, utawa, nanging, sarta, karo, utawi, tur, saha, ugi

The 10 most frequent ambiguous lemmas: _ (NOUN 2867, PUNCT 2233, VERB 1952, PROPN 1565, PRON 961, ADV 798, ADP 748, ADJ 736, DET 700, NUM 362, AUX 340, SCONJ 314, CCONJ 306, PART 234, X 183, INTJ 32, SYM 12)

The 10 most frequent ambiguous types: karo (ADP 40, SCONJ 9, CCONJ 5), ugi (ADV 7, CCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 9.000000 (the average of all parts of speech is 238.352941).

The 1st highest number of forms (9) was observed with the lemma “_”: karo, lan, nanging, saha, sarta, tur, ugi, utawa, utawi.

CCONJ occurs with 1 features: Polite (303; 99% instances)

CCONJ occurs with 2 feature-value pairs: Polite=Form, Polite=Infm

CCONJ occurs with 3 feature combinations. The most frequent feature combination is Polite=Infm (298 tokens). Examples: lan, utawa, nanging, sarta, karo

Relations

CCONJ nodes are attached to their parents using 1 different relations: cc (306; 100% instances)

Parents of CCONJ nodes belong to 9 different parts of speech: NOUN (106; 35% instances), VERB (105; 34% instances), ADJ (39; 13% instances), PROPN (29; 9% instances), X (12; 4% instances), NUM (8; 3% instances), PRON (4; 1% instances), ADV (2; 1% instances), SYM (1; 0% instances)

305 (100%) CCONJ nodes are leaves.

1 (0%) CCONJ nodes have one child.

The highest child degree of a CCONJ node is 1.

Children of CCONJ nodes are attached using 1 different relations: punct (1; 100% instances)

Children of CCONJ nodes belong to 1 different parts of speech: PUNCT (1; 100% instances)