home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: SCONJ

There are 16 SCONJ lemmas (2%), 16 SCONJ types (2%) and 27 SCONJ tokens (1%). Out of 15 observed tags, the rank of SCONJ is: 9 in number of lemmas, 9 in number of types and 13 in number of tokens.

The 10 most frequent SCONJ lemmas: rằng, nếu, như, và, bởi lẽ, bởi vì, dù, khi, liệu, ngay khi

The 10 most frequent SCONJ types: rằng, nếu, như, và, Bởi vì, Tuy, bởi lẽ, dù, khi, liệu

The 10 most frequent ambiguous lemmas: như (ADP 6, SCONJ 3), (CCONJ 32, SCONJ 2), khi (ADV 9, ADP 2, SCONJ 1), nhưng (CCONJ 8, SCONJ 1), nên (ADV 1, SCONJ 1)

The 10 most frequent ambiguous types: như (ADP 6, SCONJ 3), (CCONJ 26, SCONJ 2), khi (ADV 7, ADP 2, SCONJ 1), nhưng (CCONJ 3, SCONJ 1), nên (ADV 1, SCONJ 1)

Morphology

The form / lemma ratio of SCONJ is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “bởi lẽ”: bởi lẽ.

The 2nd highest number of forms (1) was observed with the lemma “bởi vì”: Bởi vì.

The 3rd highest number of forms (1) was observed with the lemma “dù”: .

SCONJ occurs with 1 features: Typo (1; 4% instances)

SCONJ occurs with 1 feature-value pairs: Typo=Yes

SCONJ occurs with 2 feature combinations. The most frequent feature combination is _ (26 tokens). Examples: rằng, nếu, như, và, Bởi vì, Tuy, bởi lẽ, dù, khi, liệu

Relations

SCONJ nodes are attached to their parents using 3 different relations: mark (23; 85% instances), cc (3; 11% instances), case (1; 4% instances)

Parents of SCONJ nodes belong to 3 different parts of speech: VERB (23; 85% instances), NOUN (3; 11% instances), ADJ (1; 4% instances)

27 (100%) SCONJ nodes are leaves.

The highest child degree of a SCONJ node is 0.