Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: SCONJ
There are 11 SCONJ lemmas (0%), 12 SCONJ types (0%) and 1054 SCONJ tokens (1%).
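Counts of this kind can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. The function name `count_upos` is hypothetical and the three sample rows are invented for illustration; only the standard CoNLL-U column order (ID, FORM, LEMMA, UPOS, ...) is relied on. Forms are compared case-sensitively here, which is an assumption.

```python
# Invented three-token sample in CoNLL-U column order; not taken from the treebank.
SAMPLE = "\n".join([
    "# text = invented example",
    "1\tdacă\tdacă\tSCONJ\t_\t_\t2\tmark\t_\t_",
    "2\tvine\tveni\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tdaca\tdacă\tSCONJ\t_\t_\t2\tmark\t_\t_",
])

def count_upos(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens

result = count_upos(SAMPLE, "SCONJ")
print(result)  # (1, 2, 2)
```

In the sample, the two spellings "dacă" and "daca" count as two types of a single lemma, which mirrors the situation reported for this tag.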
Out of 16 observed tags, the rank of SCONJ is: 12 in number of lemmas, 13 in number of types and 14 in number of tokens.
The 10 most frequent SCONJ lemmas: că, dacă, deși, deoarece, până, întrucât, încât, fără, să, fiindcă
The 10 most frequent SCONJ types: că, dacă, deși, deoarece, până, întrucât, încât, fără, să, daca
The 10 most frequent ambiguous lemmas: până (ADP 92, SCONJ 28), fără (ADP 135, SCONJ 3), să (PART 324, SCONJ 2)
The 10 most frequent ambiguous types: până (ADP 86, SCONJ 27), fără (ADP 135, SCONJ 3), să (PART 320, SCONJ 2)
Morphology
The form / lemma ratio of SCONJ is 1.090909 (the average of all parts of speech is 1.666637).
The 1st highest number of forms (2) was observed with the lemma “dacă”: daca, dacă.
The 2nd highest number of forms (1) was observed with the lemma “că”: că.
The 3rd highest number of forms (1) was observed with the lemma “deoarece”: deoarece.
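As a rough check on these figures, the sketch below groups forms by lemma and recomputes the ratio. It assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, which is consistent with the reported totals (12 types, 11 lemmas) but is not stated on this page. The helper name `forms_per_lemma` is hypothetical, and `PAIRS` holds only the form/lemma pairs quoted above, not the full tag.

```python
from collections import defaultdict

# Form/lemma pairs quoted on this page (a subset of the 11 SCONJ lemmas).
PAIRS = [
    ("daca", "dacă"),
    ("dacă", "dacă"),
    ("că", "că"),
    ("deoarece", "deoarece"),
]

def forms_per_lemma(pairs):
    """Map each lemma to the sorted list of its distinct forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return {lemma: sorted(fs) for lemma, fs in forms.items()}

table = forms_per_lemma(PAIRS)

# Ratio for the whole tag, using the totals reported for SCONJ:
# 12 types over 11 lemmas (assumed definition, see the text above).
ratio = 12 / 11

print(table["dacă"])    # ['daca', 'dacă']
print(f"{ratio:.6f}")   # 1.090909
```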
SCONJ occurs with 2 features: Polarity (1054; 100% instances), Variant (1; 0% instances)
SCONJ occurs with 3 feature-value pairs: Polarity=Neg, Polarity=Pos, Variant=Short
SCONJ occurs with 3 feature combinations.
The most frequent feature combination is Polarity=Neg (972 tokens).
Examples: că, dacă, deși, deoarece, până, încât, întrucât, fără, să, daca
Relations
SCONJ nodes are attached to their parents using 5 different relations: mark (969; 92% instances), fixed (56; 5% instances), case (27; 3% instances), advmod (1; 0% instances), obj (1; 0% instances)
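The percentages in this sentence appear to be each relation's share of all SCONJ tokens, rounded to a whole number. The sketch below recomputes them from the counts given above; the function name `shares` is hypothetical, and rounding to the nearest integer is an assumption that happens to reproduce the reported values.

```python
# Relation counts for SCONJ nodes, copied from the sentence above.
RELATIONS = {"mark": 969, "fixed": 56, "case": 27, "advmod": 1, "obj": 1}

def shares(counts):
    """Return the total and each count's share of it as a rounded percentage."""
    total = sum(counts.values())
    return total, {rel: round(100 * n / total) for rel, n in counts.items()}

total, percent = shares(RELATIONS)
print(total)    # 1054
print(percent)  # {'mark': 92, 'fixed': 5, 'case': 3, 'advmod': 0, 'obj': 0}
```

The total of 1054 matches the number of SCONJ tokens, so every token is accounted for by exactly one relation to its parent.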
Parents of SCONJ nodes belong to 8 different parts of speech: VERB (734; 70% instances), ADJ (152; 14% instances), NOUN (85; 8% instances), ADV (48; 5% instances), ADP (14; 1% instances), NUM (11; 1% instances), AUX (8; 1% instances), PRON (2; 0% instances)
1022 (97%) SCONJ nodes are leaves.
31 (3%) SCONJ nodes have one child.
1 (0%) SCONJ nodes have two children.
The highest child degree of a SCONJ node is 2.
Children of SCONJ nodes are attached using 3 different relations: fixed (31; 94% instances), advmod (1; 3% instances), punct (1; 3% instances)
Children of SCONJ nodes belong to 5 different parts of speech: ADP (27; 82% instances), ADV (3; 9% instances), NOUN (1; 3% instances), PART (1; 3% instances), PUNCT (1; 3% instances)
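The child statistics can be cross-checked against each other: the number of children implied by the child-degree counts should equal the totals of the two breakdowns above. A minimal sketch of that check, using only figures from this page (the variable names are illustrative):

```python
# Number of SCONJ nodes by child count (leaves, one child, two children).
NODES_BY_DEGREE = {0: 1022, 1: 31, 2: 1}
# Children of SCONJ nodes, broken down by relation and by part of speech.
CHILD_RELATIONS = {"fixed": 31, "advmod": 1, "punct": 1}
CHILD_POS = {"ADP": 27, "ADV": 3, "NOUN": 1, "PART": 1, "PUNCT": 1}

# Every SCONJ token falls into exactly one degree bucket.
total_nodes = sum(NODES_BY_DEGREE.values())
# Each node contributes as many children as its degree.
total_children = sum(degree * n for degree, n in NODES_BY_DEGREE.items())

print(total_nodes)     # 1054
print(total_children)  # 33
print(total_children == sum(CHILD_RELATIONS.values()) == sum(CHILD_POS.values()))  # True
```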