Treebank Statistics: UD_Romanian-ArT: POS Tags: PART
There are 3 PART
lemmas (1%), 11 PART
types (3%) and 50 PART
tokens (9%).
Out of 14 observed tags, the rank of PART
is: 13 in number of lemmas, 6 in number of types and 6 in number of tokens.
The 10 most frequent PART
lemmas: să, nu, sâ
The 10 most frequent PART
types: s-, nu, să, z-, -s, n-, no, no-, nu-, sa
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: s- (PART 20, PRON 9), z- (PART 2, PRON 1), -s (PART 1, PRON 1), si (PRON 2, PART 1)
- s-
- z-
- -s
- si
Morphology
The form / lemma ratio of PART
is 3.666667 (the average of all parts of speech is 1.341667).
The 1st highest number of forms (6) was observed with the lemma “să”: -s, s-, sa, si, să, z-.
The 2nd highest number of forms (5) was observed with the lemma “nu”: n-, no, no-, nu, nu-.
The 3rd highest number of forms (1) was observed with the lemma “sâ”: s-.
PART
occurs with 3 features: Mood (30; 60% instances), Polarity (20; 40% instances), Variant (14; 28% instances)
PART
occurs with 4 feature-value pairs: Mood=Cnd
, Mood=Sub
, Polarity=Neg
, Variant=Short
PART
occurs with 5 feature combinations.
The most frequent feature combination is Polarity=Neg
(19 tokens).
Examples: nu, no, no-, nu-, n-
Relations
PART
nodes are attached to their parents using 2 different relations: mark (30; 60% instances), advmod (20; 40% instances)
Parents of PART
nodes belong to 2 different parts of speech: VERB (49; 98% instances), ADJ (1; 2% instances)
50 (100%) PART
nodes are leaves.
The highest child degree of a PART
node is 0.