Treebank Statistics: UD_Albanian-TSA: POS Tags: PART
There are 8 PART lemmas (2%), 8 PART types (2%) and 36 PART tokens (4%).
Out of 14 observed tags, the rank of PART is: 8 in number of lemmas, 8 in number of types and 10 in number of tokens.
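Counts and ranks of this kind can be derived from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that produced this page: the function names `upos_counts` and `rank` are hypothetical, the sample sentence is invented rather than taken from the treebank, and it assumes that word forms are compared case-sensitively and that ties in rank are broken arbitrarily.

```python
from collections import defaultdict

# Invented four-token CoNLL-U fragment (10 tab-separated columns per token).
# It is NOT taken from UD_Albanian-TSA; it only illustrates the format.
SAMPLE = "\n".join([
    "# text = (invented example)",
    "1\tnuk\tnuk\tPART\t_\tPolarity=Neg\t2\tadvmod\t_\t_",
    "2\tdi\tdi\tVERB\t_\t_\t0\troot\t_\t_",
    "3\ttë\ttë\tPART\t_\t_\t4\tmark\t_\t_",
    "4\tshkoj\tshkoj\tVERB\t_\t_\t2\txcomp\t_\t_",
    "",
])


def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: case-sensitive comparison
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, index):
    """1-based rank of `upos` by counts[...][index] (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1


counts = upos_counts(SAMPLE)
print(counts["PART"])
```

Applied to the whole treebank, `upos_counts` would be expected to give 8 lemmas, 8 types and 36 tokens for PART, provided the assumptions above match how the statistics were actually computed.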
The 10 most frequent PART lemmas: të, më, duke, madje, s’, nuk, se, t’
The 10 most frequent PART types: të, më, duke, madje, s’, nuk, se, t’
The 10 most frequent ambiguous lemmas: më (PART 10, ADP 1), se (SCONJ 2, PART 1)
The 10 most frequent ambiguous types: të (DET 47, PART 17), më (PART 10, ADP 1), se (SCONJ 2, PART 1)
Morphology
The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.167464).
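The ratio above can be reproduced with a short calculation. The sketch below assumes that the ratio is the number of distinct forms per lemma, summed over lemmas and divided by the number of lemmas; the exact definition used to generate this page is not stated here, so treat that as an assumption. The helper names are hypothetical. The PART pairs are built from the lemma and type lists above, assuming each form is identical to its lemma.

```python
from collections import defaultdict

# (form, lemma) pairs for PART, assuming every form equals its lemma,
# as the identical lemma and type lists on this page suggest.
PART_PAIRS = [(w, w) for w in ["të", "më", "duke", "madje", "s’", "nuk", "se", "t’"]]


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_per_lemma(pairs)
    n_forms = sum(len(s) for s in forms.values())
    return n_forms / len(forms)


print(f"{form_lemma_ratio(PART_PAIRS):.6f}")
```

A ratio of exactly 1 means that no PART lemma is observed with more than one form, which is consistent with particles being uninflected.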
The 1st highest number of forms (1) was observed with the lemma “duke”: duke.
The 2nd highest number of forms (1) was observed with the lemma “madje”: madje.
The 3rd highest number of forms (1) was observed with the lemma “më”: më.
PART occurs with 1 feature: Polarity (3; 8% instances)
PART occurs with 1 feature-value pair: Polarity=Neg
PART occurs with 2 feature combinations.
The most frequent feature combination is _ (33 tokens).
Examples: të, më, duke, madje, se, t’
Relations
PART nodes are attached to their parents using 3 different relations: mark (20; 56% instances), advmod (15; 42% instances), case (1; 3% instances)
Parents of PART nodes belong to 4 different parts of speech: VERB (24; 67% instances), ADJ (7; 19% instances), ADV (3; 8% instances), NOUN (2; 6% instances)
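The percentages in this section follow directly from the raw counts and the 36 PART tokens. A minimal sketch, with a hypothetical helper name `percentages` and assuming each share is rounded to the nearest whole percent:

```python
def percentages(counts):
    """Convert raw counts to whole-number percentages of their total."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


# Raw counts as reported on this page (both sum to the 36 PART tokens).
relations = {"mark": 20, "advmod": 15, "case": 1}
parents = {"VERB": 24, "ADJ": 7, "ADV": 3, "NOUN": 2}

print(percentages(relations))
print(percentages(parents))
```

Because each share is rounded independently, the relation percentages add up to 101 rather than 100.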
36 (100%) PART nodes are leaves.
The highest child degree of a PART node is 0.
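The child degree of a node is the number of tokens whose HEAD points to it, and a leaf is a node with child degree 0. The sketch below shows one way to compute this for a single sentence; the function name `child_degrees` and the four-token sentence are hypothetical and not taken from the treebank.

```python
from collections import Counter


def child_degrees(sentence):
    """sentence: list of (token_id, head_id, upos) for one sentence.

    Returns {token_id: number of direct dependents}.
    """
    n_children = Counter(head for _, head, _ in sentence)
    # Counter yields 0 for ids that never occur as a head, i.e. for leaves.
    return {tid: n_children[tid] for tid, _, _ in sentence}


# Invented sentence: two PART tokens, each attached to a VERB; head 0 is the root.
SENT = [(1, 2, "PART"), (2, 0, "VERB"), (3, 4, "PART"), (4, 2, "VERB")]

degrees = child_degrees(SENT)
part_degrees = [degrees[tid] for tid, _, upos in SENT if upos == "PART"]
print(max(part_degrees), all(d == 0 for d in part_degrees))
```

In this toy sentence every PART token is a leaf, which mirrors the situation reported for the treebank.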