
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: POS Tags: PART

There are 40 PART lemmas (0%), 40 PART types (0%) and 1080 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 9 in number of lemmas, 11 in number of types and 13 in number of tokens.
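The lemma, type and token counts above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that produced this page. The sample sentences are hypothetical (not taken from UD_Russian-GSD), the function names are illustrative, and it assumes that types are lowercased word forms and that ties in the ranking are broken by first occurrence.

```python
from collections import Counter

# Hypothetical two-sentence CoNLL-U sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = Он не знал и не спросил.",
    "1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tне\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "3\tзнал\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tи\tи\tCCONJ\t_\t_\t6\tcc\t_\t_",
    "5\tне\tне\tPART\t_\tPolarity=Neg\t6\tadvmod\t_\t_",
    "6\tспросил\tспросить\tVERB\t_\t_\t3\tconj\t_\t_",
    "7\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
    "# text = Даже он же ушёл.",
    "1\tДаже\tдаже\tPART\t_\t_\t2\tadvmod\t_\t_",
    "2\tон\tон\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "3\tже\tже\tPART\t_\t_\t2\tadvmod\t_\t_",
    "4\tушёл\tуйти\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form.lower())  # assumption: types are case-folded
            tokens += 1
    return len(lemmas), len(types), tokens


def token_rank(conllu_text, upos):
    """Rank of a tag by token count (1 = most frequent)."""
    counts = Counter(tag for _, _, tag in read_tokens(conllu_text))
    ordered = [tag for tag, _ in counts.most_common()]
    return ordered.index(upos) + 1


print(pos_counts(SAMPLE, "PART"))  # (3, 3, 4)
print(token_rank(SAMPLE, "PART"))  # 1
```

Ranking by lemmas or by types works the same way, with the per-tag set sizes in place of the token counts.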

The 10 most frequent PART lemmas: не, и, же, также, только, лишь, даже, де, это, ни

The 10 most frequent PART types: не, и, же, также, только, лишь, даже, де, это, ни

The 10 most frequent ambiguous lemmas: не (PART 438, CCONJ 2), и (CCONJ 2230, PART 119, X 3), также (CCONJ 103, PART 84, ADV 4), только (PART 79, CCONJ 1), де (PART 6, X 5), это (PRON 151, PART 24), ни (PART 23, CCONJ 6), именно (PART 18, ADV 2), просто (PART 9, ADV 2), ла (PART 5, NOUN 1, X 1)

The 10 most frequent ambiguous types: не (PART 424, CCONJ 2), и (CCONJ 2215, PART 119, X 3), же (PART 115, X 5), также (PART 83, CCONJ 78, ADV 3), только (PART 78, CCONJ 1), де (PART 26, X 4), это (PRON 58, DET 23, PART 22), ни (PART 21, CCONJ 6), именно (PART 10, ADV 2), просто (PART 9, ADV 1)
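A lemma or type is ambiguous here when it occurs with PART and with at least one other tag. The sketch below shows one way such lists could be derived. It uses hypothetical sample sentences, an illustrative function name, and the assumption that types are compared after lowercasing.

```python
from collections import Counter, defaultdict

FORM, LEMMA, UPOS = 1, 2, 3  # CoNLL-U column indices (0-based)

# Hypothetical sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = И он и она не знали.",
    "1\tИ\tи\tPART\t_\t_\t2\tadvmod\t_\t_",
    "2\tон\tон\tPRON\t_\t_\t6\tnsubj\t_\t_",
    "3\tи\tи\tCCONJ\t_\t_\t4\tcc\t_\t_",
    "4\tона\tона\tPRON\t_\t_\t2\tconj\t_\t_",
    "5\tне\tне\tPART\t_\tPolarity=Neg\t6\tadvmod\t_\t_",
    "6\tзнали\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "7\t.\t.\tPUNCT\t_\t_\t6\tpunct\t_\t_",
    "",
    "# text = Это и есть он.",
    "1\tЭто\tэто\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "2\tи\tи\tPART\t_\t_\t4\tadvmod\t_\t_",
    "3\tесть\tбыть\tAUX\t_\t_\t4\tcop\t_\t_",
    "4\tон\tон\tPRON\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def ambiguous(conllu_text, upos, column=LEMMA):
    """Map each lemma (or type) seen with `upos` and another tag to its tag counts."""
    tags_by_key = defaultdict(Counter)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if line.startswith("#") or not cols[0].isdigit():
            continue
        key = cols[column].lower() if column == FORM else cols[column]
        tags_by_key[key][cols[UPOS]] += 1
    return {k: t for k, t in tags_by_key.items() if upos in t and len(t) > 1}


for lemma, tags in ambiguous(SAMPLE, "PART").items():
    print(lemma, dict(tags))  # и {'PART': 2, 'CCONJ': 1}
```

Passing `column=FORM` gives the corresponding list for types instead of lemmas.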

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.598617).

The highest number of forms (2) was observed with the lemma “все-таки”: все, все-таки.

The second-highest number of forms (1) was observed with the lemma “Ван”: ван.

The third-highest number of forms (1) was observed with the lemma “Д”: д.
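The page does not define the form / lemma ratio. The sketch below assumes it is the number of distinct types divided by the number of distinct lemmas, which matches the 40 types and 40 lemmas reported for PART. The sample and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = Он же не знал.",
    "1\tОн\tон\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "2\tже\tже\tPART\t_\t_\t1\tadvmod\t_\t_",
    "3\tне\tне\tPART\t_\tPolarity=Neg\t4\tadvmod\t_\t_",
    "4\tзнал\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
    "# text = Что ж, идём.",
    "1\tЧто\tчто\tPRON\t_\t_\t4\tdiscourse\t_\t_",
    "2\tж\tже\tPART\t_\t_\t1\tadvmod\t_\t_",
    "3\t,\t,\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "4\tидём\tидти\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def forms_per_lemma(conllu_text, upos):
    """Map each lemma of the given tag to its set of lowercased forms."""
    forms = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if line.startswith("#") or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms[cols[2]].add(cols[1].lower())
    return forms


def form_lemma_ratio(forms):
    """Distinct types divided by distinct lemmas (assumed definition)."""
    if not forms:
        return 0.0
    types = set().union(*forms.values())
    return len(types) / len(forms)


forms = forms_per_lemma(SAMPLE, "PART")
# Lemmas ordered by number of forms, then alphabetically.
ranking = sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))
print(ranking[0][0], sorted(ranking[0][1]))  # же ['ж', 'же']
print(form_lemma_ratio(forms))               # 1.5
```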

PART occurs with 2 features: Polarity (432; 40% of instances), Typo (1; 0% of instances)

PART occurs with 2 feature-value pairs: Polarity=Neg, Typo=Yes

PART occurs with 3 feature combinations. The most frequent feature combination is _ (647 tokens). Examples: и, же, также, только, лишь, даже, не, де, это, именно
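The feature, feature-value and feature-combination counts all come from the FEATS column, where `_` marks a token without features. A minimal sketch of the tally follows, using a hypothetical sample and an illustrative function name.

```python
from collections import Counter

# Hypothetical sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = Он не знал и не спросил.",
    "1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tне\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "3\tзнал\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tи\tи\tCCONJ\t_\t_\t6\tcc\t_\t_",
    "5\tне\tне\tPART\t_\tPolarity=Neg\t6\tadvmod\t_\t_",
    "6\tспросил\tспросить\tVERB\t_\t_\t3\tconj\t_\t_",
    "7\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
    "# text = Даже он же ушёл.",
    "1\tДаже\tдаже\tPART\t_\t_\t2\tadvmod\t_\t_",
    "2\tон\tон\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "3\tже\tже\tPART\t_\t_\t2\tadvmod\t_\t_",
    "4\tушёл\tуйти\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def feature_stats(conllu_text, upos):
    """Count features, feature-value pairs and full FEATS combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if line.startswith("#") or not cols[0].isdigit():
            continue
        if cols[3] != upos:
            continue
        feats = cols[5]
        combos[feats] += 1  # "_" counts as the empty combination
        if feats != "_":
            for pair in feats.split("|"):
                pairs[pair] += 1
                features[pair.split("=", 1)[0]] += 1
    return features, pairs, combos


features, pairs, combos = feature_stats(SAMPLE, "PART")
total = sum(combos.values())
for name, count in features.items():
    print(f"{name} ({count}; {count / total:.0%} of instances)")
# Polarity (2; 50% of instances)
```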

Relations

PART nodes are attached to their parents using 16 different relations (count; share of instances): advmod (927; 86%), fixed (50; 5%), flat:name (27; 3%), expl (25; 2%), cc (12; 1%), flat:foreign (8; 1%), appos (6; 1%), case (5; 0%), mark (5; 0%), nmod (5; 0%), nsubj (4; 0%), discourse (2; 0%), conj (1; 0%), obj (1; 0%), obl (1; 0%), parataxis (1; 0%)

Parents of PART nodes belong to 12 different parts of speech (count; share of instances): VERB (474; 44%), NOUN (209; 19%), DET (83; 8%), ADJ (80; 7%), ADV (72; 7%), PROPN (58; 5%), PRON (40; 4%), NUM (28; 3%), PART (20; 2%), CCONJ (11; 1%), X (3; 0%), SYM (2; 0%)
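Both distributions above follow from the HEAD and DEPREL columns: the relation is read directly from the PART token, and the parent's tag is looked up by the head ID within the same sentence. The sketch below illustrates this on a hypothetical sample. The function names are illustrative, and labelling a head of 0 as `ROOT` is an assumption of this sketch.

```python
from collections import Counter

# Hypothetical sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = Он не знал и не спросил.",
    "1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tне\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "3\tзнал\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "4\tи\tи\tCCONJ\t_\t_\t6\tcc\t_\t_",
    "5\tне\tне\tPART\t_\tPolarity=Neg\t6\tadvmod\t_\t_",
    "6\tспросил\tспросить\tVERB\t_\t_\t3\tconj\t_\t_",
    "7\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
    "# text = Не только он пришёл.",
    "1\tНе\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "2\tтолько\tтолько\tPART\t_\t_\t1\tfixed\t_\t_",
    "3\tон\tон\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "4\tпришёл\tприйти\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def sentences(conllu_text):
    """Yield each sentence as a list of column lists (regular tokens only)."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip ranges (1-2) and empty nodes (1.1)
                sent.append(cols)
    if sent:
        yield sent


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent tags for nodes with the given tag."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences(conllu_text):
        tag_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # Head 0 has no token; label it ROOT (assumption of this sketch).
            parent_tags[tag_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE, "PART")
print(relations.most_common())  # [('advmod', 3), ('fixed', 1)]
print(dict(parent_tags))        # {'VERB': 2, 'PRON': 1, 'PART': 1}
```

The same lookup, run from each token to its head and filtered on the head's tag, would give the statistics for children of PART nodes.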

1027 (95%) PART nodes are leaves.

39 (4%) PART nodes have one child.

9 (1%) PART nodes have two children.

5 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.
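The child degree of a node is the number of tokens in the same sentence whose HEAD points to it, and a leaf is a node with degree 0. The following sketch, on a hypothetical sample and with illustrative names, shows how the leaf count, the degree histogram and the maximum degree could be computed.

```python
from collections import Counter

# Hypothetical sample; not taken from UD_Russian-GSD.
SAMPLE = "\n".join([
    "# text = Он не знал.",
    "1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\tне\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "3\tзнал\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
    "# text = Не только он пришёл.",
    "1\tНе\tне\tPART\t_\tPolarity=Neg\t3\tadvmod\t_\t_",
    "2\tтолько\tтолько\tPART\t_\t_\t1\tfixed\t_\t_",
    "3\tон\tон\tPRON\t_\t_\t4\tnsubj\t_\t_",
    "4\tпришёл\tприйти\tVERB\t_\t_\t0\troot\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "",
])


def sentences(conllu_text):
    """Yield each sentence as a list of column lists (regular tokens only)."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip ranges (1-2) and empty nodes (1.1)
                sent.append(cols)
    if sent:
        yield sent


def child_degrees(conllu_text, upos):
    """Return the number of children of every node with the given tag."""
    degrees = []
    for sent in sentences(conllu_text):
        # How many tokens name each ID as their head.
        n_children = Counter(cols[6] for cols in sent)
        degrees.extend(n_children[cols[0]] for cols in sent if cols[3] == upos)
    return degrees


degrees = child_degrees(SAMPLE, "PART")
histogram = Counter(degrees)
print(histogram[0], "leaves out of", len(degrees))  # 2 leaves out of 3
print("highest child degree:", max(degrees))        # highest child degree: 1
```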

Children of PART nodes are attached using 12 different relations (count; share of instances): punct (23; 30%), fixed (15; 20%), advmod (12; 16%), flat:name (9; 12%), flat:foreign (8; 11%), nmod (2; 3%), nummod:entity (2; 3%), appos (1; 1%), case (1; 1%), cc (1; 1%), conj (1; 1%), goeswith (1; 1%)

Children of PART nodes belong to 11 different parts of speech (count; share of instances): PUNCT (23; 30%), PART (20; 26%), PROPN (19; 25%), ADV (5; 7%), NOUN (2; 3%), NUM (2; 3%), ADP (1; 1%), CCONJ (1; 1%), DET (1; 1%), SCONJ (1; 1%), X (1; 1%)