This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: X

There are 32 X lemmas (1%), 58 X types (1%) and 81 X tokens (0%). Out of 16 observed tags, the rank of X is: 10 in number of lemmas, 9 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: -, –, COVID, hoc, ad, from, you, бацька, поехать, .

The 10 most frequent X types: COVID, hoc, на, русском, ad, from, you, бацька, будете, вы

The 10 most frequent ambiguous lemmas: - (PUNCT 118, X 23), – (PUNCT 166, X 14), . (PUNCT 3022, X 1)

The 10 most frequent ambiguous types: на (ADP 587, X 3), будете (AUX 7, X 2), за (ADP 272, X 2, ADV 1), я (PRON 267, X 2), . (PUNCT 3022, X 1), не (PART 519, X 1)

Morphology

The form / lemma ratio of X is 1.812500 (the average of all parts of speech is 1.786380).
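The ratio above appears to be the number of X types divided by the number of X lemmas (58 / 32 = 1.8125). The following is a minimal sketch of how such counts could be derived from a CoNLL-U file, assuming that types are case-sensitive word forms. The function name `upos_counts` and the sample sentence are hypothetical and are not part of the treebank or its tooling.

```python
def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text.

    Assumption: a "type" is a distinct, case-sensitive word form.
    """
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Keep only regular word lines; multiword-token ranges (1-2)
        # and empty nodes (1.1) have non-integer IDs and are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            forms.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(forms), tokens


# Invented sample, for illustration only.
SAMPLE = "\n".join([
    "# text = ad hoc Ad .",
    "1\tad\tad\tX\t_\tForeign=Yes\t0\troot\t_\t_",
    "2\thoc\thoc\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_",
    "3\tAd\tad\tX\t_\tForeign=Yes\t1\tconj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

n_lemmas, n_types, n_tokens = upos_counts(SAMPLE, "X")
ratio = n_types / n_lemmas  # form / lemma ratio for the sample
```

On the invented sample the function finds 2 lemmas, 3 types and 3 tokens, so the ratio is 1.5. The figures reported on this page give 58 / 32 = 1.8125.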

The highest number of forms (15) was observed with the lemma “-”: будете, вы, единую, за, извиняюсь, молчать, на, не, разговаривать, разговариваю, русском, страну, что, я, языке.

The 2nd highest number of forms (14) was observed with the lemma “–”: Бацька, И, Лукашэнка, дапамогу, дзякуй, за, косами, мертвые, прэзідэнт, с, сапраўдны, стоят, тишина, только.

The 3rd highest number of forms (2) was observed with the lemma “бацька”: бацька, бацькаю.

X occurs with 1 feature: Foreign (80; 99% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (80 tokens). Examples: COVID, hoc, на, русском, ad, from, you, бацька, будете, вы

Relations

X nodes are attached to their parents using 12 different relations: flat:foreign (46; 57% instances), conj (7; 9% instances), nmod (7; 9% instances), appos (6; 7% instances), parataxis (4; 5% instances), amod (3; 4% instances), ccomp (2; 2% instances), obl (2; 2% instances), acl (1; 1% instances), advmod (1; 1% instances), fixed (1; 1% instances), obj (1; 1% instances)
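The percentages above are consistent with each relation count divided by the 81 X tokens and rounded to a whole number (for example, 46 / 81 ≈ 57%). A rough sketch of that tally over CoNLL-U text follows. The function name `relation_shares` and the sample are hypothetical, and Python's built-in `round` is an assumption about the rounding rule rather than the documented behavior of the statistics generator.

```python
from collections import Counter


def relation_shares(conllu_text, upos):
    """List (deprel, count, percent) for nodes with the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Regular word lines have 10 columns and an integer ID;
        # comments, blank lines, ranges and empty nodes are skipped.
        if len(cols) == 10 and cols[0].isdigit() and cols[3] == upos:
            counts[cols[7]] += 1  # column 8 is DEPREL
    total = sum(counts.values())
    # most_common() sorts by descending count.
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Invented sample, for illustration only.
SAMPLE = "\n".join([
    "# text = status quo ante bellum",
    "1\tstatus\tstatus\tX\t_\tForeign=Yes\t0\troot\t_\t_",
    "2\tquo\tquo\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_",
    "3\tante\tante\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_",
    "4\tbellum\tbellum\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_",
])

shares = relation_shares(SAMPLE, "X")
```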

Parents of X nodes belong to 5 different parts of speech: X (54; 67% instances), NOUN (13; 16% instances), VERB (11; 14% instances), PROPN (2; 2% instances), PRON (1; 1% instances)

58 (72%) X nodes are leaves.

11 (14%) X nodes have one child.

0 (0%) X nodes have two children.

12 (15%) X nodes have three or more children.

The highest child degree of an X node is 21.
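The child-degree figures above partition the 81 X tokens (58 + 11 + 0 + 12 = 81). As an illustration, the sketch below counts the children of each node from the HEAD column of CoNLL-U text and sorts nodes of one tag into the same four buckets. The function name `child_degree_buckets`, the bucket labels and the sample are hypothetical.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos):
    """Count nodes of one UPOS tag by number of children."""
    buckets = Counter()
    sentence = []  # (id, upos, head) triples of the current sentence

    def flush():
        # Number of children per node = how often its ID occurs as a HEAD.
        n_children = Counter(head for _, _, head in sentence)
        for node_id, tag, _ in sentence:
            if tag == upos:
                degree = n_children[node_id]
                if degree < 3:
                    key = ("leaf", "one", "two")[degree]
                else:
                    key = "three_or_more"
                buckets[key] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends a sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        sentence.append((cols[0], cols[3], cols[6]))  # ID, UPOS, HEAD
    flush()  # handle a final sentence without a trailing blank line
    return dict(buckets)


# Invented two-sentence sample, for illustration only.
SAMPLE = "\n".join([
    "1\tad\tad\tX\t_\tForeign=Yes\t0\troot\t_\t_",
    "2\thoc\thoc\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
    "1\tCOVID\tCOVID\tX\t_\tForeign=Yes\t0\troot\t_\t_",
])

result = child_degree_buckets(SAMPLE, "X")
```

In the sample, "ad" has two children ("hoc" and the full stop), while "hoc" and "COVID" are leaves.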

Children of X nodes are attached using 15 different relations: flat:foreign (46; 44% instances), punct (29; 28% instances), conj (8; 8% instances), nsubj (4; 4% instances), case (3; 3% instances), parataxis (3; 3% instances), advmod:neg (2; 2% instances), mark (2; 2% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), appos (1; 1% instances), cc (1; 1% instances), det (1; 1% instances), discourse (1; 1% instances), fixed (1; 1% instances)

Children of X nodes belong to 13 different parts of speech: X (54; 52% instances), PUNCT (29; 28% instances), ADP (3; 3% instances), NOUN (3; 3% instances), PART (3; 3% instances), ADV (2; 2% instances), NUM (2; 2% instances), PRON (2; 2% instances), SCONJ (2; 2% instances), ADJ (1; 1% instances), CCONJ (1; 1% instances), DET (1; 1% instances), VERB (1; 1% instances)