home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: X

There are 733 X lemmas (1%), 732 X types (1%) and 1061 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: of, the, and, artist, empire, in, V&A, X, for, i

The 10 most frequent X types: of, the, and, Artist, Empire, in, V&A, X, for, i

The 10 most frequent ambiguous lemmas: X (X 9, ADJ 4, PROPN 2), i (NUM 19, ADJ 7, X 3), Y (X 6, PROPN 2, NOUN 1), g (X 3, SYM 1), а (CCONJ 8346, INTJ 16, NOUN 8, PART 6, X 4), University (X 3, PROPN 2), daily (X 3, ADJ 1), б (NOUN 5, X 3), и (CCONJ 35003, PART 6269, NOUN 4, X 3, VERB 1), * (X 2, PUNCT 1)

The 10 most frequent ambiguous types: X (ADJ 9, X 9, NUM 4, PROPN 2), Y (X 6, PROPN 2), Nature (X 4, PROPN 2), g (X 4, SYM 1), homo (PROPN 4, X 3), а (CCONJ 5759, INTJ 5, X 4, NOUN 3, PART 2), University (X 3, PROPN 2), daily (X 3, ADJ 1), б (AUX 22, X 3, PART 1), и (CCONJ 31843, PART 6210, NOUN 4, X 3)

Morphology

The form / lemma ratio of X is 0.998636 (the average of all parts of speech is 2.654430).

The 1st highest number of forms (1) was observed with the lemma “&”: &.

The 2nd highest number of forms (1) was observed with the lemma “*”: *.

The 3rd highest number of forms (1) was observed with the lemma “.doc”: .doc.

X occurs with 1 features: Foreign (1058; 100% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (1058 tokens). Examples: of, the, and, Artist, Empire, in, V&A, X, for, i

Relations

X nodes are attached to their parents using 22 different relations: flat:foreign (565; 53% instances), appos (184; 17% instances), parataxis (58; 5% instances), nmod (57; 5% instances), nsubj (51; 5% instances), conj (39; 4% instances), obl (31; 3% instances), root (28; 3% instances), obj (13; 1% instances), compound (7; 1% instances), flat (5; 0% instances), vocative (5; 0% instances), fixed (4; 0% instances), nsubj:pass (4; 0% instances), orphan (2; 0% instances), xcomp (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), flat:name (1; 0% instances), list (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: NOUN (469; 44% instances), X (294; 28% instances), VERB (134; 13% instances), PROPN (76; 7% instances), ADJ (31; 3% instances), (28; 3% instances), ADV (9; 1% instances), DET (8; 1% instances), NUM (5; 0% instances), PRON (3; 0% instances), CCONJ (2; 0% instances), PART (2; 0% instances)

586 (55%) X nodes are leaves.

189 (18%) X nodes have one child.

144 (14%) X nodes have two children.

142 (13%) X nodes have three or more children.

The highest child degree of a X node is 23.

Children of X nodes are attached using 28 different relations: punct (551; 49% instances), flat:foreign (252; 22% instances), appos (67; 6% instances), case (48; 4% instances), parataxis (37; 3% instances), conj (32; 3% instances), amod (30; 3% instances), cc (29; 3% instances), advmod (16; 1% instances), nsubj (15; 1% instances), nmod (12; 1% instances), det (10; 1% instances), flat (8; 1% instances), obl (4; 0% instances), mark (3; 0% instances), nummod (3; 0% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), cop (2; 0% instances), fixed (2; 0% instances), orphan (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), expl (1; 0% instances), iobj (1; 0% instances), nummod:gov (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (551; 49% instances), X (294; 26% instances), NOUN (63; 6% instances), ADP (46; 4% instances), ADJ (36; 3% instances), CCONJ (29; 3% instances), VERB (22; 2% instances), PROPN (16; 1% instances), ADV (15; 1% instances), SYM (15; 1% instances), DET (11; 1% instances), NUM (11; 1% instances), PART (8; 1% instances), PRON (8; 1% instances), SCONJ (7; 1% instances), AUX (2; 0% instances)