This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: SYM

There are 266 SYM lemmas (1%), 266 SYM types (1%) and 880 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 9 in number of types and 16 in number of tokens.
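The three counts and the ranks above can be reproduced from a treebank in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the sample sentences are invented, the function names `upos_counts` and `rank` are hypothetical, and it assumes that a "type" is a distinct word form compared case-sensitively.

```python
from collections import defaultdict

# Invented CoNLL-U sample (not taken from UD_Russian-Taiga).
SAMPLE = """\
1\tспасибо\tспасибо\tINTJ\t_\t_\t0\troot\t_\t_
2\t)))\t)))\tSYM\t_\t_\t1\tdiscourse\t_\t_

1\t5\t5\tNUM\t_\t_\t2\tnummod\t_\t_
2\t%\t%\tSYM\t_\t_\t0\troot\t_\t_
3\t)))\t)))\tSYM\t_\t_\t2\tdiscourse\t_\t_
"""


def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines: this skips blank lines, comments,
        # multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, index):
    """1-based rank of `upos` by counts[...][index]; ties keep insertion order."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1


counts = upos_counts(SAMPLE)
print(counts["SYM"])            # (lemmas, types, tokens) for SYM
print(rank(counts, "SYM", 2))   # rank of SYM by number of tokens
```

Index 0, 1 or 2 in `rank` selects lemmas, types or tokens respectively, which corresponds to the three ranks reported for SYM.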

The 10 most frequent SYM lemmas: ), 👍, +, ))), )), %, =, :), ☝️, ))))

The 10 most frequent SYM types: ), 👍, +, ))), )), %, =, :), ☝️, ))))

The 10 most frequent ambiguous lemmas: ) (PUNCT 820, SYM 85), + (SYM 48, CCONJ 6), ))) (SYM 37, PUNCT 1), = (SYM 19, PUNCT 2), * (PUNCT 28, SYM 10, X 9, CCONJ 1), - (PUNCT 1482, SYM 10, ADJ 1), / (PUNCT 66, SYM 10, ADP 7), х (SYM 10, NOUN 1), (( (SYM 8, PUNCT 1), ( (PUNCT 780, SYM 4)

The 10 most frequent ambiguous types: ) (PUNCT 820, SYM 85), + (SYM 48, CCONJ 6), ))) (SYM 37, PUNCT 1), = (SYM 19, PUNCT 2), * (PUNCT 28, SYM 10, X 9, CCONJ 1), - (PUNCT 1481, SYM 10, ADJ 1), / (PUNCT 66, SYM 10, ADP 7), х (SYM 5, NOUN 2), (( (SYM 8, PUNCT 1), ( (PUNCT 780, SYM 4)
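A lemma or type is "ambiguous" here in the sense that it occurs with more than one tag. The sketch below shows one way such a list could be computed. The input pairs are invented, the function name `ambiguous` is hypothetical, and the ordering (by frequency under the tag of interest, then tags by frequency within each entry) is an assumption that merely matches the lists above.

```python
from collections import Counter, defaultdict

# Invented (lemma, UPOS) observations; the counts are illustrative only.
pairs = [
    (")", "PUNCT"), (")", "PUNCT"), (")", "SYM"),
    ("+", "SYM"), ("+", "SYM"), ("+", "CCONJ"),
    ("%", "SYM"),
]


def ambiguous(pairs, upos):
    """Items seen with `upos` and at least one other tag, most frequent first."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    hits = [(item, tags) for item, tags in by_item.items()
            if upos in tags and len(tags) > 1]
    # Assumed ordering: descending frequency under the tag of interest.
    hits.sort(key=lambda entry: entry[1][upos], reverse=True)
    return hits


for item, tags in ambiguous(pairs, "SYM"):
    listing = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    print(f"{item} ({listing})")
```

Passing (form, UPOS) pairs instead of (lemma, UPOS) pairs gives the corresponding list of ambiguous types.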

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.879397).

The 1st highest number of forms (1) was observed with the lemma “”)))”: ”))).

The 2nd highest number of forms (1) was observed with the lemma “#”: #.

The 3rd highest number of forms (1) was observed with the lemma “$”: $.
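The form / lemma ratio and the per-lemma form counts above can be illustrated as follows. This is a sketch under an assumption: the ratio is taken to be the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, which is consistent with a ratio of exactly 1 when every lemma has a single form. The data and the function names are hypothetical.

```python
from collections import defaultdict

# Invented (form, lemma) pairs, one list per part of speech.
sym_pairs = [("%", "%"), (")))", ")))"), ("%", "%")]
noun_pairs = [("коты", "кот"), ("кота", "кот"), ("кот", "кот"), ("дом", "дом")]


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_by_lemma(pairs)
    return sum(len(f) for f in forms.values()) / len(forms)


print(form_lemma_ratio(sym_pairs))   # every symbol has a single form
print(form_lemma_ratio(noun_pairs))  # inflected nouns have several

# Lemmas ordered by number of forms, highest first.
top = sorted(forms_by_lemma(noun_pairs).items(),
             key=lambda entry: len(entry[1]), reverse=True)
print(top[0][0], len(top[0][1]))
```

Since symbols are not inflected, each SYM lemma has one form, so the three "highest" lemmas listed above all tie at 1.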

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 23 different relations: discourse (651; 74% instances), root (47; 5% instances), parataxis (24; 3% instances), case (22; 3% instances), flat (19; 2% instances), cc (15; 2% instances), nmod (15; 2% instances), conj (14; 2% instances), appos (13; 1% instances), compound (11; 1% instances), obj (11; 1% instances), obl (11; 1% instances), list (8; 1% instances), dep (4; 0% instances), nsubj (4; 0% instances), advmod (2; 0% instances), fixed (2; 0% instances), nsubj:pass (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), flat:foreign (1; 0% instances), nummod:entity (1; 0% instances)
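Each percentage in this list is the relation count divided by the 880 SYM tokens. The figures are consistent with rounding half up to a whole percent (for example, 22 of 880 is exactly 2.5% and is shown as 3%); that rounding rule is an inference from the numbers, not something the page states. A small sketch using a few counts from the list, with a hypothetical helper `percent_half_up`:

```python
# Counts copied from the list above; the total is the number of SYM tokens.
TOTAL_SYM = 880
relations = {"discourse": 651, "root": 47, "parataxis": 24, "case": 22, "dep": 4}


def percent_half_up(count, total):
    """Whole-number percentage, rounding halves up, in integer arithmetic.

    Python's built-in round() rounds halves to even (round(2.5) == 2),
    so it would not reproduce the 3% shown for `case`.
    """
    return (200 * count + total) // (2 * total)


for rel, n in relations.items():
    print(f"{rel} ({n}; {percent_half_up(n, TOTAL_SYM)}% instances)")
```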

Parents of SYM nodes belong to 15 different parts of speech: VERB (347; 39% instances), NOUN (249; 28% instances), ADJ (85; 10% instances), [no tag: the artificial root] (47; 5% instances), NUM (43; 5% instances), PROPN (24; 3% instances), X (22; 3% instances), SYM (15; 2% instances), PART (12; 1% instances), PRON (12; 1% instances), ADV (11; 1% instances), INTJ (6; 1% instances), DET (4; 0% instances), AUX (2; 0% instances), CCONJ (1; 0% instances)

802 (91%) SYM nodes are leaves.

31 (4%) SYM nodes have one child.

25 (3%) SYM nodes have two children.

22 (3%) SYM nodes have three or more children.

The highest child degree of a SYM node is 6.
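The leaf, one-child, two-children and three-or-more figures above partition the SYM nodes by child degree (802 + 31 + 25 + 22 = 880). The sketch below shows how such buckets could be derived from the HEAD column of a dependency tree. The sentence is invented and the function name `child_degree_buckets` is hypothetical.

```python
from collections import Counter

# Invented sentence as (id, head, upos) triples; head 0 is the artificial root.
sentence = [
    (1, 2, "NUM"),
    (2, 0, "SYM"),
    (3, 2, "NOUN"),
    (4, 2, "PUNCT"),
    (5, 3, "SYM"),
]


def child_degree_buckets(sentences, upos):
    """Count nodes tagged `upos` with 0, 1, 2 or 3+ children; also the maximum."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node = how often its id appears as a head.
        degree = Counter(head for _, head, _ in sent)
        for node_id, _, tag in sent:
            if tag != upos:
                continue
            d = degree[node_id]
            max_degree = max(max_degree, d)
            buckets[min(d, 3)] += 1  # key 3 stands for "three or more"
    return buckets, max_degree


buckets, max_degree = child_degree_buckets([sentence], "SYM")
print(dict(buckets), max_degree)
```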

Children of SYM nodes are attached using 21 different relations: nummod:gov (32; 19% instances), punct (31; 19% instances), nmod (22; 13% instances), nsubj (15; 9% instances), case (14; 8% instances), conj (11; 7% instances), nummod (8; 5% instances), amod (5; 3% instances), cc (4; 2% instances), parataxis (4; 2% instances), xcomp (4; 2% instances), advmod (3; 2% instances), appos (3; 2% instances), fixed (2; 1% instances), mark (2; 1% instances), orphan (2; 1% instances), cop (1; 1% instances), det (1; 1% instances), discourse (1; 1% instances), expl (1; 1% instances), iobj (1; 1% instances)

Children of SYM nodes belong to 16 different parts of speech: NUM (47; 28% instances), PUNCT (31; 19% instances), NOUN (29; 17% instances), SYM (15; 9% instances), ADP (13; 8% instances), X (6; 4% instances), ADJ (5; 3% instances), ADV (4; 2% instances), CCONJ (4; 2% instances), PRON (4; 2% instances), PART (2; 1% instances), SCONJ (2; 1% instances), VERB (2; 1% instances), AUX (1; 1% instances), DET (1; 1% instances), PROPN (1; 1% instances)