
This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SYM

There are 96 SYM lemmas (0%), 97 SYM types (0%) and 587 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 9 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, , a, u.c., Re, &, utt., b, **, c

The 10 most frequent SYM types: %, , a, Re, &, u.c., utt., .u, b, **

The 10 most frequent ambiguous lemmas: a (SYM 17, CCONJ 7, X 2), & (SYM 17, X 1), T (SYM 4, NOUN 1), AB.LV (PROPN 1, SYM 1), U (SYM 1, X 1), i (PART 6, CCONJ 2, NUM 1, SYM 1), m (NOUN 3, SYM 1), s (X 2, SYM 1), ā (INTJ 4, SYM 1)

The 10 most frequent ambiguous types: a (SYM 16, CCONJ 2, X 2), Re (SYM 20, INTJ 7), & (SYM 17, X 1), T (SYM 4, NOUN 1), N (SYM 2, NOUN 1), AB.LV (PROPN 1, SYM 1), M (PROPN 1, SYM 1), U (SYM 1, X 1), i (PART 6, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.010417 (the average of all parts of speech is 2.339090).
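The reported ratio is consistent with dividing the 97 SYM types by the 96 SYM lemmas (97 / 96 ≈ 1.010417). A minimal sketch of that calculation follows, assuming the ratio counts distinct forms per lemma and averages over lemmas; the function name `form_lemma_ratio` and the input shape are hypothetical and are not taken from the tool that generated this page.

```python
def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one POS tag.
    This input shape is an assumption made for illustration.
    """
    forms_by_lemma = {}
    for form, lemma in pairs:
        forms_by_lemma.setdefault(lemma, set()).add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Toy input: "u.c." has two forms, "%" and "&" have one each.
toy = [(".u", "u.c."), ("u.c.", "u.c."), ("%", "%"), ("&", "&"), ("%", "%")]
print(form_lemma_ratio(toy))
```

With the page's own totals, `round(97 / 96, 6)` gives the figure shown above.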

The 1st highest number of forms (2) was observed with the lemma “u.c.”: .u, u.c..

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “&”: &.

SYM occurs with 2 features: Abbr (58; 10% instances), Typo (13; 2% instances)

SYM occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (529 tokens). Examples: %, , a, Re, &, b, **, c, +, L

Relations

SYM nodes are attached to their parents using 19 different relations: conj (71; 12% instances), flat (68; 12% instances), flat:name (58; 10% instances), parataxis (55; 9% instances), obl (50; 9% instances), nsubj (49; 8% instances), root (48; 8% instances), nmod (37; 6% instances), discourse (31; 5% instances), iobj (29; 5% instances), amod (28; 5% instances), dep (28; 5% instances), acl (10; 2% instances), advmod (9; 2% instances), nsubj:pass (6; 1% instances), xcomp (4; 1% instances), advcl (2; 0% instances), cc (2; 0% instances), orphan (2; 0% instances)
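The percentages above appear to be each relation's count divided by the 587 SYM tokens, rounded to a whole number (for example, 71 / 587 ≈ 12%). A minimal sketch of that tabulation, assuming a flat list of relation labels as input; `relation_shares` is a hypothetical helper name.

```python
from collections import Counter


def relation_shares(deprels):
    """Return (relation, count, rounded percent) tuples, most frequent first."""
    counts = Counter(deprels)
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Toy input, not treebank data.
for rel, n, pct in relation_shares(["conj", "conj", "root", "flat"]):
    print(f"{rel} ({n}; {pct}% instances)")
```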

Parents of SYM nodes belong to 11 different categories, counting the untagged artificial root as one: VERB (175; 30% instances), NOUN (161; 27% instances), SYM (82; 14% instances), PROPN (56; 10% instances), no tag, i.e. the artificial root (48; 8% instances), ADJ (23; 4% instances), NUM (20; 3% instances), X (14; 2% instances), ADV (6; 1% instances), DET (1; 0% instances), PART (1; 0% instances)

233 (40%) SYM nodes are leaves.

87 (15%) SYM nodes have one child.

169 (29%) SYM nodes have two children.

98 (17%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.
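The four child-degree counts above add up to the 587 SYM tokens (233 + 87 + 169 + 98). As an illustration of how such a breakdown could be derived from CoNLL-U data, here is a minimal sketch. It assumes the standard 10-column tab-separated layout with sentences separated by blank lines; the function name and bucket labels are hypothetical, and this is not the script that produced the page.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos="SYM"):
    """Bucket nodes with the given UPOS tag by their number of children."""
    buckets = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = []
        for line in sentence.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            # Keep plain word lines only; skip multiword ranges (1-2) and empty nodes (1.1).
            if len(cols) != 10 or not cols[0].isdigit():
                continue
            rows.append(cols)
        # Column 7 (index 6) is HEAD, so counting HEAD values gives each node's child count.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] != upos:  # column 4 (index 3) is UPOS
                continue
            degree = n_children[cols[0]]
            label = {0: "leaf", 1: "one", 2: "two"}.get(degree, "three_or_more")
            buckets[label] += 1
    return buckets
```

Called on the full text of a treebank file, the function returns a `Counter` whose four keys correspond to the four statements above.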

Children of SYM nodes are attached using 22 different relations: nummod (224; 28% instances), punct (201; 25% instances), case (83; 10% instances), nmod (74; 9% instances), flat (53; 7% instances), flat:name (41; 5% instances), cc (23; 3% instances), cop (14; 2% instances), amod (13; 2% instances), nsubj (13; 2% instances), conj (12; 2% instances), goeswith (12; 2% instances), obl (11; 1% instances), discourse (8; 1% instances), advmod (4; 1% instances), acl (3; 0% instances), dep (3; 0% instances), orphan (3; 0% instances), det (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: NUM (227; 28% instances), PUNCT (201; 25% instances), NOUN (104; 13% instances), ADP (83; 10% instances), SYM (82; 10% instances), X (24; 3% instances), CCONJ (23; 3% instances), VERB (17; 2% instances), AUX (14; 2% instances), PART (7; 1% instances), ADV (4; 1% instances), PROPN (4; 1% instances), ADJ (3; 0% instances), DET (3; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)