Treebank Statistics: UD_Estonian-EWT: POS Tags: SYM
There are 70 SYM
lemmas (1%), 71 SYM
types (0%) and 358 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 9 in number of lemmas, 13 in number of types and 15 in number of tokens.
The 10 most frequent SYM
lemmas: @, %, :), :D, ;), :s, :/, =), :(, :lol:
The 10 most frequent SYM
types: @, %, :), :D, ;), :s, :/, =), :(, :lol:
The 10 most frequent ambiguous lemmas: @ (SYM 96, PROPN 5, INTJ 1), :) (SYM 25, INTJ 5, PUNCT 1), :D (SYM 22, INTJ 2), * (PUNCT 10, SYM 5), € (SYM 5, NOUN 1), ” (PUNCT 580, SYM 4), + (PUNCT 16, SYM 4), = (PUNCT 5, SYM 3), & (CCONJ 7, SYM 2), -> (SYM 2, PUNCT 1)
The 10 most frequent ambiguous types: @ (SYM 96, PROPN 5, INTJ 1), :) (SYM 25, INTJ 5, PUNCT 1), :D (SYM 22, INTJ 1), * (PUNCT 10, SYM 5), € (SYM 5, NOUN 1), ” (PUNCT 578, SYM 4), + (PUNCT 16, SYM 4), = (PUNCT 5, SYM 3), & (CCONJ 7, SYM 2), -> (SYM 2, PUNCT 1)
- @
- :)
- :D
- *
- €
- ”
- +
- =
- &
- ->
- SYM 2: Turbonohik : Tallin -> põhiliselt ikkagi Saku , aga ega A Le Coq-ist ka ära ei ütle …
- PUNCT 1: Tartu Ülikooli esimeses füüsika ( ma ei tea , mis aine , vana klassivend rääkis , ehk füüsikaline maailmapilt ? ) loengus küsiti , et kes teist panid keskkoolis füüsika tunnis tähele -> mõned arglikud käed .
Morphology
The form / lemma ratio of SYM
is 1.014286 (the average of all parts of speech is 1.733702).
The 1st highest number of forms (2) was observed with the lemma “S3”: S3’med, S3-el.
The 2nd highest number of forms (1) was observed with the lemma “””: ”.
The 3rd highest number of forms (1) was observed with the lemma “%”: %.
SYM
occurs with 6 features: Abbr (14; 4% instances), Case (3; 1% instances), Number (3; 1% instances), Hyph (1; 0% instances), NumForm (1; 0% instances), NumType (1; 0% instances)
SYM
occurs with 8 feature-value pairs: Abbr=Yes
, Case=Ade
, Case=Nom
, Hyph=Yes
, NumForm=Digit
, NumType=Card
, Number=Plur
, Number=Sing
SYM
occurs with 7 feature combinations.
The most frequent feature combination is _
(341 tokens).
Examples: @, %, :), :D, ;), :s, :/, =), :(, :lol:
Relations
SYM
nodes are attached to their parents using 15 different relations: discourse (220; 61% instances), root (23; 6% instances), obl (20; 6% instances), advmod (18; 5% instances), dep (17; 5% instances), parataxis (12; 3% instances), nsubj:cop (11; 3% instances), nmod (8; 2% instances), nsubj (8; 2% instances), obj (7; 2% instances), flat (5; 1% instances), ccomp (3; 1% instances), conj (3; 1% instances), advcl (2; 1% instances), acl:relcl (1; 0% instances)
Parents of SYM
nodes belong to 10 different parts of speech: VERB (137; 38% instances), NOUN (94; 26% instances), PROPN (50; 14% instances), ADJ (23; 6% instances), (23; 6% instances), ADV (14; 4% instances), NUM (9; 3% instances), INTJ (3; 1% instances), PRON (3; 1% instances), PUNCT (2; 1% instances)
271 (76%) SYM
nodes are leaves.
47 (13%) SYM
nodes have one child.
23 (6%) SYM
nodes have two children.
17 (5%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 8.
Children of SYM
nodes are attached using 16 different relations: nummod (76; 43% instances), punct (23; 13% instances), nmod (19; 11% instances), nsubj:cop (14; 8% instances), cop (12; 7% instances), conj (6; 3% instances), mark (6; 3% instances), advmod (5; 3% instances), case (4; 2% instances), aux (3; 2% instances), det (3; 2% instances), cc (2; 1% instances), obl (2; 1% instances), flat (1; 1% instances), parataxis (1; 1% instances), xcomp (1; 1% instances)
Children of SYM
nodes belong to 12 different parts of speech: NUM (76; 43% instances), NOUN (33; 19% instances), PUNCT (23; 13% instances), AUX (15; 8% instances), ADV (6; 3% instances), SCONJ (6; 3% instances), ADP (4; 2% instances), DET (4; 2% instances), PROPN (4; 2% instances), VERB (3; 2% instances), CCONJ (2; 1% instances), PRON (2; 1% instances)