Treebank Statistics: UD_Finnish-TDT: POS Tags: SYM
There are 199 SYM lemmas (1%), 201 SYM types (0%) and 478 SYM tokens (0%).
Out of 15 observed tags, SYM ranks 8th by number of lemmas, 10th by number of types and 13th by number of tokens.
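As a rough illustration of how such counts could be derived, the sketch below tallies lemmas, types and tokens for one tag from CoNLL-U text. The function name `upos_counts` and the toy sentence are invented for this example, and it assumes that a type is a distinct, case-sensitive word form; the script that generated this page may count differently.

```python
from collections import Counter


def upos_counts(conllu_text, upos="SYM"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag.

    Hypothetical helper: assumes a "type" is a distinct word form.
    """
    lemmas, forms = Counter(), Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines and sentence-level comments.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines; multiword-token ranges ("1-2")
        # and empty nodes ("1.1") have non-integer IDs.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms[cols[1]] += 1
            lemmas[cols[2]] += 1
    return len(lemmas), len(forms), sum(forms.values())


# Made-up toy sentence, not taken from the treebank.
sample = "\n".join([
    "# text = Hyvä :) 5 % :)",
    "1\tHyvä\thyvä\tADJ\t_\t_\t0\troot\t_\t_",
    "2\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_",
    "3\t5\t5\tNUM\t_\t_\t4\tnummod\t_\t_",
    "4\t%\t%\tSYM\t_\t_\t1\tappos\t_\t_",
    "5\t:)\t:)\tSYM\t_\t_\t1\tdiscourse\t_\t_",
])

n_lemmas, n_types, n_tokens = upos_counts(sample)
print(n_lemmas, n_types, n_tokens)
```

On the toy sentence the two smileys collapse into one lemma and one type but still count as two tokens, which is the same distinction the figures above draw.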
The 10 most frequent SYM lemmas: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
The 10 most frequent SYM types: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
The 10 most frequent ambiguous lemmas: :) (SYM 64, PUNCT 1), % (SYM 42, NOUN 12), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 22, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), K (PROPN 1, SYM 1), V (ADJ 10, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
The 10 most frequent ambiguous types: :) (SYM 64, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 20, PROPN 2), A (NOUN 10, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), V (ADJ 7, NOUN 1, SYM 1), × (PROPN 4, SYM 1)
- :)
- &
- +
- SYM 20: - Ruisleipä + oivariini + oltermanni maistuu vaan niin hyvältä .
- PROPN 2: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
- A
- B
- V
- ADJ 7: Hänen isänsä oli kuningas Mithridates V Euergetes .
- NOUN 1: Siitä kehittyivät kreikkalaisen kirjaimiston digamma ja ypsilon , myöhemmin latinalaisen kirjaimiston F , V ja Y sekä edelleen U ja W .
- SYM 1: Gliese 581 eli HO Librae on Vaa’an tähdistössä sijaitseva punainen kääpiötähti , jonka spektriluokka on M2,5 V .
- ×
- PROPN 4: 2. Korvataan liitteessä II olevan II osan 2 kohdan A alakohdan taulukossa 4 sarakkeessa jäljempänä vasemmalla lueteltujen lajien kohdalla olevat merkinnät jäljempänä oikealla olevilla merkinnöillä : Alopecurus pratensis 2 Arrhenatherum elatius 2 Dactylis glomerata 2 Festuca arundinacea 2 Festuca ovina 2 Festuca pratensis 2 Festuca rubra 2 Lolium multiflorum 2 Lolium perenne 2 Lolium × boucheanum 2 Phalaris aquatica 2 Hedysarum coronarium 2 Lotus corniculatus 3 Lupinus albus 2 Lupinus angustifolius 2 Lupinus luteus 2 Medicago sativa 3 Medicago × varia 3 Onobrychis viciifolia 2 Pisum sativum 2 Trifolium alexandrinum 3 Trifolium hybridum 3 Trifolium incarnatum 3 Trifolium resupinatum 3 Trigonella foenum-graecum 2 Vicia faba 2 Vicia pannonica 2 Vicia sativa 2 Vicia villosa 2 Brassica napus var. napobrassica 2 Brassica oleracea convar. acephala var. medullosa + var. viridis 3 Raphanus sativus var. oleiformis 2 .
- SYM 1: Tämä sopii hyvin yhteen sen kanssa , että tähti on vanha , ikä 7 - 11 × 109 vuotta .
Morphology
The form / lemma ratio of SYM is 1.010050 (the average of all parts of speech is 2.067974).
The highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.
The 2nd highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.
The 3rd highest number of forms (1) was observed with the lemma “#”: #.
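The ratio reported above is consistent with dividing the number of distinct forms by the number of lemmas (201 / 199). The sketch below shows one way this could be computed; `form_lemma_ratio` is a hypothetical helper, and grouping distinct forms under each lemma is an assumption about the method, not a description of the actual generator script.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# The three lemmas listed above, with the forms given for each.
listed = [
    ("SRT-8", "SRT#8"), ("SRT-8:ssa", "SRT#8"),
    ("°C", "°C"), ("°C:ta", "°C"),
    ("#", "#"),
]
listed_ratio = form_lemma_ratio(listed)  # 5 forms over 3 lemmas

# Reproducing the page-level figure from the reported totals.
reported_ratio = 201 / 199
print(f"{reported_ratio:.6f}")  # 1.010050
```

Because only two lemmas have more than one form, the ratio stays very close to 1, far below the treebank-wide average.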
SYM occurs with 1 feature: Case (2; 0% instances)
SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par
SYM occurs with 3 feature combinations.
The most frequent feature combination is _ (476 tokens).
Examples: :), %, &, +, :D, ;), 3.Rf3, =, >, 2.f4
Relations
SYM nodes are attached to their parents using 22 different relations: discourse (136; 28% instances), flat:name (94; 20% instances), nmod (51; 11% instances), obj (30; 6% instances), cc (29; 6% instances), appos (28; 6% instances), nsubj (20; 4% instances), obl (17; 4% instances), conj (16; 3% instances), root (11; 2% instances), compound:nn (10; 2% instances), nsubj:cop (8; 2% instances), advcl (6; 1% instances), compound (6; 1% instances), dep (4; 1% instances), amod (3; 1% instances), parataxis (3; 1% instances), acl:relcl (2; 0% instances), advmod (1; 0% instances), case (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances)
Parents of SYM nodes belong to 10 different parts of speech: NOUN (157; 33% instances), VERB (150; 31% instances), SYM (82; 17% instances), ADJ (33; 7% instances), PROPN (26; 5% instances), [root; no parent] (11; 2% instances), NUM (7; 1% instances), ADV (5; 1% instances), PRON (4; 1% instances), X (3; 1% instances)
312 (65%) SYM nodes are leaves.
41 (9%) SYM nodes have one child.
67 (14%) SYM nodes have two children.
58 (12%) SYM nodes have three or more children.
The highest child degree of a SYM node is 14.
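The child-degree breakdown above could be obtained by counting, for every SYM node, how many tokens in the same sentence name it as their head. The following is a minimal sketch under that assumption; `child_degree_buckets` and the toy sentence are invented for illustration, and sentences are represented as simple `(id, upos, head)` tuples instead of full CoNLL-U lines.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="SYM"):
    """Bucket nodes of one UPOS by number of children: 0, 1, 2, or 3 (= three or more)."""
    buckets = Counter()
    for sent in sentences:
        # Number of children per node ID = how often that ID appears as a head.
        n_children = Counter(head for _tok_id, _tok_upos, head in sent)
        for tok_id, tok_upos, _head in sent:
            if tok_upos == upos:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets


# Made-up toy sentence: token 4 (SYM) governs token 3; tokens 2 and 5 are leaves.
toy = [[(1, "ADJ", 0), (2, "SYM", 1), (3, "NUM", 4), (4, "SYM", 1), (5, "SYM", 1)]]
toy_buckets = child_degree_buckets(toy)
print(dict(toy_buckets))

# Sanity check on the reported figures: the four groups cover all 478 SYM tokens.
reported = {"leaves": 312, "one": 41, "two": 67, "three_or_more": 58}
reported_total = sum(reported.values())
```

The four reported groups sum to 478, matching the SYM token count given at the top of the page.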
Children of SYM nodes are attached using 20 different relations: punct (150; 34% instances), flat:name (90; 21% instances), nummod (56; 13% instances), nmod (22; 5% instances), nsubj:cop (18; 4% instances), conj (17; 4% instances), cop (15; 3% instances), cc (13; 3% instances), compound:nn (12; 3% instances), advmod (11; 3% instances), acl:relcl (5; 1% instances), appos (5; 1% instances), compound (4; 1% instances), obl (4; 1% instances), orphan (4; 1% instances), mark (3; 1% instances), acl (2; 0% instances), amod (2; 0% instances), advcl (1; 0% instances), case (1; 0% instances)
Children of SYM nodes belong to 13 different parts of speech: PUNCT (150; 34% instances), SYM (82; 19% instances), NUM (74; 17% instances), NOUN (64; 15% instances), AUX (15; 3% instances), CCONJ (13; 3% instances), ADV (12; 3% instances), VERB (9; 2% instances), ADJ (5; 1% instances), PROPN (5; 1% instances), PRON (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)