This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-OOD: POS Tags: X

There are 63 X lemmas (1%), 65 X types (1%) and 89 X tokens (0%). Out of 15 observed tags, the rank of X is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.
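As a rough illustration of how lemma, type and token counts like these could be obtained, the sketch below reads CoNLL-U text and assumes the standard ten-column layout (FORM in column 2, LEMMA in column 3, UPOS in column 4). The function name `count_tag` and the sample rows are hypothetical. The page does not say whether the script that generated it case-folds forms, so the sketch counts them as written.

```python
def count_tag(conllu_text, tag):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == tag:
            lemmas.add(cols[2])   # distinct lemmas
            types.add(cols[1])    # distinct word forms
            tokens += 1           # every occurrence
    return len(lemmas), len(types), tokens


# Invented sample sentence, not taken from the treebank.
sample_rows = [
    ["1", "LIST", "LIST", "X", "_", "Foreign=Yes", "0", "root", "_", "_"],
    ["2", "LIST", "LIST", "X", "_", "Foreign=Yes", "1", "flat:foreign", "_", "_"],
    ["3", "Nix", "Nix", "X", "_", "Foreign=Yes", "1", "flat:foreign", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"],
]
sample = "# text = LIST LIST Nix .\n" + "\n".join("\t".join(r) for r in sample_rows)

counts = count_tag(sample, "X")
print(counts)  # (2, 2, 3)
```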

The 10 most frequent X lemmas: LIST, All, Inclusive, _, author, baimbai, quote, time, #cmoref1, #nature

The 10 most frequent X types: LIST, All, Inclusive, author, baimbai, quote, time, #cmoref1, #nature, Nix

The 10 most frequent ambiguous lemmas: rausim (VERB 1, X 1)

The 10 most frequent ambiguous types: on (AUX 277, VERB 1, X 1)
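An ambiguous type here is a word form that occurs with X and with at least one other tag. The sketch below shows one way such a list could be built from (form, tag) pairs. The helper name `ambiguous_types` is hypothetical, and the input reuses the counts reported above for "on" plus one invented unambiguous entry.

```python
from collections import Counter, defaultdict


def ambiguous_types(form_tag_pairs, tag):
    """Map each form seen with `tag` and at least one other tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in form_tag_pairs:
        by_form[form][upos] += 1
    return {
        form: dict(counts)
        for form, counts in by_form.items()
        if tag in counts and len(counts) > 1
    }


# "on" uses the counts reported on this page; the "LIST" entry is illustrative.
pairs = [("on", "AUX")] * 277 + [("on", "VERB"), ("on", "X"), ("LIST", "X")]
result = ambiguous_types(pairs, "X")
print(result)  # {'on': {'AUX': 277, 'VERB': 1, 'X': 1}}
```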

Morphology

The form / lemma ratio of X is 1.031746 (the average of all parts of speech is 1.566190).
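The reported ratio is consistent with dividing the 65 X types by the 63 X lemmas given earlier on this page. The short check below assumes that is how the figure was derived; the variable names are illustrative.

```python
x_types = 65   # distinct X word forms reported on this page
x_lemmas = 63  # distinct X lemmas reported on this page

ratio = x_types / x_lemmas
print(f"{ratio:.6f}")  # 1.031746
```

A ratio this close to 1 means nearly every X lemma appears in a single form, in line with the per-lemma form counts listed on this page.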

The 1st highest number of forms (3) was observed with the lemma “_”: pap, stä, ’n.

The 2nd highest number of forms (1) was observed with the lemma “#CmoreF1”: #CmoreF1.

The 3rd highest number of forms (1) was observed with the lemma “#ESLOneCologne”: #ESLOneCologne.

X occurs with 3 features: Foreign (67; 75% instances), Case (4; 4% instances), Number (4; 4% instances)

X occurs with 3 feature-value pairs: Case=Nom, Foreign=Yes, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (67 tokens). Examples: LIST, All, Inclusive, author, baimbai, quote, time, #nature, Nix, pekato

Relations

X nodes are attached to their parents using 12 different relations: flat:foreign (28; 31% instances), discourse (24; 27% instances), root (19; 21% instances), compound:nn (5; 6% instances), goeswith (3; 3% instances), nsubj (2; 2% instances), obl (2; 2% instances), parataxis (2; 2% instances), appos (1; 1% instances), conj (1; 1% instances), dep (1; 1% instances), flat:name (1; 1% instances)
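The percentages above can be reproduced from the raw counts, assuming each one is the count divided by the 89 X tokens and rounded to the nearest whole percent. The dictionary below copies the counts from the list above; the variable names are illustrative.

```python
relation_counts = {
    "flat:foreign": 28, "discourse": 24, "root": 19, "compound:nn": 5,
    "goeswith": 3, "nsubj": 2, "obl": 2, "parataxis": 2,
    "appos": 1, "conj": 1, "dep": 1, "flat:name": 1,
}

total = sum(relation_counts.values())  # should equal the number of X tokens
percent = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                    # 89
print(percent["flat:foreign"])  # 31
print(percent["discourse"])     # 27
```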

Parents of X nodes belong to 9 different parts of speech: X (29; 33% instances), [root, no POS] (19; 21% instances), NOUN (15; 17% instances), VERB (13; 15% instances), PROPN (8; 9% instances), NUM (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances), PRON (1; 1% instances)

68 (76%) X nodes are leaves.

9 (10%) X nodes have one child.

4 (4%) X nodes have two children.

8 (9%) X nodes have three or more children.

The highest child degree of an X node is 12.
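The child-degree figures above could be derived from the HEAD column by counting, for each X node, how many nodes name it as their head. The following is a minimal sketch for a single sentence. The function name `child_degree_buckets`, the tuple layout and the sample rows are all hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentence_rows, tag):
    """sentence_rows: (id, upos, head) tuples for one sentence.

    Returns (buckets, max_degree) for the nodes carrying `tag`.
    """
    # Number of children per node id: how often each id appears as a head.
    children = Counter(head for _, _, head in sentence_rows)
    buckets = Counter()
    max_degree = 0
    for node_id, upos, _ in sentence_rows:
        if upos != tag:
            continue
        degree = children[node_id]  # 0 for nodes that are never a head
        max_degree = max(max_degree, degree)
        if degree >= 3:
            buckets["three_or_more"] += 1
        else:
            buckets[("leaf", "one", "two")[degree]] += 1
    return buckets, max_degree


# Invented sentence: node 1 heads nodes 2, 3 and 4.
rows = [(1, "X", 0), (2, "X", 1), (3, "X", 1), (4, "PUNCT", 1)]
buckets, max_degree = child_degree_buckets(rows, "X")
print(dict(buckets), max_degree)
```

Summing the buckets over all sentences would give treebank-wide totals. The four counts reported above (68, 9, 4 and 8) add up to the 89 X tokens.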

Children of X nodes are attached using 12 different relations: flat:foreign (33; 41% instances), punct (28; 35% instances), obl (6; 7% instances), discourse (4; 5% instances), cop (2; 2% instances), nsubj:cop (2; 2% instances), advmod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), flat:name (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: X (29; 36% instances), PUNCT (28; 35% instances), NOUN (6; 7% instances), NUM (6; 7% instances), SYM (4; 5% instances), AUX (2; 2% instances), INTJ (2; 2% instances), ADV (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances)