home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-FTB: POS Tags: X

There are 271 X lemmas (1%), 276 X types (1%) and 311 X tokens (0%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 12 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _, 70-, in, sosiaali-, the, ala-, kauppa-, keng-, maa-, 50-

The 10 most frequent X types: 70-, in, sosiaali-, the, Kauppa-, ala-, keng-, maa-, 50-, Lilla

The 10 most frequent ambiguous lemmas: the (X 4, PROPN 1), out (NOUN 2, X 2), home (NOUN 3, X 1), is (PROPN 2, X 1), made (NOUN 1, X 1), me (DET 1, X 1), new (PROPN 8, X 1), partners (PROPN 1, X 1), queen (PROPN 1, X 1), ride (PROPN 1, X 1)

The 10 most frequent ambiguous types: out (NOUN 1, X 1), New (PROPN 8, X 1), Ride (PROPN 1, X 1), hyvä (ADJ 128, X 1), m- (PRON 1, X 1), me (PRON 123, VERB 1, X 1), sana (NOUN 11, X 1), se- (PRON 1, X 1), termi (NOUN 2, X 1), yhdistelmä (NOUN 2, X 1)

Morphology

The form / lemma ratio of X is 1.018450 (the average of all parts of speech is 2.048736).

The 1st highest number of forms (7) was observed with the lemma “_”: hyvä, ihogeeli, sana, skertso, tyyppistä, veellä, yhdistelmä.

The 2nd highest number of forms (1) was observed with the lemma “10-”: 10-.

The 3rd highest number of forms (1) was observed with the lemma “100-”: 100-.

X occurs with 1 features: Foreign (129; 41% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (182 tokens). Examples: 70-, sosiaali-, Kauppa-, ala-, keng-, maa-, 50-, Vesi-, e-, kieli-

Relations

X nodes are attached to their parents using 23 different relations: nmod (67; 22% instances), amod (37; 12% instances), conj (35; 11% instances), root (33; 11% instances), dep (31; 10% instances), nsubj (29; 9% instances), obj (17; 5% instances), reparandum (17; 5% instances), flat (9; 3% instances), goeswith (7; 2% instances), compound:nn (5; 2% instances), nsubj:cop (5; 2% instances), ccomp (4; 1% instances), advcl (3; 1% instances), case (2; 1% instances), obl:agent (2; 1% instances), xcomp (2; 1% instances), acl (1; 0% instances), cc (1; 0% instances), compound:prt (1; 0% instances), csubj:cop (1; 0% instances), nmod:gobj (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: VERB (85; 27% instances), X (83; 27% instances), NOUN (80; 26% instances), (33; 11% instances), PROPN (17; 5% instances), ADJ (9; 3% instances), PRON (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances)

112 (36%) X nodes are leaves.

148 (48%) X nodes have one child.

18 (6%) X nodes have two children.

33 (11%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 21 different relations: conj (144; 46% instances), punct (52; 17% instances), amod (27; 9% instances), nmod (19; 6% instances), dep (14; 5% instances), flat (8; 3% instances), cop (7; 2% instances), acl (6; 2% instances), nsubj:cop (6; 2% instances), advmod (5; 2% instances), cc (5; 2% instances), nsubj (5; 2% instances), case (3; 1% instances), aux (2; 1% instances), vocative (2; 1% instances), compound:nn (1; 0% instances), compound:prt (1; 0% instances), csubj:cop (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: NOUN (104; 33% instances), X (83; 27% instances), PUNCT (52; 17% instances), ADJ (18; 6% instances), VERB (14; 5% instances), PROPN (13; 4% instances), AUX (9; 3% instances), ADV (5; 2% instances), PRON (5; 2% instances), CCONJ (4; 1% instances), ADP (1; 0% instances), DET (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)