home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-VIT: POS Tags: X

There are 220 X lemmas (1%), 219 X types (1%) and 402 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: joint, venture, work, personal, baby, cd, computer, sitter, station, condicio

The 10 most frequent X types: joint, venture, personal, station, work, baby, cd, computer, sitter, condicio

The 10 most frequent ambiguous lemmas: work (X 9, NOUN 2), personal (NOUN 8, X 8), computer (NOUN 20, X 7), station (X 7, NOUN 2), facile (ADJ 18, X 5), bond (X 4, NOUN 3), top (NOUN 4, X 4, ADJ 2), business (NOUN 5, X 3), pro (X 3, ADP 2, ADV 1), the (X 3, DET 2)

The 10 most frequent ambiguous types: personal (NOUN 8, X 8), station (X 8, NOUN 2), work (X 8, NOUN 2), computer (NOUN 20, X 7), facile (ADJ 17, X 5), bond (X 4, NOUN 2), c’ (PRON 174, X 3), top (X 4, NOUN 3, ADJ 2), business (NOUN 5, X 3), pro (X 3, ADP 2, ADV 1)

Morphology

The form / lemma ratio of X is 0.995455 (the average of all parts of speech is 1.501662).

The 1st highest number of forms (2) was observed with the lemma “work”: station, work.

The 2nd highest number of forms (1) was observed with the lemma “ANTIHOOLIGANS”: antihooligans.

The 3rd highest number of forms (1) was observed with the lemma “Allons”: allons.

X occurs with 3 features: Foreign (393; 98% instances), Gender (59; 15% instances), Number (27; 7% instances)

X occurs with 5 feature-value pairs: Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

X occurs with 8 feature combinations. The most frequent feature combination is Foreign=Yes (334 tokens). Examples: joint, venture, baby, cd, sitter, rom, condicio, par, est, facile

Relations

X nodes are attached to their parents using 15 different relations: flat:foreign (178; 44% instances), nmod (94; 23% instances), obl (25; 6% instances), nsubj (20; 5% instances), obj (17; 4% instances), conj (15; 4% instances), appos (13; 3% instances), compound (13; 3% instances), root (13; 3% instances), flat:name (4; 1% instances), nsubj:pass (3; 1% instances), flat (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: X (180; 45% instances), NOUN (110; 27% instances), VERB (65; 16% instances), PROPN (22; 5% instances), (13; 3% instances), NUM (4; 1% instances), ADJ (3; 1% instances), PRON (3; 1% instances), ADV (2; 0% instances)

197 (49%) X nodes are leaves.

35 (9%) X nodes have one child.

48 (12%) X nodes have two children.

122 (30%) X nodes have three or more children.

The highest child degree of a X node is 11.

Children of X nodes are attached using 25 different relations: flat:foreign (174; 27% instances), det (111; 17% instances), case (100; 15% instances), punct (97; 15% instances), nmod (48; 7% instances), amod (19; 3% instances), appos (13; 2% instances), acl:relcl (11; 2% instances), cc (10; 2% instances), conj (8; 1% instances), nummod (8; 1% instances), advmod (6; 1% instances), compound (6; 1% instances), flat:name (6; 1% instances), advcl (5; 1% instances), cop (5; 1% instances), nsubj (4; 1% instances), acl (3; 0% instances), det:poss (3; 0% instances), fixed (3; 0% instances), csubj (2; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), aux (1; 0% instances), obl:agent (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: X (180; 28% instances), DET (114; 18% instances), ADP (103; 16% instances), PUNCT (97; 15% instances), NOUN (50; 8% instances), VERB (23; 4% instances), ADJ (22; 3% instances), PROPN (21; 3% instances), NUM (11; 2% instances), CCONJ (10; 2% instances), ADV (7; 1% instances), AUX (6; 1% instances), PRON (2; 0% instances), SYM (2; 0% instances)