Treebank Statistics: UD_Italian-VIT: POS Tags: X
There are 215 X
lemmas (1%), 215 X
types (1%) and 392 X
tokens (0%).
Out of 17 observed tags, the rank of X
is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: joint, venture, work, baby, cd, personal, sitter, station, computer, condicio
The 10 most frequent X
types: joint, venture, station, work, baby, cd, personal, sitter, computer, condicio
The 10 most frequent ambiguous lemmas: work (X 9, NOUN 2), personal (NOUN 8, X 7, ADJ 1), station (X 7, NOUN 2), computer (NOUN 21, X 6), facile (ADJ 18, X 5), bond (X 4, NOUN 3), news (X 4, NOUN 1), top (NOUN 4, X 4, ADJ 2), business (NOUN 5, X 3), pro (X 3, ADP 2, ADV 1)
The 10 most frequent ambiguous types: station (X 8, NOUN 2), work (X 8, NOUN 2), personal (NOUN 8, X 7, ADJ 1), computer (NOUN 21, X 6), facile (ADJ 17, X 5), bond (X 4, NOUN 2), c’ (PRON 174, X 3), news (X 4, NOUN 1), top (X 4, NOUN 3, ADJ 2), business (NOUN 5, X 3)
- station
- work
- personal
- NOUN 8: A scrivere l’ indirizzo su la busta quando si deve inviare una lettera a una sola persona , perché prenderebbe troppo tempo far lo con la stampante di il personal “ .
- X 7: Quale personal ?
- ADJ 1: Questa è una cifra che di per sé spiega l’ interesse per la multimedialità interattiva da parte di le aziende che operano in il campo di l’ elettronica di consumo e di i personal computer .
- computer
- facile
- bond
- X 4: Il prezzo e la cedola di i titoli saranno fissati oggi , garantendo un premio di rendimento in l’ area di 350 basis point ( il 3,5 % ) su il treasury bond di riferimento .
- NOUN 2: A metà seduta , il bond trentennale era in rialzo di 25,32 e aveva toccato quota , pari a un rendimento di il 7,36 per cento .
- c’
- news
- top
- X 4: Niente top issime , ma le top model ci saranno .
- NOUN 3: Niente top issime , ma le top model ci saranno .
- ADJ 2: Durante un convegno sponsorizzato da la rivista Forbes a Annapolis , molti top managers hanno detto di puntare su i mercati esteri come motore per la crescita e di non aspettar si per il momento una ripresa forte di i consumi in gli Stati Uniti .
- business
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.502411).
The 1st highest number of forms (2) was observed with the lemma “work”: station, work.
The 2nd highest number of forms (1) was observed with the lemma “ANTIHOOLIGANS”: antihooligans.
The 3rd highest number of forms (1) was observed with the lemma “Allons”: allons.
X
occurs with 3 features: Foreign (383; 98% instances), Gender (8; 2% instances), Number (3; 1% instances)
X
occurs with 4 feature-value pairs: Foreign=Yes
, Gender=Fem
, Gender=Masc
, Number=Sing
X
occurs with 6 feature combinations.
The most frequent feature combination is Foreign=Yes
(375 tokens).
Examples: joint, venture, station, work, baby, cd, sitter, personal, computer, condicio
Relations
X
nodes are attached to their parents using 15 different relations: flat:foreign (170; 43% instances), nmod (93; 24% instances), obl (25; 6% instances), nsubj (20; 5% instances), obj (17; 4% instances), conj (14; 4% instances), root (13; 3% instances), appos (12; 3% instances), compound (12; 3% instances), flat (4; 1% instances), flat:name (4; 1% instances), nsubj:pass (3; 1% instances), parataxis (2; 1% instances), xcomp (2; 1% instances), acl:relcl (1; 0% instances)
Parents of X
nodes belong to 9 different parts of speech: X (178; 45% instances), NOUN (107; 27% instances), VERB (64; 16% instances), PROPN (18; 5% instances), (13; 3% instances), NUM (4; 1% instances), ADJ (3; 1% instances), PRON (3; 1% instances), ADV (2; 1% instances)
189 (48%) X
nodes are leaves.
36 (9%) X
nodes have one child.
45 (11%) X
nodes have two children.
122 (31%) X
nodes have three or more children.
The highest child degree of a X
node is 11.
Children of X
nodes are attached using 24 different relations: flat:foreign (170; 26% instances), det (111; 17% instances), case (100; 16% instances), punct (96; 15% instances), nmod (46; 7% instances), amod (20; 3% instances), appos (13; 2% instances), acl:relcl (11; 2% instances), cc (9; 1% instances), conj (8; 1% instances), flat (8; 1% instances), nummod (8; 1% instances), advmod (6; 1% instances), compound (6; 1% instances), advcl (5; 1% instances), cop (5; 1% instances), flat:name (4; 1% instances), nsubj (4; 1% instances), acl (3; 0% instances), det:poss (3; 0% instances), csubj (2; 0% instances), parataxis (2; 0% instances), aux (1; 0% instances), obl:agent (1; 0% instances)
Children of X
nodes belong to 14 different parts of speech: X (178; 28% instances), DET (114; 18% instances), ADP (101; 16% instances), PUNCT (96; 15% instances), NOUN (49; 8% instances), ADJ (23; 4% instances), VERB (23; 4% instances), PROPN (21; 3% instances), NUM (11; 2% instances), CCONJ (9; 1% instances), ADV (7; 1% instances), AUX (6; 1% instances), PRON (2; 0% instances), SYM (2; 0% instances)