Treebank Statistics: UD_Italian-TWITTIRO: POS Tags: NUM
There are 130 NUM
lemmas (3%), 131 NUM
types (2%) and 300 NUM
tokens (1%).
Out of 16 observed tags, the rank of NUM
is: 7 in number of lemmas, 7 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: due, 3, 2, mila, tre, 5, 1, 12, 7, 10
The 10 most frequent NUM
types: due, 3, 2, mila, tre, 5, 1, 12, 7, 10
The 10 most frequent ambiguous lemmas: 5 (NUM 9, PROPN 1), 4 (NUM 3, PROPN 1), quattro (NUM 3, ADP 1), guli1979 (NUM 2, SYM 1), @user1 (SYM 76, NUM 1), uno (DET 416, PRON 12, NUM 1)
The 10 most frequent ambiguous types: 3 (NUM 16, ADJ 1), 5 (NUM 9, PROPN 1), 1 (NUM 8, DET 1), 6 (NUM 6, AUX 1), 4 (NUM 3, PROPN 1), guli1979 (NUM 2, SYM 1), @user1 (SYM 76, NUM 1), licei (NOUN 1, NUM 1), sei (AUX 13, NUM 1), sette (NOUN 1, NUM 1)
- 3
- 5
- 1
- 6
- 4
- guli1979
- @user1
- licei
- sei
- sette
Morphology
The form / lemma ratio of NUM
is 1.007692 (the average of all parts of speech is 1.274961).
The 1st highest number of forms (2) was observed with the lemma “venti”: vent’, vent’.
The 2nd highest number of forms (1) was observed with the lemma “’10”: ‘10.
The 3rd highest number of forms (1) was observed with the lemma “’50”: ‘50.
NUM
occurs with 1 features: NumType (298; 99% instances)
NUM
occurs with 1 feature-value pairs: NumType=Card
NUM
occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card
(298 tokens).
Examples: due, 3, 2, mila, tre, 1, 12, 5, 7, 10
Relations
NUM
nodes are attached to their parents using 13 different relations: nummod (221; 74% instances), obl (23; 8% instances), flat (19; 6% instances), conj (9; 3% instances), nmod (7; 2% instances), parataxis (7; 2% instances), nsubj (3; 1% instances), nsubj:pass (3; 1% instances), obj (3; 1% instances), flat:name (2; 1% instances), dislocated (1; 0% instances), orphan (1; 0% instances), vocative:mention (1; 0% instances)
Parents of NUM
nodes belong to 9 different parts of speech: NOUN (197; 66% instances), VERB (43; 14% instances), NUM (24; 8% instances), SYM (13; 4% instances), PROPN (10; 3% instances), ADJ (6; 2% instances), PRON (3; 1% instances), INTJ (2; 1% instances), X (2; 1% instances)
207 (69%) NUM
nodes are leaves.
45 (15%) NUM
nodes have one child.
35 (12%) NUM
nodes have two children.
13 (4%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 12 different relations: case (42; 26% instances), det (35; 22% instances), punct (30; 19% instances), flat (27; 17% instances), conj (7; 4% instances), nmod (6; 4% instances), advmod (5; 3% instances), cc (3; 2% instances), amod (2; 1% instances), acl (1; 1% instances), fixed (1; 1% instances), nummod (1; 1% instances)
Children of NUM
nodes belong to 11 different parts of speech: ADP (41; 26% instances), DET (35; 22% instances), PUNCT (30; 19% instances), NUM (24; 15% instances), NOUN (15; 9% instances), ADV (6; 4% instances), CCONJ (3; 2% instances), ADJ (2; 1% instances), SYM (2; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)