Treebank Statistics: UD_French-PUD: POS Tags: NUM
There are 229 NUM
lemmas (5%), 230 NUM
types (4%) and 451 NUM
tokens (2%).
Out of 15 observed tags, the rank of NUM
is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
The 10 most frequent NUM
lemmas: deux, trois, quatre, 1, 3, milliard, 10, II, III, dix
The 10 most frequent NUM
types: deux, trois, quatre, 1, 3, 10, II, III, dix, milliards
The 10 most frequent ambiguous lemmas: milliard (NUM 7, NOUN 2), un (DET 659, PRON 11, NUM 5), 1er (NUM 3, ADJ 1), premier (ADJ 36, NUM 1)
The 10 most frequent ambiguous types: milliards (NUM 6, NOUN 1), un (DET 225, PRON 10, NUM 5), 1er (NUM 3, ADJ 1), milliard (NOUN 1, NUM 1), premier (ADJ 6, NUM 1)
- milliards
- un
- 1er
- milliard
- NOUN 1: En échange , Uber recevra un milliard de dollars d’ investissement et un siège à le conseil d’ administration de l’ entreprise chinoise .
- NUM 1: Les directeurs ont également reçu une « prime de rendement » pour atteinte ou dépassement de leurs objectifs ; ils se sont ainsi partagé une enveloppe d’ un milliard et demi de dollars , ce qui équivaut à 15 000 dollars en moyenne par personne .
- premier
Morphology
The form / lemma ratio of NUM
is 1.004367 (the average of all parts of speech is 1.300126).
The 1st highest number of forms (2) was observed with the lemma “milliard”: milliard, milliards.
The 2nd highest number of forms (1) was observed with the lemma “0”: 0.
The 3rd highest number of forms (1) was observed with the lemma “06h30”: 06h30.
NUM
occurs with 3 features: Gender (4; 1% instances), Number (4; 1% instances), Typo (1; 0% instances)
NUM
occurs with 3 feature-value pairs: Gender=Masc
, Number=Sing
, Typo=Yes
NUM
occurs with 3 feature combinations.
The most frequent feature combination is _
(446 tokens).
Examples: deux, trois, quatre, 1, 3, 10, II, III, dix, milliards
Relations
NUM
nodes are attached to their parents using 9 different relations: nummod (218; 48% instances), obl (90; 20% instances), nmod (70; 16% instances), obl:mod (32; 7% instances), appos (25; 6% instances), conj (9; 2% instances), nsubj (4; 1% instances), orphan (2; 0% instances), obj (1; 0% instances)
Parents of NUM
nodes belong to 8 different parts of speech: NOUN (258; 57% instances), VERB (107; 24% instances), SYM (35; 8% instances), NUM (23; 5% instances), PROPN (22; 5% instances), ADJ (2; 0% instances), ADV (2; 0% instances), PRON (2; 0% instances)
240 (53%) NUM
nodes are leaves.
118 (26%) NUM
nodes have one child.
60 (13%) NUM
nodes have two children.
33 (7%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 13 different relations: case (134; 39% instances), punct (66; 19% instances), nmod (45; 13% instances), advmod (37; 11% instances), det (30; 9% instances), cc (9; 3% instances), conj (9; 3% instances), nummod (8; 2% instances), appos (2; 1% instances), amod (1; 0% instances), goeswith (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)
Children of NUM
nodes belong to 11 different parts of speech: ADP (132; 38% instances), PUNCT (66; 19% instances), ADV (39; 11% instances), NOUN (34; 10% instances), DET (30; 9% instances), NUM (23; 7% instances), CCONJ (9; 3% instances), PROPN (8; 2% instances), ADJ (1; 0% instances), PRON (1; 0% instances), X (1; 0% instances)