Treebank Statistics: UD_Latin-LLCT: POS Tags: NUM
There are 31 NUM
lemmas (1%), 82 NUM
types (1%) and 1501 NUM
tokens (1%).
Out of 15 observed tags, the rank of NUM
is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: duo, uiginti, triginta, quinquaginta, tres, quattuor, decem, sex, duodecim, quinque
The 10 most frequent NUM
types: duas, duo, viginti, triginta, tres, quinquaginta, decem, sex, quattuor, quinque
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 2.645161 (the average of all parts of speech is 2.623423).
The 1st highest number of forms (9) was observed with the lemma “duo”: dua, duabus, duae, duas, due, dues, duo, duobus, duos.
The 2nd highest number of forms (8) was observed with the lemma “quadraginta”: quadraginta, quaraginta, quatragenta, quatragentas, quatraginta, quatragintas, quatraientas, quatroginta.
The 3rd highest number of forms (5) was observed with the lemma “sexaginta”: sesaginta, sessaginta, sexaginta, sexagintam, sexsaginta.
NUM
occurs with 5 features: NumForm (1501; 100% instances), NumType (1501; 100% instances), Case (629; 42% instances), Gender (629; 42% instances), Number (629; 42% instances)
NUM
occurs with 10 feature-value pairs: Case=Abl
, Case=Acc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumForm=Roman
, NumForm=Word
, NumType=Card
, Number=Plur
NUM
occurs with 11 feature combinations.
The most frequent feature combination is NumForm=Word|NumType=Card
(870 tokens).
Examples: viginti, triginta, quinquaginta, decem, sex, quattuor, quinque, duodecim, centum, octo
Relations
NUM
nodes are attached to their parents using 8 different relations: nummod (1361; 91% instances), conj (74; 5% instances), flat (59; 4% instances), xcomp (3; 0% instances), acl:relcl (1; 0% instances), advcl:cmp (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)
Parents of NUM
nodes belong to 6 different parts of speech: NOUN (1360; 91% instances), NUM (130; 9% instances), VERB (6; 0% instances), DET (3; 0% instances), ADJ (1; 0% instances), (1; 0% instances)
1283 (85%) NUM
nodes are leaves.
208 (14%) NUM
nodes have one child.
7 (0%) NUM
nodes have two children.
3 (0%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 7.
Children of NUM
nodes are attached using 13 different relations: conj (79; 33% instances), cc (75; 31% instances), flat (59; 25% instances), nmod (12; 5% instances), advmod (4; 2% instances), cop (2; 1% instances), obl (2; 1% instances), punct (2; 1% instances), advmod:emph (1; 0% instances), dep (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)
Children of NUM
nodes belong to 12 different parts of speech: NUM (130; 54% instances), CCONJ (76; 32% instances), NOUN (16; 7% instances), DET (5; 2% instances), ADV (4; 2% instances), AUX (2; 1% instances), PUNCT (2; 1% instances), ADJ (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances), VERB (1; 0% instances), X (1; 0% instances)