home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Galician-PUD: POS Tags: NUM

There are 224 NUM lemmas (5%), 247 NUM types (4%) and 502 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: 2, primeiro, 3, 10, segundo, 1, 4, 6, iii, 20

The 10 most frequent NUM types: dous, primeira, tres, dúas, primeiro, 1, catro, III, segunda, seis

The 10 most frequent ambiguous lemmas: primeiro (NUM 33, ADJ 2, ADV 2), segundo (NUM 11, ADP 10, ADV 1, NOUN 1), mil (NOUN 5, NUM 5), i (NUM 4, NOUN 1), terceiro (NUM 3, NOUN 1)

The 10 most frequent ambiguous types: dous (NUM 23, NOUN 1), primeira (NUM 14, ADJ 1), primeiro (NUM 9, ADJ 1, ADV 1), mil (NUM 5, NOUN 2), I (NUM 4, NOUN 1), segundo (ADP 5, NUM 4), sete (NUM 3, NOUN 1), un (DET 241, PRON 15, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.102679 (the average of all parts of speech is 1.319483).

The 1st highest number of forms (4) was observed with the lemma “primeiro”: primeira, primeiras, primeiro, primeiros.

The 2nd highest number of forms (3) was observed with the lemma “2”: 2, dous, dúas.

The 3rd highest number of forms (2) was observed with the lemma “1”: 1, un.

NUM occurs with 3 features: NumType (487; 97% instances), Gender (9; 2% instances), Number (2; 0% instances)

NUM occurs with 4 feature-value pairs: Gender=Masc, NumType=Card, NumType=Ord, Number=Plur

NUM occurs with 5 feature combinations. The most frequent feature combination is NumType=Card (413 tokens). Examples: dous, tres, dúas, 1, catro, seis, 10, 3, dez, 100

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (306; 61% instances), obl (98; 20% instances), nmod (38; 8% instances), appos (13; 3% instances), conj (12; 2% instances), flat:name (12; 2% instances), compound (9; 2% instances), nsubj (7; 1% instances), fixed (2; 0% instances), obj (2; 0% instances), amod (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (326; 65% instances), VERB (103; 21% instances), PROPN (23; 5% instances), NUM (19; 4% instances), SYM (17; 3% instances), PRON (5; 1% instances), ADJ (4; 1% instances), ADP (1; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances), (1; 0% instances), X (1; 0% instances)

273 (54%) NUM nodes are leaves.

144 (29%) NUM nodes have one child.

46 (9%) NUM nodes have two children.

39 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 15 different relations: case (168; 46% instances), punct (56; 15% instances), det (41; 11% instances), nmod (34; 9% instances), advmod (19; 5% instances), cc (13; 4% instances), conj (11; 3% instances), compound (8; 2% instances), appos (3; 1% instances), amod (2; 1% instances), cop (2; 1% instances), nsubj (2; 1% instances), nummod (2; 1% instances), acl (1; 0% instances), orphan (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: ADP (169; 47% instances), PUNCT (56; 15% instances), DET (41; 11% instances), NOUN (34; 9% instances), ADV (19; 5% instances), NUM (19; 5% instances), CCONJ (15; 4% instances), ADJ (5; 1% instances), AUX (2; 1% instances), PROPN (2; 1% instances), VERB (1; 0% instances)