This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: POS Tags: NUM

There are 209 NUM lemmas (4%), 209 NUM types (3%) and 356 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

The 10 most frequent NUM types: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

The 10 most frequent ambiguous lemmas: zwei (NUM 23, NOUN 1), 3 (NUM 7, ADJ 1), 1 (NUM 5, ADJ 1, NOUN 1), 10 (NUM 5, NOUN 1), sieben (NUM 3, NOUN 1), - (PUNCT 178, NUM 2, CCONJ 1), 16 (NUM 2, ADJ 1), 31 (NOUN 2, NUM 2), 6 (NUM 2, ADJ 1), 21 (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: 3 (NUM 7, ADJ 1), 1 (NUM 5, NOUN 1), - (PUNCT 178, NUM 2, CCONJ 1), 16 (NUM 2, ADJ 1), 31 (NOUN 2, NUM 2), 21. (ADJ 1, NUM 1), 4 (ADJ 1, NUM 1), III (NOUN 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.198007).

The 1st highest number of forms (2) was observed with the lemma “zwei”: zwei, zweier.

The 2nd highest number of forms (1) was observed with the lemma “-”: -.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
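The ranking of lemmas by number of forms can be sketched by grouping word forms under their lemma. The example below is illustrative only: the `PAIRS` list and both function names are hypothetical, and the ratio is assumed to be the number of distinct forms divided by the number of distinct lemmas, which may differ from the definition used to generate this page.

```python
from collections import defaultdict

# Hypothetical (form, lemma) pairs for NUM tokens; illustrative only.
PAIRS = [("zwei", "zwei"), ("zweier", "zwei"), ("zwei", "zwei"),
         ("-", "-"), ("1", "1")]


def forms_per_lemma(pairs):
    """Return (lemma, sorted forms) sorted by descending form count, then lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return sorted(
        ((lemma, sorted(fs)) for lemma, fs in forms.items()),
        key=lambda item: (-len(item[1]), item[0]),
    )


def form_lemma_ratio(pairs):
    """Assumed definition: distinct forms divided by distinct lemmas."""
    distinct_forms = {form for form, _ in pairs}
    distinct_lemmas = {lemma for _, lemma in pairs}
    return len(distinct_forms) / len(distinct_lemmas)


for lemma, forms in forms_per_lemma(PAIRS):
    print(lemma, len(forms), ", ".join(forms))
```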

NUM occurs with 4 features: NumType (356; 100% instances), Case (4; 1% instances), Gender (4; 1% instances), Number (4; 1% instances)

NUM occurs with 9 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Gender=Fem, Gender=Masc, NumType=Card, NumType=Ord, Number=Plur, Number=Sing

NUM occurs with 5 feature combinations. The most frequent feature combination is NumType=Card (352 tokens). Examples: zwei, drei, vier, 3, sechs, zehn, 1, 10, 50, 100

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (227; 64% instances), obl:tmod (69; 19% instances), obl (17; 5% instances), nmod (14; 4% instances), conj (12; 3% instances), compound (10; 3% instances), nsubj (4; 1% instances), amod (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances)
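The percentages above are each relation's share of the 356 NUM tokens, rounded to the nearest whole percent. This is why a relation observed once is reported as 0%. A short check using the counts listed above (the variable names are hypothetical):

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 227, "obl:tmod": 69, "obl": 17, "nmod": 14, "conj": 12,
    "compound": 10, "nsubj": 4, "amod": 1, "nsubj:pass": 1, "obj": 1,
}

total = sum(relation_counts.values())  # should equal the number of NUM tokens

# Share of each relation, rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```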

Parents of NUM nodes belong to 6 different parts of speech: NOUN (245; 69% instances), VERB (66; 19% instances), SYM (22; 6% instances), NUM (16; 4% instances), ADJ (5; 1% instances), PROPN (2; 1% instances)

255 (72%) NUM nodes are leaves.

74 (21%) NUM nodes have one child.

17 (5%) NUM nodes have two children.

10 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
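The child degree of a node is the number of tokens whose HEAD points to it, and a leaf is a node with degree 0. The sketch below computes the degree distribution for one UPOS tag within a single sentence; the `SENTENCE` data and the function name are hypothetical and only illustrate the counting.

```python
from collections import Counter

# Hypothetical sentence as (id, head, upos) triples; head 0 marks the root.
SENTENCE = [(1, 2, "ADV"), (2, 3, "NUM"), (3, 0, "NOUN"), (4, 3, "NUM")]


def child_degree_histogram(sentence, upos):
    """Map child degree -> number of nodes with that degree, for one UPOS tag."""
    # Count how many tokens name each node id as their head.
    n_children = Counter(head for _, head, _ in sentence)
    # A Counter returns 0 for ids that never appear as a head (leaves).
    return Counter(
        n_children[node_id] for node_id, _, tag in sentence if tag == upos
    )


hist = child_degree_histogram(SENTENCE, "NUM")
print(dict(hist))
```

Here node 2 has one child (the adverb) and node 4 is a leaf. Over a whole treebank, the histograms of all sentences would be summed.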

Children of NUM nodes are attached using 13 different relations: advmod (38; 26% instances), case (31; 21% instances), punct (27; 19% instances), cc (11; 8% instances), conj (11; 8% instances), nmod (11; 8% instances), compound (5; 3% instances), det (3; 2% instances), cop (2; 1% instances), nsubj (2; 1% instances), obl:tmod (2; 1% instances), acl:relcl (1; 1% instances), cc:preconj (1; 1% instances)

Children of NUM nodes belong to 12 different parts of speech: ADV (37; 26% instances), ADP (31; 21% instances), PUNCT (27; 19% instances), NUM (16; 11% instances), CCONJ (12; 8% instances), PROPN (8; 6% instances), NOUN (6; 4% instances), DET (3; 2% instances), AUX (2; 1% instances), ADJ (1; 1% instances), PART (1; 1% instances), VERB (1; 1% instances)