
This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Greek-PROIEL: POS Tags: NUM

There are 70 NUM lemmas (1%), 165 NUM types (0%) and 1639 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 12 in number of tokens.
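As a rough illustration of how the three counts above relate, the following sketch tallies distinct lemmas, distinct word forms (types) and running tokens for one tag in CoNLL-U text. The function name `pos_counts` and the toy rows are hypothetical and not taken from the treebank; whether the statistics page compares forms case-sensitively is an assumption here.

```python
def pos_counts(conllu_text, upos="NUM"):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges (1-2)
        # and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)   # distinct lemmas
            types.add(form)     # distinct forms, compared as written (assumption)
            tokens += 1         # every occurrence counts
    return len(lemmas), len(types), tokens


# Invented toy rows (not from UD_Ancient_Greek-PROIEL), joined with tabs.
TOY_ROWS = [
    ("# sent_id = toy-1",),
    ("1", "δύο", "δύο", "NUM", "_", "_", "2", "nummod", "_", "_"),
    ("2", "ἄνδρες", "ἀνήρ", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("# sent_id = toy-2",),
    ("1", "ἕνα", "εἷς", "NUM", "_", "_", "2", "nummod", "_", "_"),
    ("2", "ἄνδρα", "ἀνήρ", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("# sent_id = toy-3",),
    ("1", "μίαν", "εἷς", "NUM", "_", "_", "2", "nummod", "_", "_"),
    ("2", "ἡμέραν", "ἡμέρα", "NOUN", "_", "_", "0", "root", "_", "_"),
]
TOY_CONLLU = "\n".join("\t".join(row) for row in TOY_ROWS)

print(pos_counts(TOY_CONLLU))  # (lemmas, types, tokens) for NUM
```

In the toy data, two forms (ἕνα, μίαν) share the lemma εἷς, which is why the type count can exceed the lemma count.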

The 10 most frequent NUM lemmas: εἷς, δύο, ἑπτά, τρεῖς, δώδεκα, τέσσαρες, πέντε, δέκα, ἑκατόν, εἴκοσι

The 10 most frequent NUM types: δύο, εἷς, ἑπτὰ, δώδεκα, πέντε, δέκα, τρεῖς, ἓν, ἕνα, μίαν

The 10 most frequent ambiguous lemmas: εἷς (NUM 410, ADJ 1), διακόσιοι (NUM 27, ADJ 2), τετρακόσιοι (NUM 16, ADJ 2), καὶ (NUM 15, ADJ 1), δέκατος (ADJ 8, NUM 2), τε (CCONJ 1392, ADV 36, NUM 2), μυρίος (ADJ 8, NUM 1), χίλιος (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: εἷς (NUM 103, ADP 1), ἑνὶ (NUM 26, ADJ 1), καὶ (CCONJ 10526, ADV 1198, NUM 15, ADJ 1), τε (CCONJ 1377, ADV 34, NUM 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (AUX 6, NUM 1), τετρακοσίας (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 2.357143 (the average of all parts of speech is 3.440021).
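The reported ratio is consistent with dividing the 165 NUM types by the 70 NUM lemmas given at the top of the page. A minimal check, with `form_lemma_ratio` as a hypothetical helper name:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


# Figures taken from this page: 165 NUM types, 70 NUM lemmas.
ratio = form_lemma_ratio(165, 70)
print(f"{ratio:.6f}")
```

A value below the all-POS average of 3.440021 suggests that numerals in this treebank inflect less than the typical word class, which fits the many indeclinable cardinals such as δύο and πέντε.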

The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.

The 2nd highest number of forms (12) was observed with the lemma “διακόσιοι”: διακοσίας, διακοσίους, διακοσίων, διακόσιαι, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.

The 3rd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.

NUM occurs with 3 features: Case (827; 50% instances), Number (827; 50% instances), Gender (794; 48% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (812 tokens). Examples: δύο, ἑπτὰ, δώδεκα, πέντε, δέκα, εἴκοσι, ἑκατὸν, τεσσεράκοντα, τριήκοντα, πεντήκοντα
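The feature figures above fit together: 812 tokens carry no features at all and 827 carry Case and Number, which sums to the 1639 NUM tokens. The sketch below shows one way such tallies could be derived from the FEATS column of CoNLL-U rows. The name `feature_stats` and the toy input are hypothetical, and the page's own counting script may differ.

```python
from collections import Counter


def feature_stats(feats_values):
    """Tally features, feature-value pairs and full combinations.

    feats_values: iterable of FEATS column strings, one per token,
    e.g. "Case=Acc|Gender=Masc|Number=Sing" or "_" for no features.
    """
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1            # the whole string is one combination
        if feats == "_":
            continue                  # underscore means "no features"
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1       # e.g. Case
            pairs[pair] += 1          # e.g. Case=Acc
    return features, pairs, combos


# Invented toy input, not treebank data.
toy = [
    "_",
    "_",
    "Case=Acc|Gender=Masc|Number=Sing",
    "Case=Acc|Gender=Fem|Number=Sing",
]
features, pairs, combos = feature_stats(toy)
print(features["Case"], len(pairs), combos.most_common(1))

# Consistency check on the page's own figures.
assert 812 + 827 == 1639
```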

Relations

NUM nodes are attached to their parents using 21 different relations: nummod (982; 60% instances), conj (127; 8% instances), nsubj (112; 7% instances), obj (68; 4% instances), fixed (65; 4% instances), obl (64; 4% instances), root (42; 3% instances), orphan (34; 2% instances), obl:arg (29; 2% instances), xcomp (21; 1% instances), nmod (20; 1% instances), nsubj:pass (18; 1% instances), appos (17; 1% instances), advcl (13; 1% instances), parataxis (9; 1% instances), dislocated (6; 0% instances), obl:agent (4; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), advcl:cmp (2; 0% instances), csubj:pass (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (966; 59% instances), VERB (310; 19% instances), NUM (191; 12% instances), ADJ (60; 4% instances), [no tag: artificial root] (42; 3% instances), PRON (30; 2% instances), PROPN (18; 1% instances), ADV (10; 1% instances), ADP (5; 0% instances), AUX (4; 0% instances), SCONJ (3; 0% instances)
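In the parent list, the entry without a tag has 42 instances, the same number as the root relation above, which is what one would expect if the artificial root node (HEAD 0) has no part of speech. A minimal sketch of such a tally, assuming simplified hypothetical rows of the form (id, upos, head, deprel) rather than full CoNLL-U lines:

```python
from collections import Counter


def attachment_stats(sentence, upos="NUM"):
    """Count incoming relations and parent POS tags for one sentence.

    sentence: list of (id, upos, head, deprel) tuples (simplified,
    hypothetical row format).
    """
    pos_by_id = {tok_id: tag for tok_id, tag, _head, _rel in sentence}
    relations, parent_pos = Counter(), Counter()
    for _tok_id, tag, head, deprel in sentence:
        if tag != upos:
            continue
        relations[deprel] += 1
        # HEAD 0 is the artificial root; it has no POS tag, so it is
        # recorded under the empty string.
        parent_pos[pos_by_id.get(head, "")] += 1
    return relations, parent_pos


# Invented toy sentences.
modifier = [(1, "NUM", 2, "nummod"), (2, "NOUN", 0, "root")]
num_as_root = [(1, "NUM", 0, "root"), (2, "ADV", 1, "advmod")]

print(attachment_stats(modifier))
print(attachment_stats(num_as_root))
```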

998 (61%) NUM nodes are leaves.

379 (23%) NUM nodes have one child.

179 (11%) NUM nodes have two children.

83 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 11.
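The four degree classes above partition the NUM tokens (998 + 379 + 179 + 83 = 1639), and the percentages follow from dividing each count by 1639. The sketch below shows one way to build such a histogram for a single sentence, again assuming simplified hypothetical rows of the form (id, upos, head).

```python
from collections import Counter


def child_degree_histogram(sentence, upos="NUM"):
    """Map number-of-children -> how many `upos` nodes have that many.

    sentence: list of (id, upos, head) tuples (simplified, hypothetical
    row format).
    """
    # Number of dependents per head ID; missing keys count as 0.
    children = Counter(head for _tok_id, _tag, head in sentence)
    return Counter(children[tok_id] for tok_id, tag, _head in sentence
                   if tag == upos)


# Invented toy sentence: two NUM nodes with one child each, one leaf.
toy = [
    (1, "NUM", 4),
    (2, "CCONJ", 3),
    (3, "NUM", 1),
    (4, "NOUN", 0),
    (5, "NUM", 4),
]
print(child_degree_histogram(toy))

# Check the page's own figures: counts sum to the token total and the
# rounded shares match the reported percentages.
reported = {"leaves": 998, "one": 379, "two": 179, "three_plus": 83}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(total, shares)
```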

Children of NUM nodes are attached using 19 different relations: nmod (170; 16% instances), det (143; 13% instances), conj (141; 13% instances), cc (130; 12% instances), case (83; 8% instances), fixed (65; 6% instances), cop (54; 5% instances), nsubj (48; 5% instances), advmod (47; 4% instances), discourse (44; 4% instances), orphan (33; 3% instances), appos (27; 3% instances), acl (25; 2% instances), obl (15; 1% instances), advcl (12; 1% instances), amod (11; 1% instances), mark (10; 1% instances), nummod (3; 0% instances), advcl:cmp (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: NUM (191; 18% instances), NOUN (144; 14% instances), DET (143; 13% instances), CCONJ (130; 12% instances), ADV (98; 9% instances), ADP (84; 8% instances), PRON (71; 7% instances), ADJ (60; 6% instances), AUX (55; 5% instances), VERB (47; 4% instances), PROPN (32; 3% instances), SCONJ (7; 1% instances)