This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Greek-PROIEL: POS Tags: NOUN

There are 3042 NOUN lemmas (32%), 7697 NOUN types (23%) and 35818 NOUN tokens (17%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
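The three counts above can be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page. The names `read_tokens` and `pos_counts` are hypothetical, the sample rows are a toy built from forms listed on this page (their heads and relations are made up), and the sketch assumes that types are compared as exact strings, with no case or accent normalisation.

```python
COLUMNS = ("id", "form", "lemma", "upos", "xpos",
           "feats", "head", "deprel", "deps", "misc")

def read_tokens(text):
    """Yield one dict per basic token line of a CoNLL-U string."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        fields = line.split("\t")
        # Skip multiword ranges such as "1-2" and empty nodes such as "1.1".
        if len(fields) != 10 or not fields[0].isdigit():
            continue
        yield dict(zip(COLUMNS, fields))

def pos_counts(tokens, upos):
    """Count distinct lemmas, distinct forms (types) and tokens for one tag."""
    selected = [t for t in tokens if t["upos"] == upos]
    return {
        "lemmas": len({t["lemma"] for t in selected}),
        "types": len({t["form"] for t in selected}),
        "tokens": len(selected),
    }

# Toy rows, illustrative only (not an excerpt from the treebank).
ROWS = [
    ("1", "θεοῦ", "θεός", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("2", "θεὸς", "θεός", "NOUN", "_", "_", "1", "conj", "_", "_"),
    ("3", "λόγον", "λόγος", "NOUN", "_", "_", "1", "conj", "_", "_"),
    ("4", "θεοῦ", "θεός", "NOUN", "_", "_", "1", "conj", "_", "_"),
    ("5", "καὶ", "καί", "CCONJ", "_", "_", "4", "cc", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

counts = pos_counts(list(read_tokens(SAMPLE)), "NOUN")
print(counts)
```

On the toy sample this yields 2 lemmas, 3 types and 4 tokens: the repeated form θεοῦ counts once as a type but twice as a token.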

The 10 most frequent NOUN lemmas: θεός, κύριος, ἄνθρωπος, ἀνήρ, λόγος, ἡμέρα, πατήρ, βασιλεύς, γυνή, πόλις

The 10 most frequent NOUN types: θεοῦ, θεὸς, κυρίου, λόγον, θεῷ, πόλιν, γῆς, κύριος, ἀνθρώπων, ἡμέρας

The 10 most frequent ambiguous lemmas: κύριος (NOUN 685, ADJ 4), Πέρσης (NOUN 312, PROPN 1), δοῦλος (NOUN 138, ADJ 3), Ἴων (NOUN 126, PROPN 1), Λυδός (NOUN 96, PROPN 2), δικαιοσύνη (NOUN 87, ADJ 1), Μῆδος (NOUN 85, ADJ 1), θύρα (NOUN 49, VERB 1), φίλος (NOUN 46, ADJ 7), Σκύθης (NOUN 44, PROPN 4)

The 10 most frequent ambiguous types: νέας (NOUN 84, ADJ 2), χάριν (NOUN 41, ADP 7), ὥρα (NOUN 31, VERB 7), δικαιοσύνης (NOUN 29, ADJ 1), ἀρχήν (NOUN 21, ADV 1), δόξῃ (NOUN 20, VERB 2), ἡμέρης (NOUN 20, ADJ 1), Πέρσην (NOUN 19, PROPN 1), βαρβάρων (NOUN 19, ADJ 1), ἱρὸν (NOUN 17, ADJ 3)
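A lemma (or type) is listed as ambiguous above when it occurs with more than one tag. The sketch below shows one way to find such items. The function name `ambiguous_items` is hypothetical and the counts in `PAIRS` are toy values, not the treebank figures.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs):
    """Map each item seen with more than one UPOS tag to its per-tag counts.

    `pairs` is an iterable of (item, upos), where item is a lemma or a form.
    """
    tags = defaultdict(Counter)
    for item, upos in pairs:
        tags[item][upos] += 1
    return {item: c for item, c in tags.items() if len(c) > 1}

# Toy data (illustrative counts only).
PAIRS = [("κύριος", "NOUN")] * 3 + [
    ("κύριος", "ADJ"),
    ("λόγος", "NOUN"),
    ("λόγος", "NOUN"),
]

result = ambiguous_items(PAIRS)
for item, tag_counts in result.items():
    print(item, dict(tag_counts))
```

Passing (form, upos) pairs instead of (lemma, upos) pairs gives the ambiguous types.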

Morphology

The form / lemma ratio of NOUN is 2.530243 (the average of all parts of speech is 3.440021).

The highest number of forms (25) was observed with the lemma “ἀδελφός”: ἀδελφέ, ἀδελφεοί, ἀδελφεούς, ἀδελφεοὶ, ἀδελφεοὺς, ἀδελφεοῖσι, ἀδελφεοῦ, ἀδελφεόν, ἀδελφεός, ἀδελφεὸν, ἀδελφεὸς, ἀδελφεῶν, ἀδελφεῷ, ἀδελφοί, ἀδελφούς, ἀδελφοὶ, ἀδελφοὺς, ἀδελφοῖς, ἀδελφοῦ, ἀδελφόν, ἀδελφός, ἀδελφὸν, ἀδελφὸς, ἀδελφῶν, ἀδελφῷ.

The 2nd highest number of forms (18) was observed with the lemma “γυνή”: γυνή, γυναικί, γυναικός, γυναικὶ, γυναικὸς, γυναικῶν, γυναιξί, γυναιξίν, γυναιξὶ, γυναιξὶν, γυναῖκά, γυναῖκάς, γυναῖκές, γυναῖκα, γυναῖκας, γυναῖκες, γυνὴ, γύναι.

The 3rd highest number of forms (17) was observed with the lemma “χείρ”: χέρες, χείρ, χειρί, χειρός, χειρὶ, χειρὸς, χειρῶν, χερσί, χερσίν, χερσὶ, χερσὶν, χεὶρ, χεῖρά, χεῖράς, χεῖρα, χεῖρας, χεῖρες.
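The reported form / lemma ratio for NOUN matches the type and lemma counts given at the top of the page (7697 / 3042 ≈ 2.530243). The listings above also show that forms are compared as exact strings, so variants that differ only in accent (for example γυνή and γυνὴ) count separately. A minimal sketch of both computations follows. The function names are hypothetical and `PAIRS` is a toy sample built from forms on this page.

```python
from collections import defaultdict

def forms_by_lemma(pairs):
    """Group the distinct forms observed for each lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms

def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    n_types = len({form for form, _ in pairs})
    n_lemmas = len({lemma for _, lemma in pairs})
    return n_types / n_lemmas

# Toy (form, lemma) pairs; the last one repeats an earlier form.
PAIRS = [
    ("χείρ", "χείρ"), ("χεὶρ", "χείρ"), ("χειρός", "χείρ"),
    ("γυνή", "γυνή"), ("γυνὴ", "γυνή"),
    ("χείρ", "χείρ"),
]

# Rank lemmas by number of distinct forms, ties broken alphabetically.
ranked = sorted(forms_by_lemma(PAIRS).items(),
                key=lambda kv: (-len(kv[1]), kv[0]))
toy_ratio = form_lemma_ratio(PAIRS)

# The ratio reported on this page, recomputed from its own counts.
reported_ratio = round(7697 / 3042, 6)
print(toy_ratio, reported_ratio)
```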

NOUN occurs with 3 features: Case (35741; 100% instances), Number (35741; 100% instances), Gender (35740; 100% instances)

NOUN occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 43 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (3968 tokens). Examples: πόλιν, γῆν, θάλασσαν, γυναῖκα, ὁδὸν, βασιλείαν, δόξαν, χώρην, ἐξουσίαν, ἡμέραν
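The feature statistics above can be derived from the FEATS column, where features are separated by `|` and a multi-valued feature such as `Gender=Fem,Masc` is kept as a single feature-value pair. The following is a sketch under those assumptions. `parse_feats` is a hypothetical helper and `FEATS` is a toy list, not treebank data.

```python
from collections import Counter

def parse_feats(feats):
    """Turn 'Case=Acc|Gender=Fem|Number=Sing' into a dict; '_' means none."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))

# Toy FEATS values for five NOUN tokens; the last token has no features.
FEATS = [
    "Case=Acc|Gender=Fem|Number=Sing",
    "Case=Acc|Gender=Fem|Number=Sing",
    "Case=Gen|Gender=Masc|Number=Sing",
    "Case=Nom|Gender=Fem,Masc|Number=Plur",
    "_",
]

# How many tokens carry each feature name.
feature_counts = Counter(name for f in FEATS for name in parse_feats(f))

# How many tokens carry each feature-value pair.
pair_counts = Counter(
    f"{name}={value}" for f in FEATS for name, value in parse_feats(f).items()
)

# The most frequent full combination, ignoring tokens without features.
combo, combo_freq = Counter(f for f in FEATS if f != "_").most_common(1)[0]
print(feature_counts, combo, combo_freq)
```

Tokens whose FEATS column is `_` explain why the per-feature counts on this page (35741 for Case) fall slightly below the 35818 NOUN tokens.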

Relations

NOUN nodes are attached to their parents using 23 different relations: obl (8251; 23% instances), obj (6701; 19% instances), nsubj (5419; 15% instances), nmod (4909; 14% instances), conj (3272; 9% instances), obl:arg (1718; 5% instances), appos (1078; 3% instances), nsubj:pass (949; 3% instances), root (930; 3% instances), xcomp (683; 2% instances), vocative (575; 2% instances), orphan (323; 1% instances), advcl (236; 1% instances), obl:agent (194; 1% instances), advcl:cmp (159; 0% instances), acl (147; 0% instances), ccomp (143; 0% instances), dislocated (71; 0% instances), parataxis (36; 0% instances), nsubj:outer (16; 0% instances), csubj:pass (6; 0% instances), csubj (1; 0% instances), dep (1; 0% instances)
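The percentages in the relation list are each count divided by the number of NOUN tokens, rounded to the nearest whole percent, which is why rare relations show 0%. The sketch below rechecks this with the counts from the list above. The variable and function names are hypothetical.

```python
# Incoming relation counts for NOUN nodes, copied from the list above.
RELATION_COUNTS = {
    "obl": 8251, "obj": 6701, "nsubj": 5419, "nmod": 4909, "conj": 3272,
    "obl:arg": 1718, "appos": 1078, "nsubj:pass": 949, "root": 930,
    "xcomp": 683, "vocative": 575, "orphan": 323, "advcl": 236,
    "obl:agent": 194, "advcl:cmp": 159, "acl": 147, "ccomp": 143,
    "dislocated": 71, "parataxis": 36, "nsubj:outer": 16,
    "csubj:pass": 6, "csubj": 1, "dep": 1,
}

def percent(count, total):
    """Share of `count` in `total`, as a whole-number percentage."""
    return round(100 * count / total)

# Every NOUN token has exactly one incoming relation, so the counts
# should add up to the NOUN token total reported on this page.
total = sum(RELATION_COUNTS.values())

for relation, count in RELATION_COUNTS.items():
    print(f"{relation} ({count}; {percent(count, total)}% instances)")
```

The counts sum to 35818, the number of NOUN tokens, and the rounded shares agree with the percentages printed above.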

Parents of NOUN nodes belong to 13 different parts of speech (counting the artificial root, which has no tag, as one): VERB (22894; 64% instances), NOUN (7585; 21% instances), ADJ (1645; 5% instances), PROPN (973; 3% instances), [root, no parent] (930; 3% instances), PRON (762; 2% instances), AUX (399; 1% instances), ADV (383; 1% instances), NUM (144; 0% instances), INTJ (34; 0% instances), SCONJ (28; 0% instances), DET (24; 0% instances), ADP (17; 0% instances)

5259 (15%) NOUN nodes are leaves.

12658 (35%) NOUN nodes have one child.

11115 (31%) NOUN nodes have two children.

6786 (19%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.
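The four child-degree groups above partition the NOUN tokens (5259 + 12658 + 11115 + 6786 = 35818). The following sketch shows how such a breakdown could be computed for one sentence from the ID and HEAD columns. The function name `child_degree_buckets` and the five-token tree are hypothetical.

```python
from collections import Counter

def child_degree_buckets(sentence, upos="NOUN"):
    """Count nodes with tag `upos` by number of children: 0, 1, 2 or 3+.

    `sentence` is a list of (id, upos, head) tuples for one tree,
    where head 0 marks the root. Bucket 3 stands for three or more.
    """
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for token_id, tag, _ in sentence:
        if tag == upos:
            buckets[min(n_children[token_id], 3)] += 1
    return buckets

# Toy tree: token 5 (VERB) is the root, token 2 (NOUN) has children 1 and 4,
# token 4 (NOUN) has child 3.
SENTENCE = [
    (1, "DET", 2),
    (2, "NOUN", 5),
    (3, "DET", 4),
    (4, "NOUN", 2),
    (5, "VERB", 0),
]

buckets = child_degree_buckets(SENTENCE)
print(dict(buckets))

# Consistency check on the figures reported above.
reported_total = 5259 + 12658 + 11115 + 6786
```

Summing the buckets over all sentences of a treebank would give the leaf, one-child, two-children and three-or-more counts.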

Children of NOUN nodes are attached using 26 different relations: det (25022; 43% instances), case (8546; 15% instances), nmod (6440; 11% instances), cc (3456; 6% instances), amod (3330; 6% instances), conj (3072; 5% instances), acl (2135; 4% instances), cop (1062; 2% instances), nummod (920; 2% instances), appos (880; 1% instances), discourse (852; 1% instances), nsubj (843; 1% instances), advmod (721; 1% instances), orphan (537; 1% instances), mark (350; 1% instances), obl (175; 0% instances), advcl (173; 0% instances), ccomp (111; 0% instances), obl:arg (57; 0% instances), dislocated (47; 0% instances), vocative (31; 0% instances), csubj (27; 0% instances), parataxis (15; 0% instances), advcl:cmp (12; 0% instances), obj (9; 0% instances), dep (2; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: DET (21800; 37% instances), ADP (8554; 15% instances), NOUN (7585; 13% instances), ADJ (4426; 8% instances), PRON (4266; 7% instances), CCONJ (3460; 6% instances), VERB (2785; 5% instances), ADV (1796; 3% instances), PROPN (1778; 3% instances), AUX (1097; 2% instances), NUM (966; 2% instances), SCONJ (184; 0% instances), INTJ (128; 0% instances)