Treebank Statistics: UD_Hittite-HitTB: POS Tags: NOUN
There are 155 NOUN
lemmas (35%), 226 NOUN
types (32%) and 387 NOUN
tokens (30%).
Out of 15 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: URU, d, utnē-, šiun(i)-, GIŠ, LÚ, pišna/i-, UDU-u-, per, šiwatt-
The 10 most frequent NOUN
types: URU, d, LÚ, GIŠ, KUR, KÙ.BABBAR, LÚ.MEŠ, m, DUMU, UDU
The 10 most frequent ambiguous lemmas: šuḫḫa- (NOUN 2, VERB 1), idālu- (ADJ 2, NOUN 1)
The 10 most frequent ambiguous types: GAL (ADJ 1, NOUN 1)
- GAL
- ADJ 1: nu-u š-ša-an 2 NINDA.GUR4.RA.ḪI.A GAL 10 NINDA.GUR4.RA TUR GIŠ BANŠUR-i da-a-i
- NOUN 1: MUNUS.LUGAL f Pu-du-ḫé-pa-aš kán ku-wa-pí m UR.MAḪ.LÚ-in GAL DUB.SAR.MEŠ URU Ḫa-at-tu-ši A-NA ṬUP-PA-ḪI.A URU Ki-iz-zu-wa-at-na ša-an-ḫu-u-wa-an-zi ú-e-ri-ya-at n a-aš-ta ke-e ṬUP-PA-ḪI.A ŠA EZEN4 ḫi-šu-wa-a-aš a-pí-ya UD-at ar-ḫa a-ni-ya-at
Morphology
The form / lemma ratio of NOUN
is 1.458065 (the average of all parts of speech is 1.571106).
The 1st highest number of forms (7) was observed with the lemma “šiun(i)-”: DINGIR, DINGIR-LIM, DINGIR-LIM-ni, DINGIR-LUM, DINGIR.MEŠ-aš, DINGIR.MEŠ-eš, ši-ú.
The 2nd highest number of forms (5) was observed with the lemma “per”: pár-na, É, É-er, É-na, É-ri.
The 3rd highest number of forms (5) was observed with the lemma “šiwatt-”: UD, UD-UM, UD-at, UD-aš, UD-ti.
NOUN
occurs with 5 features: Gender (254; 66% instances), Number (223; 58% instances), Case (198; 51% instances), Language (3; 1% instances), Definite (2; 1% instances)
NOUN
occurs with 14 feature-value pairs: Case=Abl
, Case=Abs
, Case=Acc
, Case=All
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Nom
, Definite=Cons
, Gender=Com
, Gender=Neut
, Language=Sum
, Number=Plur
, Number=Sing
NOUN
occurs with 49 feature combinations.
The most frequent feature combination is _
(119 tokens).
Examples: URU, d, GIŠ, LÚ, m, GÍN, UDU, DUG, KAM, MA.NA
Relations
NOUN
nodes are attached to their parents using 16 different relations: nmod:det (97; 25% instances), obj (69; 18% instances), obl (68; 18% instances), nmod (49; 13% instances), nsubj (45; 12% instances), conj (17; 4% instances), iobj (7; 2% instances), appos (6; 2% instances), root (6; 2% instances), xcomp (6; 2% instances), dislocated (5; 1% instances), compound (4; 1% instances), orphan (4; 1% instances), vocative (2; 1% instances), advcl (1; 0% instances), parataxis (1; 0% instances)
Parents of NOUN
nodes belong to 9 different parts of speech: VERB (199; 51% instances), NOUN (92; 24% instances), PROPN (81; 21% instances), (6; 2% instances), ADJ (4; 1% instances), PRON (2; 1% instances), DET (1; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)
210 (54%) NOUN
nodes are leaves.
112 (29%) NOUN
nodes have one child.
45 (12%) NOUN
nodes have two children.
20 (5%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 6.
Children of NOUN
nodes are attached using 25 different relations: nmod (88; 32% instances), case (34; 12% instances), nmod:det (26; 9% instances), nummod (26; 9% instances), conj (17; 6% instances), discourse (15; 5% instances), amod (12; 4% instances), det (12; 4% instances), orphan (8; 3% instances), cc (6; 2% instances), nsubj (5; 2% instances), compound (4; 1% instances), acl:relcl (3; 1% instances), advmod (3; 1% instances), advmod:emph (3; 1% instances), cop (3; 1% instances), discourse:conn (2; 1% instances), dislocated (2; 1% instances), expl:pass (2; 1% instances), obl (2; 1% instances), acl (1; 0% instances), appos (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances)
Children of NOUN
nodes belong to 13 different parts of speech: NOUN (92; 33% instances), PRON (39; 14% instances), ADP (34; 12% instances), NUM (26; 9% instances), PART (25; 9% instances), PROPN (20; 7% instances), DET (12; 4% instances), ADJ (9; 3% instances), VERB (8; 3% instances), CCONJ (6; 2% instances), ADV (3; 1% instances), AUX (3; 1% instances), SCONJ (1; 0% instances)