
This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Armenian-CAVaL: POS Tags: NOUN

There are 1212 NOUN lemmas (38%), 2431 NOUN types (32%) and 12129 NOUN tokens (14%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: Աստուած, որդի, հայր, տէր, աշակերտ, պատասխանի, ժողովուրդ, բան, այր, աւր

The 10 most frequent NOUN types: պատասխանի, Աստուածոյ, հայր, որդի, տէր, այր, անուն, Աստուած, մարդոյ, աշակերտք

The 10 most frequent ambiguous lemmas: ձեռն (NOUN 114, ADP 15), մանուկ (NOUN 59, ADJ 1), հանդերձ (NOUN 50, ADP 44), պարտ (NOUN 38, ADJ 2), մէջ (ADV 40, NOUN 30), այս (PRON 197, DET 176, NOUN 29), յաւիտեան (NOUN 20, ADV 4), յոյն (NOUN 20, ADJ 3), անապատ (NOUN 19, ADJ 14), ինչ (PRON 215, DET 38, ADV 25, NOUN 19)

The 10 most frequent ambiguous types: ձեռն (NOUN 45, ADP 15), մանուկ (NOUN 41, ADJ 1), տան (NOUN 39, VERB 3), պարտ (NOUN 35, ADJ 2), հանդերձ (ADP 44, NOUN 16), մէջ (ADV 40, NOUN 16), այս (PRON 137, DET 83, NOUN 14), յաւիտեան (NOUN 12, ADV 3), վաղիւ (NOUN 11, ADV 4), անապատի (NOUN 9, ADJ 7)
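The ambiguity lists above can be reproduced in spirit with a few lines of Python. This is a minimal sketch, not the script that generated this page: it assumes that a lemma counts as ambiguous when it occurs with more than one UPOS tag, and that entries are ordered by their NOUN count, which is what the figures above suggest. The function name `ambiguous_lemmas` and the toy counts are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(rows, upos="NOUN"):
    """Return lemmas seen with `upos` and at least one other tag.

    rows: iterable of (form, lemma, upos) triples.
    Output: list of (lemma, [(tag, count), ...]) sorted by the count
    under `upos`, highest first (assumed ordering, see lead-in).
    """
    by_lemma = defaultdict(Counter)
    for _form, lemma, tag in rows:
        by_lemma[lemma][tag] += 1
    ambiguous = [
        (lemma, tags.most_common())
        for lemma, tags in by_lemma.items()
        if len(tags) > 1 and upos in tags
    ]
    return sorted(ambiguous, key=lambda item: -dict(item[1])[upos])

# Toy data with made-up counts (NOT the treebank's real frequencies).
TOY_ROWS = [
    ("ձեռն", "ձեռն", "NOUN"),
    ("ձեռն", "ձեռն", "NOUN"),
    ("ձեռն", "ձեռն", "ADP"),
    ("մէջ", "մէջ", "ADV"),
    ("մէջ", "մէջ", "ADV"),
    ("մէջ", "մէջ", "NOUN"),
    ("որդի", "որդի", "NOUN"),
]

result = ambiguous_lemmas(TOY_ROWS)
```

To list ambiguous types instead of lemmas, the same function can be called with the form placed in the lemma position of each triple.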

Morphology

The form / lemma ratio of NOUN is 2.005776 (the average of all parts of speech is 2.427126).
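The ratio above agrees with dividing the number of NOUN types by the number of NOUN lemmas reported at the top of this page (2431 / 1212). The sketch below checks that arithmetic and shows how the same ratio could be computed from token rows. The helper `form_lemma_ratio` is hypothetical, and treating forms case-sensitively is an assumption.

```python
def form_lemma_ratio(rows, upos="NOUN"):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    rows: list of (form, lemma, upos) triples (iterated twice).
    """
    forms = {form for form, _lemma, tag in rows if tag == upos}
    lemmas = {lemma for _form, lemma, tag in rows if tag == upos}
    return len(forms) / len(lemmas) if lemmas else 0.0

# Check against the counts reported on this page.
reported = round(2431 / 1212, 6)
print(reported)  # 2.005776

# Toy rows using forms listed on this page; the selection is illustrative.
TOY_ROWS = [
    ("հրեշտակ", "հրեշտակ", "NOUN"),
    ("հրեշտակաց", "հրեշտակ", "NOUN"),
    ("հրեշտակք", "հրեշտակ", "NOUN"),
    ("որդի", "որդի", "NOUN"),
]
toy_ratio = form_lemma_ratio(TOY_ROWS)  # 4 forms / 2 lemmas
```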

The highest number of forms (11) was observed with the lemma “ժողովուրդ”: ժոլովուրդս, ժողով, ժողովըրդեան, ժողովըրդենէ, ժողովուրդ, ժողովուրդս, ժողովուրդք, ժողովրդեան, ժողովրդենէ, ժողովրդով, ժողովրդոց.

The 2nd highest number of forms (10) was observed with the lemma “փարիսեցի”: փարեսեցի, փարեսեցւոյ, փարիսացի, փարիսացիք, փարիսացւոց, փարիսեցի, փարիսեցիս, փարիսեցիք, փարիսեցւոյ, փարիսեցւոց.

The 3rd highest number of forms (9) was observed with the lemma “հրեշտակ”: հըրեշտեկաց, հրեշտակ, հրեշտակաց, հրեշտակէ, հրեշտակի, հրեշտակս, հրեշտակք, հրեշտեկաց, հրեշտեկաւք.

NOUN occurs with 2 features: Case (12128; 100% instances), Number (12128; 100% instances)

NOUN occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing

NOUN occurs with 15 feature combinations. The most frequent feature combination is Case=Acc|Number=Sing (3083 tokens). Examples: պատասխանի, անձն, հայր, տուն, անուն, երկիր, բան, Աստուած, որդի, հաց
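The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file, where pairs are joined with `|` and an empty feature set is written as `_`. The following is a minimal sketch under that assumption. The function names and the toy input are hypothetical, not taken from the treebank tooling.

```python
from collections import Counter

def parse_feats(feats):
    """Turn 'Case=Acc|Number=Sing' into {'Case': 'Acc', 'Number': 'Sing'}."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the empty set '_' counts as a combination too
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos

# Toy FEATS values for four NOUN tokens (illustrative, not real counts).
TOY_FEATS = [
    "Case=Acc|Number=Sing",
    "Case=Acc|Number=Sing",
    "Case=Nom|Number=Plur",
    "_",
]
features, pairs, combos = feature_stats(TOY_FEATS)
most_frequent_combo = combos.most_common(1)[0]
```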

Relations

NOUN nodes are attached to their parents using 26 different relations: obl (3089; 25% instances), obj (2483; 20% instances), nsubj (1862; 15% instances), nmod (1705; 14% instances), conj (975; 8% instances), iobj (304; 3% instances), ccomp (264; 2% instances), vocative (229; 2% instances), root (219; 2% instances), appos (181; 1% instances), advcl (155; 1% instances), xcomp (144; 1% instances), obl:arg (121; 1% instances), orphan (106; 1% instances), acl (95; 1% instances), nsubj:pass (50; 0% instances), nsubj:caus (39; 0% instances), obl:agent (35; 0% instances), csubj (17; 0% instances), amod (11; 0% instances), compound:redup (11; 0% instances), fixed (10; 0% instances), dislocated (9; 0% instances), flat (9; 0% instances), parataxis (5; 0% instances), compound (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (8085; 67% instances), NOUN (2387; 20% instances), ADJ (477; 4% instances), PRON (306; 3% instances), PROPN (301; 2% instances), ROOT, i.e. no parent (219; 2% instances), ADV (149; 1% instances), AUX (77; 1% instances), NUM (52; 0% instances), DET (26; 0% instances), ADP (24; 0% instances), INTJ (21; 0% instances), CCONJ (2; 0% instances), PART (2; 0% instances), X (1; 0% instances)
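Counts like the relation and parent-POS figures above can be gathered by reading the HEAD and DEPREL columns of a CoNLL-U file. The sketch below is one way to do it, assuming the standard 10-column format. The function names, the `ROOT` label for head 0, and the placeholder tokens `w1` to `w5` are all hypothetical. Note that 219 NOUN nodes carry the `root` relation, the same number as the parents that have no POS tag.

```python
from collections import Counter

def parse_conllu(text):
    """Yield sentences as lists of (id, form, upos, head, deprel)."""
    sent = []
    for line in text.splitlines():
        if not line.strip():  # blank line ends a sentence
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):  # sentence-level comment
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges and empty nodes
            continue
        sent.append((int(cols[0]), cols[1], cols[3], int(cols[6]), cols[7]))
    if sent:
        yield sent

def noun_attachments(text, upos="NOUN"):
    """Count incoming relations and parent POS tags of `upos` nodes."""
    relations, parent_pos = Counter(), Counter()
    for sent in parse_conllu(text):
        pos_by_id = {tid: tag for tid, _form, tag, _head, _rel in sent}
        for _tid, _form, tag, head, deprel in sent:
            if tag == upos:
                relations[deprel] += 1
                # head 0 is the artificial root, which has no POS tag
                parent_pos[pos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

def row(tid, form, upos, head, deprel):
    """Build one 10-column CoNLL-U line with unused columns set to '_'."""
    return "\t".join([str(tid), form, "_", upos, "_", "_", str(head), deprel, "_", "_"])

# Two toy sentences with placeholder tokens (not real treebank data).
TOY = "\n".join([
    "# sent_id = toy-1",
    row(1, "w1", "NOUN", 2, "nsubj"),
    row(2, "w2", "VERB", 0, "root"),
    row(3, "w3", "ADP", 4, "case"),
    row(4, "w4", "NOUN", 2, "obl"),
    "",
    "# sent_id = toy-2",
    row(1, "w5", "NOUN", 0, "root"),
    "",
])
relations, parent_pos = noun_attachments(TOY)
```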

2332 (19%) NOUN nodes are leaves.

3520 (29%) NOUN nodes have one child.

3680 (30%) NOUN nodes have two children.

2597 (21%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.
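The four child-count groups above partition the NOUN tokens: 2332 + 3520 + 3680 + 2597 = 12129. A minimal sketch of how such a breakdown could be computed from dependency heads follows. The function name, the input layout, and the toy sentences are hypothetical.

```python
from collections import Counter

def child_degree_stats(sentences, upos="NOUN"):
    """Bucket `upos` nodes by number of children (0, 1, 2, 3 or more).

    sentences: list of sentences, each a list of (id, upos, head) triples.
    Returns (buckets, max_degree); bucket key 3 means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # number of nodes whose HEAD points at each id
        n_children = Counter(head for _tid, _tag, head in sent)
        for tid, tag, _head in sent:
            if tag != upos:
                continue
            degree = n_children[tid]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree

# Toy input (not real treebank data).
TOY_SENTENCES = [
    [(1, "NOUN", 2), (2, "VERB", 0), (3, "ADP", 4), (4, "NOUN", 2), (5, "DET", 4)],
    [(1, "NOUN", 0), (2, "ADP", 1), (3, "DET", 1), (4, "ADJ", 1)],
]
buckets, max_degree = child_degree_stats(TOY_SENTENCES)

# Consistency check on the figures reported on this page.
page_total = 2332 + 3520 + 3680 + 2597
```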

Children of NOUN nodes are attached using 30 different relations: case (5370; 26% instances), det (4477; 21% instances), nmod (3118; 15% instances), punct (1664; 8% instances), cc (1046; 5% instances), conj (935; 4% instances), amod (753; 4% instances), cop (681; 3% instances), acl (638; 3% instances), nsubj (382; 2% instances), orphan (381; 2% instances), nummod (285; 1% instances), mark (254; 1% instances), advmod (228; 1% instances), obl (163; 1% instances), advcl (122; 1% instances), iobj (69; 0% instances), xcomp (65; 0% instances), discourse (63; 0% instances), ccomp (51; 0% instances), appos (47; 0% instances), csubj (20; 0% instances), obj (17; 0% instances), compound:redup (11; 0% instances), vocative (11; 0% instances), parataxis (9; 0% instances), flat (3; 0% instances), obl:arg (3; 0% instances), compound (2; 0% instances), dislocated (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: ADP (5433; 26% instances), DET (4527; 22% instances), NOUN (2387; 11% instances), PRON (1909; 9% instances), PUNCT (1664; 8% instances), CCONJ (1127; 5% instances), VERB (885; 4% instances), ADJ (877; 4% instances), AUX (695; 3% instances), PROPN (393; 2% instances), NUM (312; 1% instances), SCONJ (267; 1% instances), ADV (189; 1% instances), PART (171; 1% instances), INTJ (33; 0% instances)