home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cappadocian-AMGiC: POS Tags: NOUN

There are 56 NOUN lemmas (28%), 60 NOUN types (24%) and 78 NOUN tokens (17%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: kóri, pará, psomí, staxtiǰís, vavás, (e)néka, enéka, mána, pséma, t’emélio

The 10 most frequent NOUN types: kóri, pará, psomí, enéka, mána, psémata, staxtiǰís, t’emélia, vaván, Aγás

The 10 most frequent ambiguous lemmas: _ (X 2, NOUN 1, PRON 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.071429 (the average of all parts of speech is 1.238806).

The 1st highest number of forms (2) was observed with the lemma “(e)néka”: enéka, néka.

The 2nd highest number of forms (2) was observed with the lemma “enéka”: enéka, enékan.

The 3rd highest number of forms (2) was observed with the lemma “kóri”: góri, kóri.

NOUN occurs with 3 features: Case (74; 95% instances), Number (74; 95% instances), Gender (73; 94% instances)

NOUN occurs with 9 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 14 feature combinations. The most frequent feature combination is Case=Acc|Gender=Neut|Number=Sing (18 tokens). Examples: psomí, kalaǰí, küréi, limóri, ombrín, pará, perín, pisárin, psematáki, spíči

Relations

NOUN nodes are attached to their parents using 10 different relations: obj (29; 37% instances), nsubj (20; 26% instances), obl (16; 21% instances), vocative (4; 5% instances), ccomp (2; 3% instances), conj (2; 3% instances), iobj (2; 3% instances), appos (1; 1% instances), nmod (1; 1% instances), xcomp (1; 1% instances)

Parents of NOUN nodes belong to 5 different parts of speech: VERB (64; 82% instances), ADV (7; 9% instances), NOUN (5; 6% instances), ADJ (1; 1% instances), PROPN (1; 1% instances)

22 (28%) NOUN nodes are leaves.

30 (38%) NOUN nodes have one child.

17 (22%) NOUN nodes have two children.

9 (12%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 15 different relations: det (32; 33% instances), nmod (23; 24% instances), amod (10; 10% instances), case (6; 6% instances), punct (5; 5% instances), cc (3; 3% instances), cop (3; 3% instances), mark (3; 3% instances), acl (2; 2% instances), aux:q (2; 2% instances), conj (2; 2% instances), nummod (2; 2% instances), advmod:emph (1; 1% instances), appos (1; 1% instances), nsubj (1; 1% instances)

Children of NOUN nodes belong to 12 different parts of speech: DET (29; 30% instances), PRON (25; 26% instances), ADJ (10; 10% instances), ADP (6; 6% instances), AUX (5; 5% instances), NOUN (5; 5% instances), PUNCT (5; 5% instances), CCONJ (3; 3% instances), SCONJ (3; 3% instances), ADV (2; 2% instances), NUM (2; 2% instances), VERB (1; 1% instances)