home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Irish-DipSGG: POS Tags: NOUN

There are 98 NOUN lemmas (43%), 113 NOUN types (38%) and 118 NOUN tokens (28%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: _, cocart, membrum, cásc, dub, día, días, ingen, laithirt, lebor

The 10 most frequent NOUN types: ᚉᚑᚉᚐᚏᚈ, dia, dias, ingen, accentus, ainm, airdircus, aite, aithne, animalis

The 10 most frequent ambiguous lemmas: _ (NOUN 7, X 5, VERB 4, ADV 3, CCONJ 2, SCONJ 2, ADJ 1, PROPN 1, PUNCT 1)

The 10 most frequent ambiguous types: con (NOUN 1, SCONJ 1)

Morphology

The form / lemma ratio of NOUN is 1.153061 (the average of all parts of speech is 1.295652).

The 1st highest number of forms (7) was observed with the lemma “_”: accentus, animalis, femininum, nomen, pedo, sona, vesíca.

The 2nd highest number of forms (3) was observed with the lemma “membrum”: membrumm, memmbrum, memrum.

The 3rd highest number of forms (2) was observed with the lemma “cocart”: ᚉᚑᚉᚐᚏᚈ, ᚙᚑᚉᚐᚏᚈ.

NOUN occurs with 5 features: Case (108; 92% instances), Number (108; 92% instances), Foreign (9; 8% instances), Typo (4; 3% instances), Prefix (1; 1% instances)

NOUN occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Foreign=Yes, Number=Dual, Number=Plur, Number=Sing, Prefix=Yes, Typo=Yes

NOUN occurs with 13 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (49 tokens). Examples: dias, ainm, airdircus, aite, aithne, bendacht, bruach, cenéle, chliab, choimmdiu

Relations

NOUN nodes are attached to their parents using 11 different relations: root (40; 34% instances), nmod (31; 26% instances), nsubj (18; 15% instances), obj (9; 8% instances), obl (7; 6% instances), nmod:pre (5; 4% instances), conj (4; 3% instances), dislocated (1; 1% instances), obl:tmod (1; 1% instances), parataxis (1; 1% instances), vocative (1; 1% instances)

Parents of NOUN nodes belong to 6 different parts of speech: NOUN (40; 34% instances), (40; 34% instances), VERB (26; 22% instances), ADJ (10; 8% instances), PRON (1; 1% instances), PROPN (1; 1% instances)

27 (23%) NOUN nodes are leaves.

35 (30%) NOUN nodes have one child.

24 (20%) NOUN nodes have two children.

32 (27%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 23 different relations: nmod (35; 18% instances), case (25; 13% instances), det (25; 13% instances), amod (21; 11% instances), advmod (15; 8% instances), punct (11; 6% instances), nmod:poss (9; 5% instances), obl:prep (8; 4% instances), mark (5; 3% instances), nmod:pre (5; 3% instances), acl:relcl (4; 2% instances), conj (4; 2% instances), nsubj (4; 2% instances), parataxis (4; 2% instances), cc (3; 2% instances), acl (2; 1% instances), advcl (2; 1% instances), case:voc (2; 1% instances), discourse (2; 1% instances), dislocated (2; 1% instances), nummod (1; 1% instances), obl:tmod (1; 1% instances), vocative (1; 1% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (40; 21% instances), ADP (33; 17% instances), DET (25; 13% instances), ADJ (16; 8% instances), ADV (15; 8% instances), PUNCT (11; 6% instances), PRON (10; 5% instances), PART (9; 5% instances), VERB (9; 5% instances), PROPN (6; 3% instances), SCONJ (5; 3% instances), AUX (4; 2% instances), CCONJ (3; 2% instances), INTJ (2; 1% instances), NUM (2; 1% instances), X (1; 1% instances)