home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Armenian-ArmTDP: POS Tags: X

There are 109 X lemmas (1%), 110 X types (1%) and 191 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: ո, ր, շ, թույլ, մ, նկատի, ու, յելլ, կոյունլու, ւ

The 10 most frequent X types: ո, ր, շ, թույլ, մ, նկատի, ու, Յելլ, կոյունլու, ւ

The 10 most frequent ambiguous lemmas: ու (CCONJ 689, X 5), ը (NOUN 40, X 1), տարեց (ADJ 3, X 1), փուլ (NOUN 10, X 1), օ (INTJ 3, X 1)

The 10 most frequent ambiguous types: ու (CCONJ 668, X 5), Ա (ADJ 5, X 1), Զ (PROPN 1, X 1), Օ (NOUN 2, INTJ 1, X 1), ը (NOUN 12, X 1), տարեց (ADJ 2, X 1)

Morphology

The form / lemma ratio of X is 1.009174 (the average of all parts of speech is 1.814455).

The 1st highest number of forms (2) was observed with the lemma “The”: The, Windrose.

The 2nd highest number of forms (1) was observed with the lemma “AGBU”: AGBU.

The 3rd highest number of forms (1) was observed with the lemma “Allianplace”: Allianplace.

X occurs with 5 features: Foreign (88; 46% instances), Style (13; 7% instances), Abbr (6; 3% instances), Echo (1; 1% instances), Typo (1; 1% instances)

X occurs with 8 feature-value pairs: Abbr=Yes, Echo=Ech, Foreign=Yes, Style=Arch, Style=Coll, Style=Rare, Style=Vrnc, Typo=Yes

X occurs with 8 feature combinations. The most frequent feature combination is _ (90 tokens). Examples: ո, ր, շ, թույլ, մ, ու, կոյունլու, ւ, Ե, Ց

Relations

X nodes are attached to their parents using 16 different relations: dep (54; 28% instances), flat (39; 20% instances), compound:lvc (30; 16% instances), appos (14; 7% instances), nmod (14; 7% instances), root (11; 6% instances), conj (7; 4% instances), nsubj (7; 4% instances), compound (5; 3% instances), parataxis (3; 2% instances), advcl (2; 1% instances), nmod:npmod (1; 1% instances), nmod:poss (1; 1% instances), nsubj:pass (1; 1% instances), obj (1; 1% instances), obl (1; 1% instances)

Parents of X nodes belong to 8 different parts of speech: X (98; 51% instances), VERB (40; 21% instances), NOUN (31; 16% instances), (11; 6% instances), PROPN (5; 3% instances), ADJ (3; 2% instances), ADV (2; 1% instances), PRON (1; 1% instances)

115 (60%) X nodes are leaves.

19 (10%) X nodes have one child.

24 (13%) X nodes have two children.

33 (17%) X nodes have three or more children.

The highest child degree of a X node is 11.

Children of X nodes are attached using 22 different relations: punct (100; 38% instances), dep (70; 27% instances), flat (41; 16% instances), advcl (9; 3% instances), nsubj (9; 3% instances), conj (8; 3% instances), aux (7; 3% instances), appos (2; 1% instances), case (2; 1% instances), cop (2; 1% instances), det (2; 1% instances), parataxis (2; 1% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), cc (1; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances), nmod (1; 0% instances), nmod:poss (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (100; 38% instances), X (98; 37% instances), NOUN (36; 14% instances), VERB (11; 4% instances), AUX (9; 3% instances), ADJ (2; 1% instances), ADP (2; 1% instances), DET (2; 1% instances), PROPN (2; 1% instances), CCONJ (1; 0% instances), NUM (1; 0% instances)