
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: AUX

There are 2 AUX lemmas (0%), 1 AUX type (6%) and 9155 AUX tokens (1%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 4 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: _، s

The 10 most frequent AUX types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), s (AUX 2274, VERB 7, NOUN 1, X 1)

The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91743, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49161, PRON 43495, ADV 24067, SCONJ 16614, NUM 15377, AUX 9155, DET 6363, PART 2521, X 927, INTJ 56)

Morphology

The form / lemma ratio of AUX is 0.500000 (the average of all parts of speech is 0.003044).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

The 2nd highest number of forms (1) was observed with the lemma “s”: _.
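The form / lemma ratio above is consistent with dividing the number of distinct AUX forms (1) by the number of distinct AUX lemmas (2). The following is a minimal sketch of that calculation, assuming the ratio is defined this way; the `SAMPLE` data and the function name `form_lemma_ratio` are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
# Hypothetical toy sample in CoNLL-U format (10 tab-separated columns).
# Word forms are "_" here, mirroring the stripped forms seen on this page.
SAMPLE = """\
1\t_\t_\tAUX\t_\t_\t2\tcop\t_\t_
2\t_\t_\tNOUN\t_\t_\t0\troot\t_\t_
3\t_\ts\tAUX\t_\t_\t4\taux\t_\t_
4\t_\t_\tVERB\t_\t_\t2\tconj\t_\t_
"""


def form_lemma_ratio(conllu_text, upos):
    """Distinct forms divided by distinct lemmas for one UPOS tag (assumed definition)."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(forms) / len(lemmas) if lemmas else 0.0


ratio = form_lemma_ratio(SAMPLE, "AUX")
print(ratio)
```

In the toy sample, both AUX tokens share the form `_` but have the lemmas `_` and `s`, so the sketch reproduces the one-form, two-lemma situation reported for this treebank.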

AUX occurs with 9 features: Gender (4101; 45% instances), Number (4101; 45% instances), Person (4088; 45% instances), Voice (4076; 45% instances), Mood (4056; 44% instances), Definite (15; 0% instances), Case (12; 0% instances), Aspect (6; 0% instances), AdpType (5; 0% instances)

AUX occurs with 20 feature-value pairs: AdpType=Prep, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Voice=Act

AUX occurs with 44 feature combinations. The most frequent feature combination is _ (5051 tokens). Examples: _
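Feature statistics like those above can be derived from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` pairs joined by `|` and an empty feature set is written as `_`. The sketch below shows one way to tally feature names, feature-value pairs and whole combinations; the input list and the function name `feature_counts` are hypothetical illustrations, not treebank data.

```python
from collections import Counter


def feature_counts(feats_values):
    """Tally feature names, feature=value pairs and full combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1  # "_" counts as the empty combination
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


# Hypothetical FEATS values for four AUX tokens.
example = [
    "_",
    "Gender=Masc|Number=Sing|Person=3",
    "Gender=Fem|Number=Sing|Person=3",
    "_",
]
names, pairs, combos = feature_counts(example)
print(names["Gender"], pairs["Number=Sing"], combos.most_common(1)[0])
```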

Relations

AUX nodes are attached to their parents using 6 different relations: cop (5289; 58% instances), aux (3574; 39% instances), obj (182; 2% instances), root (41; 0% instances), iobj (39; 0% instances), nmod (30; 0% instances)
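As a quick consistency check, the relation counts listed above sum to the 9155 AUX tokens, and the percentages can be recomputed from them. The sketch below assumes percentages are rounded to the nearest integer; the function name `relation_shares` is hypothetical.

```python
def relation_shares(counts):
    """Return each relation's share of the total as a rounded percentage."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


# Counts copied from the relation list for AUX nodes on this page.
aux_relations = {
    "cop": 5289,
    "aux": 3574,
    "obj": 182,
    "root": 41,
    "iobj": 39,
    "nmod": 30,
}

total_aux = sum(aux_relations.values())
shares = relation_shares(aux_relations)
print(total_aux, shares)
```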

Parents of AUX nodes belong to 15 different parts of speech: VERB (5164; 56% instances), NOUN (2112; 23% instances), ADJ (755; 8% instances), PRON (468; 5% instances), ADV (302; 3% instances), CCONJ (115; 1% instances), PROPN (78; 1% instances), DET (54; 1% instances), [no parent: root] (41; 0% instances), AUX (24; 0% instances), NUM (20; 0% instances), X (12; 0% instances), PART (8; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

8909 (97%) AUX nodes are leaves.

46 (1%) AUX nodes have one child.

59 (1%) AUX nodes have two children.

141 (2%) AUX nodes have three or more children.

The highest child degree of an AUX node is 14.
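The child-degree figures above (leaves, one child, two children, three or more) can be obtained by counting, for every AUX node, how many tokens name it as their head. The following is a rough sketch of that bucketing on a single sentence; the `TOKENS` data and the function name `child_degree_buckets` are hypothetical.

```python
from collections import Counter

# Hypothetical toy sentence as (id, upos, head) triples; head 0 marks the root.
TOKENS = [
    (1, "AUX", 0),
    (2, "NOUN", 1),
    (3, "AUX", 4),
    (4, "VERB", 1),
    (5, "PUNCT", 1),
]


def child_degree_buckets(tokens, upos):
    """Bucket nodes with the given UPOS tag by their number of children."""
    # Number of children per node id: how often each id appears as a head.
    children = Counter(head for _, _, head in tokens)
    buckets = Counter()
    for tok_id, tag, _ in tokens:
        if tag != upos:
            continue
        n = children[tok_id]
        if n == 0:
            buckets["leaf"] += 1
        elif n == 1:
            buckets["one"] += 1
        elif n == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return dict(buckets)


buckets = child_degree_buckets(TOKENS, "AUX")
print(buckets)
```

For a whole treebank, the same counting would be repeated per sentence, since token ids restart at 1 in each sentence. On this page the four buckets (8909, 46, 59 and 141) add up to the 9155 AUX tokens.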

Children of AUX nodes are attached using 17 different relations: xcomp (142; 19% instances), nmod (121; 16% instances), punct (101; 14% instances), ccomp (80; 11% instances), nsubj (73; 10% instances), case (59; 8% instances), cc (54; 7% instances), obj (53; 7% instances), amod (14; 2% instances), cop (14; 2% instances), mark (11; 1% instances), advmod (7; 1% instances), dep (3; 0% instances), nmod:poss (2; 0% instances), aux (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: VERB (222; 30% instances), NOUN (136; 18% instances), PUNCT (101; 14% instances), PRON (78; 11% instances), ADP (59; 8% instances), CCONJ (54; 7% instances), AUX (24; 3% instances), ADJ (20; 3% instances), PROPN (18; 2% instances), ADV (9; 1% instances), SCONJ (9; 1% instances), DET (4; 1% instances), PART (2; 0% instances), NUM (1; 0% instances)