Treebank Statistics: UD_Arabic-NYUAD: POS Tags: AUX
There are 2 AUX
lemmas (0%), 1 AUX
types (6%) and 9155 AUX
tokens (1%).
Out of 16 observed tags, the rank of AUX
is: 15 in number of lemmas, 4 in number of types and 12 in number of tokens.
The 10 most frequent AUX
lemmas: _، s
The 10 most frequent AUX
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), s (AUX 2274, VERB 7, NOUN 1, X 1)
The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91743, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49161, PRON 43495, ADV 24067, SCONJ 16614, NUM 15377, AUX 9155, DET 6363, PART 2521, X 927, INTJ 56)
- _
- NOUN 221899: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 91743: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 75266: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 69355: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 57421: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 55469: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 49161: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 43495: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 24067: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 16614: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 15377: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 9155: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 6363: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 2521: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- X 927: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 56: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of AUX
is 0.500000 (the average of all parts of speech is 0.003044).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
The 2nd highest number of forms (1) was observed with the lemma “s”: _.
AUX
occurs with 9 features: Gender (4101; 45% instances), Number (4101; 45% instances), Person (4088; 45% instances), Voice (4076; 45% instances), Mood (4056; 44% instances), Definite (15; 0% instances), Case (12; 0% instances), Aspect (6; 0% instances), AdpType (5; 0% instances)
AUX
occurs with 20 feature-value pairs: AdpType=Prep
, Aspect=Imp
, Aspect=Perf
, Case=Acc
, Case=Gen
, Case=Nom
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, Mood=Ind
, Mood=Jus
, Mood=Sub
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Voice=Act
AUX
occurs with 44 feature combinations.
The most frequent feature combination is _
(5051 tokens).
Examples: _
Relations
AUX
nodes are attached to their parents using 6 different relations: cop (5289; 58% instances), aux (3574; 39% instances), obj (182; 2% instances), root (41; 0% instances), iobj (39; 0% instances), nmod (30; 0% instances)
Parents of AUX
nodes belong to 15 different parts of speech: VERB (5164; 56% instances), NOUN (2112; 23% instances), ADJ (755; 8% instances), PRON (468; 5% instances), ADV (302; 3% instances), CCONJ (115; 1% instances), PROPN (78; 1% instances), DET (54; 1% instances), (41; 0% instances), AUX (24; 0% instances), NUM (20; 0% instances), X (12; 0% instances), PART (8; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
8909 (97%) AUX
nodes are leaves.
46 (1%) AUX
nodes have one child.
59 (1%) AUX
nodes have two children.
141 (2%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 14.
Children of AUX
nodes are attached using 17 different relations: xcomp (142; 19% instances), nmod (121; 16% instances), punct (101; 14% instances), ccomp (80; 11% instances), nsubj (73; 10% instances), case (59; 8% instances), cc (54; 7% instances), obj (53; 7% instances), amod (14; 2% instances), cop (14; 2% instances), mark (11; 1% instances), advmod (7; 1% instances), dep (3; 0% instances), nmod:poss (2; 0% instances), aux (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances)
Children of AUX
nodes belong to 14 different parts of speech: VERB (222; 30% instances), NOUN (136; 18% instances), PUNCT (101; 14% instances), PRON (78; 11% instances), ADP (59; 8% instances), CCONJ (54; 7% instances), AUX (24; 3% instances), ADJ (20; 3% instances), PROPN (18; 2% instances), ADV (9; 1% instances), SCONJ (9; 1% instances), DET (4; 1% instances), PART (2; 0% instances), NUM (1; 0% instances)