Treebank Statistics: UD_Arabic-NYUAD: Features: Case
This feature is universal.
It occurs with 3 different values: Acc, Gen, Nom.
334414 tokens (45%) have a non-empty value of Case.
1 type (0%) occurs at least once with a non-empty value of Case.
230 lemmas (5%) occur at least once with a non-empty value of Case.
The feature is used with 16 part-of-speech tags (count; share of all tokens in the treebank): NOUN (213749; 29%), ADJ (65605; 9%), PRON (24351; 3%), ADV (16050; 2%), PROPN (10760; 1%), NUM (3387; 0%), ADP (137; 0%), PUNCT (126; 0%), CCONJ (73; 0%), VERB (60; 0%), DET (46; 0%), X (38; 0%), AUX (12; 0%), SCONJ (11; 0%), PART (8; 0%), INTJ (1; 0%).
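The per-tag counts above can in principle be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming standard 10-column CoNLL-U input; the helper names `parse_feats` and `case_counts_by_upos` and the `SAMPLE` sentence are illustrative and are not part of the treebank tooling.

```python
from collections import Counter, defaultdict

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Gen|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def case_counts_by_upos(conllu_text):
    """Count Case values per UPOS tag; tokens without Case count as 'EMPTY'."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        counts[upos][feats.get("Case", "EMPTY")] += 1
    return counts

# Hypothetical four-token sentence; word forms are '_' as in this treebank.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "_", "_", "NOUN", "_", "Case=Nom|Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["2", "_", "_", "NOUN", "_", "Case=Gen|Number=Sing", "1", "nmod:poss", "_", "_"],
    ["3", "_", "_", "ADJ", "_", "Case=Gen|Number=Sing", "2", "amod", "_", "_"],
    ["4", "_", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"],
])
counts = case_counts_by_upos(SAMPLE)
```

Summing the non-EMPTY entries of each counter gives the per-tag totals of the kind listed above.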
NOUN
213749 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (189550; 89%), Gender=Masc (147050; 69%).
NOUN tokens may have the following values of Case:
Acc (41522; 19% of non-empty Case): _
Gen (142652; 67% of non-empty Case): _
Nom (29575; 14% of non-empty Case): _
EMPTY (8150): _
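The percentages in the NOUN breakdown follow directly from the raw counts: each value's share is taken over the non-empty tokens, and the coverage figure is taken over non-empty plus EMPTY tokens. A small sketch of that arithmetic, using the numbers reported above (the function name `shares` is illustrative):

```python
# Counts copied from the NOUN breakdown above.
noun_case = {"Acc": 41522, "Gen": 142652, "Nom": 29575}
noun_empty = 8150

def shares(value_counts, empty):
    """Return (non-empty total, coverage, per-value share of non-empty)."""
    non_empty = sum(value_counts.values())
    coverage = non_empty / (non_empty + empty)
    per_value = {v: n / non_empty for v, n in value_counts.items()}
    return non_empty, coverage, per_value

non_empty, coverage, per_value = shares(noun_case, noun_empty)
```

The same calculation applies to every part-of-speech section on this page.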
ADJ
65605 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (62554; 95%), Definite=Def (43275; 66%), Gender=Masc (33608; 51%).
ADJ tokens may have the following values of Case:
Acc (13586; 21% of non-empty Case): _
Gen (40733; 62% of non-empty Case): _
Nom (11286; 17% of non-empty Case): _
EMPTY (3750): _
PRON
24351 PRON tokens (56% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: PronType=Prs (22577; 93%), Person=3 (21806; 90%), Definite=Def (21312; 88%), Number=Sing (19722; 81%), Gender=Masc (15770; 65%).
PRON tokens may have the following values of Case:
Acc (4778; 20% of non-empty Case): _
Gen (16699; 69% of non-empty Case): _
Nom (2874; 12% of non-empty Case): _
EMPTY (19144): _
ADV
16050 ADV tokens (67% of all ADV tokens) have a non-empty value of Case.
The most frequent other feature values with which ADV and Case co-occurred: Number=Sing (16050; 100%), Polarity=EMPTY (16050; 100%), Gender=Masc (16029; 100%), Definite=Com (15102; 94%).
ADV tokens may have the following values of Case:
Acc (13032; 81% of non-empty Case): _
Gen (2640; 16% of non-empty Case): _
Nom (378; 2% of non-empty Case): _
EMPTY (8017): _
PROPN
10760 PROPN tokens (19% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (10345; 96%), Gender=Masc (9152; 85%).
PROPN tokens may have the following values of Case:
Acc (2300; 21% of non-empty Case): _
Gen (6548; 61% of non-empty Case): _
Nom (1912; 18% of non-empty Case): _
EMPTY (46661): _
Case seems to be a lexical feature of PROPN. 100% of lemmas (214) occur with only one value of Case.
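One plausible way to test whether a feature behaves lexically is to count how many lemmas appear with exactly one value of it. The sketch below assumes tokens are available as `(lemma, feats_dict)` pairs; the function name `single_valued_lemma_share` and the sample data are hypothetical, and the exact criterion used to generate this page may differ.

```python
from collections import defaultdict

def single_valued_lemma_share(tokens, feature="Case"):
    """Return (lemmas seen with exactly one value, lemmas seen with any value)."""
    values = defaultdict(set)
    for lemma, feats in tokens:
        if feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical tokens: 'a' and 'c' have one Case value each, 'b' has two,
# and 'd' never carries Case, so it is excluded from both counts.
tokens = [
    ("a", {"Case": "Gen"}), ("a", {"Case": "Gen"}),
    ("b", {"Case": "Nom"}), ("b", {"Case": "Acc"}),
    ("c", {"Case": "Acc"}), ("d", {}),
]
result = single_valued_lemma_share(tokens)
```

A share of 100%, as reported for PROPN, would correspond to both numbers in the returned pair being equal.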
NUM
3387 NUM tokens (22% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (3198; 94%), Number=Sing (2986; 88%), Definite=Com (2440; 72%), Gender=Masc (2019; 60%).
NUM tokens may have the following values of Case:
Acc (951; 28% of non-empty Case): _
Gen (2114; 62% of non-empty Case): _
Nom (322; 10% of non-empty Case): _
EMPTY (11990): _
ADP
137 ADP tokens (0% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (128; 93%).
ADP tokens may have the following values of Case:
Acc (42; 31% of non-empty Case): _
Gen (80; 58% of non-empty Case): _
Nom (15; 11% of non-empty Case): _
EMPTY (91606): _
PUNCT
126 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Case.
PUNCT tokens may have the following values of Case:
Acc (14; 11% of non-empty Case): _
Gen (100; 79% of non-empty Case): _
Nom (12; 10% of non-empty Case): _
EMPTY (75140): _
CCONJ
73 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Case.
CCONJ tokens may have the following values of Case:
Acc (19; 26% of non-empty Case): _
Gen (48; 66% of non-empty Case): _
Nom (6; 8% of non-empty Case): _
EMPTY (49088): _
VERB
60 VERB tokens (0% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (60; 100%), Mood=EMPTY (60; 100%), Voice=EMPTY (60; 100%), Number=Sing (56; 93%), Person=EMPTY (51; 85%), Gender=Masc (49; 82%).
VERB tokens may have the following values of Case:
Acc (24; 40% of non-empty Case): _
Gen (21; 35% of non-empty Case): _
Nom (15; 25% of non-empty Case): _
EMPTY (55409): _
DET
46 DET tokens (1% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Definite=Ind (41; 89%), Number=Dual (39; 85%), Gender=Masc (25; 54%).
DET tokens may have the following values of Case:
Acc (33; 72% of non-empty Case): _
Gen (4; 9% of non-empty Case): _
Nom (9; 20% of non-empty Case): _
EMPTY (6317): _
X
38 X tokens (4% of all X tokens) have a non-empty value of Case.
The most frequent other feature values with which X and Case co-occurred: Mood=EMPTY (38; 100%), Person=EMPTY (38; 100%), Voice=EMPTY (38; 100%), Gender=Masc (35; 92%), Number=Dual (24; 63%).
X tokens may have the following values of Case:
Acc (25; 66% of non-empty Case): _
Gen (1; 3% of non-empty Case): _
Nom (12; 32% of non-empty Case): _
EMPTY (889): _
AUX
12 AUX tokens (0% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Mood=EMPTY (12; 100%), Voice=EMPTY (12; 100%), Number=Sing (11; 92%), Person=EMPTY (10; 83%), Gender=Masc (7; 58%).
AUX tokens may have the following values of Case:
Acc (3; 25% of non-empty Case): _
Gen (6; 50% of non-empty Case): _
Nom (3; 25% of non-empty Case): _
EMPTY (9143): _
SCONJ
11 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Case.
SCONJ tokens may have the following values of Case:
Acc (3; 27% of non-empty Case): _
Gen (4; 36% of non-empty Case): _
Nom (4; 36% of non-empty Case): _
EMPTY (16603): _
PART
8 PART tokens (0% of all PART tokens) have a non-empty value of Case.
PART tokens may have the following values of Case:
Acc (1; 13% of non-empty Case): _
Gen (6; 75% of non-empty Case): _
Nom (1; 13% of non-empty Case): _
EMPTY (2513): _
INTJ
1 INTJ token (2% of all INTJ tokens) has a non-empty value of Case.
INTJ tokens may have the following values of Case:
Gen (1; 100% of non-empty Case): _
EMPTY (55): _
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (48285; 88%),
NOUN –[nmod:poss]–> NOUN (36195; 66%),
NOUN –[obj]–> NOUN (29062; 65%),
NOUN –[nmod:poss]–> PRON (9419; 61%),
NOUN –[nmod]–> NOUN (5836; 51%),
ADJ –[obj]–> ADJ (2236; 91%),
NOUN –[iobj]–> NOUN (1515; 53%),
ADJ –[amod]–> ADJ (1020; 78%),
NOUN –[nmod:poss]–> ADJ (678; 57%),
NOUN –[nsubj]–> PRON (669; 60%).
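Agreement counts like those above can be sketched as a pass over each dependency tree, comparing the Case of every child with that of its parent. The code below is a minimal illustration, not the tool that generated this page: the token dictionaries, the function name `case_agreement`, and the choice to take percentages only over pairs where both nodes carry Case are all assumptions.

```python
from collections import Counter

def case_agreement(sentence):
    """Count parent-child pairs per (parent UPOS, deprel, child UPOS).

    sentence: list of dicts with keys id, head, upos, deprel, case
    (case is None when the token has no Case value).
    Returns (agree, total) counters over pairs where both nodes have Case.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["case"] is None or parent["case"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["case"] == child["case"]:
            agree[key] += 1
    return agree, total

# Hypothetical sentence: one agreeing and one disagreeing amod pair.
sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "case": "Gen"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "case": "Gen"},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "case": "Acc"},
    {"id": 4, "head": 1, "upos": "PUNCT", "deprel": "punct", "case": None},
]
agree, total = case_agreement(sentence)
```

Dividing `agree[key]` by `total[key]` gives an agreement rate per relation under the stated assumption about the denominator.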