This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: POS Tags: PROPN

There are 4807 PROPN lemmas (91%), 1 PROPN type (6%) and 57421 PROPN tokens (8%). Out of 16 observed tags, PROPN ranks 1st in number of lemmas, 12th in number of types and 5th in number of tokens.

The 10 most frequent PROPN lemmas: _، TBupdate، bry، bwls، Aljmyl، lArsn، EAlyh، fAdy، myqAty، Almr

The 10 most frequent PROPN types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 221327, PUNCT 71973, ADJ 68841, ADP 62617, VERB 55127, PROPN 48391, ADV 23955, SCONJ 15652, NUM 15105, PRON 12926, AUX 6881, DET 6354, CCONJ 3889, PART 1501, X 380, INTJ 56), TBupdate (NOUN 408, ADJ 340, VERB 268, X 190, PROPN 69, PUNCT 15, ADP 1, SCONJ 1), bHmdwn (PROPN 15, NOUN 1), w (CCONJ 43819, SCONJ 235, ADP 42, NOUN 41, VERB 40, ADJ 14, PRON 12, PROPN 9, DET 4, PART 3, NUM 2, PUNCT 2, X 2), “ (PUNCT 207, ADP 6, PROPN 6, CCONJ 5, NOUN 5, ADJ 2, PART 2, ADV 1, VERB 1), lA (ADV 100, PROPN 6, ADP 1, PART 1), y (PRON 595, PROPN 5, VERB 2, ADJ 1, ADP 1, PUNCT 1, SCONJ 1), Ay (PROPN 4, NOUN 2), , (PUNCT 254, CCONJ 68, NOUN 9, ADJ 8, ADP 7, NUM 5, SCONJ 4, VERB 4, PRON 3, PROPN 3, ADV 2, DET 1, PART 1), . (PUNCT 312, ADP 3, CCONJ 3, NOUN 3, PROPN 3, VERB 2, DET 1)

The 10 most frequent ambiguous types: _ (NOUN 221899, ADP 91743, PUNCT 75266, ADJ 69355, PROPN 57421, VERB 55469, CCONJ 49161, PRON 43495, ADV 24067, SCONJ 16614, NUM 15377, AUX 9155, DET 6363, PART 2521, X 927, INTJ 56)

Morphology

The form / lemma ratio of PROPN is 0.000208 (the average of all parts of speech is 0.003044).
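The reported value is consistent with dividing the number of PROPN types by the number of PROPN lemmas given above (1 and 4807). The following is a minimal sketch under that assumption; the function name `form_lemma_ratio` is illustrative and not part of any UD tool.

```python
def form_lemma_ratio(num_forms: int, num_lemmas: int) -> float:
    """Return distinct forms per distinct lemma (assumed definition)."""
    if num_lemmas <= 0:
        raise ValueError("num_lemmas must be positive")
    return num_forms / num_lemmas

# Counts taken from this page: 1 PROPN type, 4807 PROPN lemmas.
propn_ratio = form_lemma_ratio(1, 4807)
print(f"{propn_ratio:.6f}")  # 0.000208
```

The ratio is this low because all word forms in this treebank appear as `_`, so every lemma maps to the same single form.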

The 1st highest number of forms (1) was observed with the lemma “"”: _.

The 2nd highest number of forms (1) was observed with the lemma “$AbAlykyn”: _.

The 3rd highest number of forms (1) was observed with the lemma “$AbAny”: _.

PROPN occurs with 10 features: Gender (54272; 95% instances), Number (54272; 95% instances), Definite (53581; 93% instances), Case (10760; 19% instances), Person (704; 1% instances), Voice (693; 1% instances), Mood (671; 1% instances), AdpType (75; 0% instances), Polarity (6; 0% instances), Aspect (4; 0% instances)

PROPN occurs with 22 feature-value pairs: AdpType=Prep, Aspect=Perf, Case=Acc, Case=Gen, Case=Nom, Definite=Com, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Mood=Ind, Mood=Jus, Mood=Sub, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Voice=Act, Voice=Pass

PROPN occurs with 106 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (36200 tokens). Examples: _
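Feature and feature-combination counts like those above can be derived from the FEATS column of a CoNLL-U file. The sketch below assumes the standard `Name=Value|Name=Value` encoding with `_` for "no features"; the three sample strings are invented for illustration and are not taken from the treebank.

```python
from collections import Counter

def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values for three PROPN tokens.
feats_column = [
    "Definite=Ind|Gender=Masc|Number=Sing",
    "Case=Gen|Definite=Def|Gender=Fem|Number=Sing",
    "_",
]

feature_counts = Counter()  # how many tokens carry each feature
combo_counts = Counter()    # how often each full combination occurs
for feats in feats_column:
    feature_counts.update(parse_feats(feats).keys())
    combo_counts[feats] += 1

print(feature_counts["Gender"], feature_counts["Case"])  # 2 1
```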

Relations

PROPN nodes are attached to their parents using 10 different relations: flat (15884; 28% instances), nmod (12664; 22% instances), obj (10705; 19% instances), nmod:poss (10426; 18% instances), nsubj (4545; 8% instances), root (1576; 3% instances), iobj (1508; 3% instances), appos (107; 0% instances), acl (4; 0% instances), dep (2; 0% instances)
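The percentages in this list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of PROPN tokens, rounded to the nearest whole percent; under that assumption the counts sum to 57421 and the rounded shares match the figures shown.

```python
# Counts copied from the relation list above.
relation_counts = {
    "flat": 15884, "nmod": 12664, "obj": 10705, "nmod:poss": 10426,
    "nsubj": 4545, "root": 1576, "iobj": 1508, "appos": 107,
    "acl": 4, "dep": 2,
}

total = sum(relation_counts.values())  # one incoming relation per PROPN token

def share(count: int, total: int) -> int:
    """Percentage of `total`, rounded to the nearest integer (assumed rounding)."""
    return round(100 * count / total)

shares = {rel: share(n, total) for rel, n in relation_counts.items()}
print(total, shares["flat"], shares["appos"])  # 57421 28 0
```

Relations shown with 0% are therefore rare rather than absent: `appos` accounts for about 0.2% of PROPN tokens.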

Parents of PROPN nodes belong to 14 different parts of speech (counting the empty category of root nodes, which have no parent): NOUN (22642; 39% instances), PROPN (21401; 37% instances), VERB (8486; 15% instances), ADV (2025; 4% instances), no parent, i.e. root (1576; 3% instances), ADJ (936; 2% instances), PRON (117; 0% instances), NUM (91; 0% instances), X (45; 0% instances), CCONJ (38; 0% instances), PART (34; 0% instances), AUX (18; 0% instances), DET (10; 0% instances), SCONJ (2; 0% instances)

28754 (50%) PROPN nodes are leaves.

14848 (26%) PROPN nodes have one child.

7110 (12%) PROPN nodes have two children.

6709 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 36.
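The leaf and child-degree figures above come from counting, for each PROPN node, how many other nodes name it as their HEAD. The sketch below shows one way to compute this from CoNLL-U text, assuming the standard ten tab-separated columns; `child_degrees` and `row` are illustrative helpers, and the three-token sentence is a toy example rather than treebank data.

```python
from collections import Counter

def child_degrees(conllu_text: str, upos: str = "PROPN") -> Counter:
    """Map child count -> number of nodes with the given UPOS having that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic tokens only: skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        children = Counter(r[6] for r in rows)  # column 7 is HEAD
        for r in rows:
            if r[3] == upos:                    # column 4 is UPOS
                degrees[children[r[0]]] += 1
    return degrees

def row(idx: int, upos: str, head: int, deprel: str) -> str:
    """Build one CoNLL-U token line with unused columns left as '_'."""
    return "\t".join([str(idx), "_", "_", upos, "_", "_", str(head), deprel, "_", "_"])

sample = "\n".join([
    "# toy sentence, not from the treebank",
    row(1, "PROPN", 0, "root"),
    row(2, "PROPN", 1, "flat"),
    row(3, "PUNCT", 1, "punct"),
])

degrees = child_degrees(sample)
print(degrees[0], degrees[2], max(degrees))  # 1 1 2
```

Here `degrees[0]` is the number of leaves and `max(degrees)` is the highest child degree.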

Children of PROPN nodes are attached using 19 different relations: flat (15884; 28% instances), punct (12516; 22% instances), case (7975; 14% instances), nmod (4748; 8% instances), obj (4098; 7% instances), cc (3512; 6% instances), nummod (2309; 4% instances), amod (1674; 3% instances), ccomp (1314; 2% instances), mark (873; 2% instances), nmod:poss (410; 1% instances), xcomp (403; 1% instances), advmod (177; 0% instances), nsubj (172; 0% instances), cop (71; 0% instances), iobj (36; 0% instances), det (24; 0% instances), aux (7; 0% instances), discourse (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (21401; 38% instances), PUNCT (12516; 22% instances), ADP (7980; 14% instances), CCONJ (3516; 6% instances), NOUN (2907; 5% instances), NUM (2326; 4% instances), VERB (1717; 3% instances), ADJ (1708; 3% instances), SCONJ (828; 1% instances), PRON (760; 1% instances), ADV (342; 1% instances), AUX (78; 0% instances), PART (53; 0% instances), DET (36; 0% instances), X (35; 0% instances), INTJ (1; 0% instances)