
This page pertains to UD version 2.

Treebank Statistics: UD_Sanskrit-Vedic: POS Tags: ADJ

There are 3718 ADJ lemmas (27%), 7467 ADJ types (20%) and 18315 ADJ tokens (9%). Out of 13 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.
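The three counts above can be reproduced from the FORM, LEMMA and UPOS columns of a CoNLL-U file. The following is a minimal sketch under stated assumptions: the rows are an invented toy sample (not taken from the treebank), and the helper names pos_counts and rank are hypothetical. It assumes that "lemmas" and "types" mean distinct lemmas and distinct word forms per tag, and "tokens" means the number of occurrences.

```python
from collections import defaultdict

# Hypothetical toy rows (FORM, LEMMA, UPOS); invented for illustration only.
ROWS = [
    ("uttaram", "uttara", "ADJ"),
    ("uttaraḥ", "uttara", "ADJ"),
    ("priyam", "priya", "ADJ"),
    ("priyam", "priya", "NOUN"),
    ("agniḥ", "agni", "NOUN"),
    ("agnim", "agni", "NOUN"),
    ("agnim", "agni", "NOUN"),
]


def pos_counts(rows):
    """Return {UPOS: (distinct lemmas, distinct forms, token count)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, index):
    """1-based rank of a tag by lemmas (index 0), types (1) or tokens (2)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1


counts = pos_counts(ROWS)
print(counts["ADJ"])            # (lemmas, types, tokens) for ADJ
print(rank(counts, "ADJ", 2))   # rank of ADJ by token count
```

In the toy sample the form priyam is counted once under ADJ and once under NOUN, which is the same situation the ambiguous-type list on this page describes.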

The 10 most frequent ADJ lemmas: uttara, dakṣiṇa, viśva, prāñc, prathama, mahat, udañc, priya, uttama, pratyañc

The 10 most frequent ADJ types: _, dakṣiṇam, priyam, udak, prathamaḥ, prāc, uttaram, viśva, uru, bṛhat

The 10 most frequent ambiguous lemmas: uttara (ADJ 362, NOUN 2), dakṣiṇa (ADJ 249, NOUN 1), viśva (ADJ 245, DET 237, PRON 3, NOUN 2), prāñc (ADJ 233, ADP 1), udañc (ADJ 183, VERB 3, ADP 1), priya (ADJ 176, NOUN 3), uttama (ADJ 144, NOUN 1), pratyañc (ADJ 135, ADP 1), amṛta (ADJ 127, NOUN 101), maya (ADJ 109, NOUN 19)

The 10 most frequent ambiguous types: _ (NOUN 2255, ADJ 407, CCONJ 331, SCONJ 216, NUM 186, PRON 124, VERB 106, INTJ 86, ADP 73, ADV 54, DET 14), dakṣiṇam (ADJ 77, NOUN 3), priyam (ADJ 75, NOUN 1), uttaram (ADJ 62, NOUN 1), viśva (ADJ 60, DET 50), bṛhat (ADJ 54, NOUN 19), viśve (ADJ 50, DET 26), mayaḥ (ADJ 48, NOUN 24), samānam (ADJ 43, DET 1), prāñcam (ADJ 39, ADP 1)

Morphology

The form / lemma ratio of ADJ is 2.008338 (the average of all parts of speech is 2.674382).
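The ADJ ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas reported earlier on this page. Whether the page generator computes it exactly this way is an assumption; the short check below only shows that the published figures agree.

```python
# Counts copied from the summary line for ADJ on this page.
adj_types = 7467
adj_lemmas = 3718

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")  # prints 2.008338
```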

The 1st highest number of forms (29) was observed with the lemma “uttara”: _, uttara, uttaraiḥ, uttaram, uttarasmin, uttarasmāt, uttarasya, uttarasyām, uttarau, uttarayoḥ, uttarayā, uttaraḥ, uttare, uttarebhyaḥ, uttarena, uttareṇa, uttareṣu, uttarā, uttarābhiḥ, uttarābhyām, uttarām, uttarān, uttarāsu, uttarāt, uttarāyai, uttarāyām, uttarāḥ, uttarāṇi, uttarāṇām.

The 2nd highest number of forms (20) was observed with the lemma “dakṣiṇa”: _, dakṣiṇa, dakṣiṇaiḥ, dakṣiṇam, dakṣiṇasya, dakṣiṇasyām, dakṣiṇau, dakṣiṇayā, dakṣiṇaḥ, dakṣiṇe, dakṣiṇena, dakṣiṇeṣām, dakṣiṇā, dakṣiṇām, dakṣiṇān, dakṣiṇāt, dakṣiṇāyai, dakṣiṇāyām, dakṣiṇāyāḥ, dakṣiṇāḥ.

The 3rd highest number of forms (20) was observed with the lemma “prathama”: _, prathama, prathamam, prathamasya, prathamasyāḥ, prathamau, prathamayā, prathamaḥ, prathame, prathamena, prathamā, prathamām, prathamāni, prathamāsaḥ, prathamāya, prathamāyai, prathamāyām, prathamāyāḥ, prathamāḥ, prathamāṁ.

ADJ occurs with 4 features: Case (16621; 91% instances), Gender (16621; 91% instances), Number (16621; 91% instances), Compound (1694; 9% instances)

ADJ occurs with 15 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Compound=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

ADJ occurs with 66 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (3863 tokens). Examples: prathamaḥ, mayaḥ, vid, pratyaṅ, ūrdhvaḥ, priyaḥ, hā, śuciḥ, dāḥ, mahān
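Features, feature-value pairs and feature combinations are three ways of counting the same FEATS column. The sketch below illustrates the distinction on an invented list of FEATS strings (not treebank data); parse_feats is a hypothetical helper that assumes the standard CoNLL-U convention of "|"-separated Name=Value pairs, with "_" for an empty feature set.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of four ADJ tokens; invented for illustration.
FEATS = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Acc|Gender=Neut|Number=Sing",
    "Compound=Yes",
    "Case=Nom|Gender=Masc|Number=Sing",
]

feature_counts = Counter()  # e.g. Case
pair_counts = Counter()     # e.g. Case=Nom
combo_counts = Counter()    # e.g. Case=Nom|Gender=Masc|Number=Sing

for feats in FEATS:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{name}={value}" for name, value in parsed.items())
    combo_counts[feats] += 1

print(feature_counts)
print(len(pair_counts))
print(combo_counts.most_common(1))
```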

Relations

ADJ nodes are attached to their parents using 51 different relations: amod (4257; 23% instances), conj (2436; 13% instances), root (1703; 9% instances), acl (1608; 9% instances), flat (1474; 8% instances), obj (803; 4% instances), nsubj (793; 4% instances), xcomp (489; 3% instances), nmod (451; 2% instances), advcl (440; 2% instances), orphan (402; 2% instances), xcomp:result (371; 2% instances), vocative (293; 2% instances), advmod (283; 2% instances), obl (269; 1% instances), ccomp (234; 1% instances), acl:relcl (189; 1% instances), compound:coord (186; 1% instances), acl:dpct (161; 1% instances), obl:instr (159; 1% instances), acl:attr (121; 1% instances), advcl:dpct (114; 1% instances), advcl:ccomp (110; 1% instances), parataxis (110; 1% instances), iobj (90; 0% instances), nmod:appos (89; 0% instances), obl:goal (68; 0% instances), obl:manner (65; 0% instances), advcl:manner (63; 0% instances), appos (62; 0% instances), obl:tmod (46; 0% instances), advcl:caus (38; 0% instances), compound:name (38; 0% instances), obl:lmod (37; 0% instances), advcl:cond (33; 0% instances), obl:agent (32; 0% instances), obl:source (28; 0% instances), obl:soc (24; 0% instances), csubj (23; 0% instances), advcl:tcl (20; 0% instances), advcl:fin (18; 0% instances), dislocated (18; 0% instances), obl:grad (18; 0% instances), obl:benef (14; 0% instances), advcl:concess (9; 0% instances), obl:path (8; 0% instances), compound (6; 0% instances), acl:ptcp (5; 0% instances), acl:crel (4; 0% instances), ccomp:rel (2; 0% instances), advcl:lcl (1; 0% instances)

Parents of ADJ nodes belong to 13 different categories (12 parts of speech plus the artificial root, listed here as "no parent"): NOUN (7782; 42% instances), VERB (5226; 29% instances), ADJ (2482; 14% instances), no parent, i.e. root (1703; 9% instances), PRON (825; 5% instances), ADV (109; 1% instances), NUM (87; 0% instances), PART (32; 0% instances), ADP (27; 0% instances), CCONJ (21; 0% instances), INTJ (12; 0% instances), DET (5; 0% instances), SCONJ (4; 0% instances)
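Both the relation counts and the parent part-of-speech counts come from following each ADJ node's HEAD pointer. A minimal sketch, assuming tokens are given as (ID, UPOS, HEAD, DEPREL) tuples within one sentence; the sample sentence is invented, and the function name incoming and the "(root)" label for HEAD = 0 are hypothetical choices.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, UPOS, HEAD, DEPREL); not treebank data.
TOKENS = [
    (1, "ADJ", 2, "amod"),
    (2, "NOUN", 3, "nsubj"),
    (3, "ADJ", 0, "root"),
    (4, "ADJ", 3, "conj"),
]


def incoming(tokens, upos):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    tag_of = {tok_id: tag for tok_id, tag, _, _ in tokens}
    relations = Counter()
    parent_tags = Counter()
    for _, tag, head, deprel in tokens:
        if tag != upos:
            continue
        relations[deprel] += 1
        # HEAD 0 has no token, so the parent is recorded as "(root)".
        parent_tags[tag_of.get(head, "(root)")] += 1
    return relations, parent_tags


relations, parent_tags = incoming(TOKENS, "ADJ")
print(relations)
print(parent_tags)
```

Nodes whose HEAD is 0 have no parent token, which would explain why the number of parentless ADJ nodes in the list above (1703) equals the number of ADJ nodes attached as root.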

9855 (54%) ADJ nodes are leaves.

5541 (30%) ADJ nodes have one child.

1632 (9%) ADJ nodes have two children.

1287 (7%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 14.
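The leaf, one-child, two-children and three-or-more figures form a histogram of child degree. The sketch below shows one way such a histogram could be computed from the HEAD column; the toy sentence is invented and child_degree_histogram is a hypothetical helper, not the code used to generate this page.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, UPOS, HEAD); invented for illustration.
TOKENS = [
    (1, "ADJ", 2),
    (2, "NOUN", 3),
    (3, "ADJ", 0),
    (4, "PART", 3),
    (5, "ADJ", 3),
]


def child_degree_histogram(tokens, upos):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, or 3+."""
    n_children = Counter(head for _, _, head in tokens if head != 0)
    hist = Counter()
    for tok_id, tag, _ in tokens:
        if tag == upos:
            hist[min(n_children[tok_id], 3)] += 1  # bucket 3 means "3 or more"
    return hist


hist = child_degree_histogram(TOKENS, "ADJ")
print(hist)

# Consistency check on the figures reported above: the four buckets
# add up to the total number of ADJ tokens.
page_total = 9855 + 5541 + 1632 + 1287
print(page_total)  # prints 18315
```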

Children of ADJ nodes are attached using 58 different relations: conj (2200; 16% instances), nsubj (1682; 12% instances), flat (1461; 11% instances), discourse (787; 6% instances), nmod (751; 6% instances), obj (716; 5% instances), orphan (678; 5% instances), advmod (637; 5% instances), cop (577; 4% instances), mark (540; 4% instances), cc (475; 3% instances), obl (386; 3% instances), acl (262; 2% instances), det (246; 2% instances), amod (177; 1% instances), compound:coord (171; 1% instances), obl:grad (153; 1% instances), parataxis (133; 1% instances), advcl (120; 1% instances), iobj (109; 1% instances), nummod (108; 1% instances), vocative (106; 1% instances), obl:lmod (82; 1% instances), nmod:appos (79; 1% instances), obl:manner (76; 1% instances), case:sim (71; 1% instances), acl:relcl (66; 0% instances), obl:source (65; 0% instances), case (58; 0% instances), obl:tmod (58; 0% instances), obl:goal (50; 0% instances), advcl:cond (49; 0% instances), obl:soc (47; 0% instances), ccomp (45; 0% instances), csubj (44; 0% instances), obl:benef (37; 0% instances), acl:dpct (34; 0% instances), obl:instr (32; 0% instances), xcomp (28; 0% instances), advcl:manner (27; 0% instances), mark:sim (27; 0% instances), appos (24; 0% instances), advcl:tcl (19; 0% instances), acl:attr (18; 0% instances), advcl:caus (16; 0% instances), acl:ptcp (14; 0% instances), advcl:fin (12; 0% instances), dislocated (8; 0% instances), obl:agent (8; 0% instances), advcl:concess (7; 0% instances), compound (6; 0% instances), xcomp:result (5; 0% instances), advcl:ccomp (4; 0% instances), advcl:dpct (4; 0% instances), aux (2; 0% instances), obl:path (2; 0% instances), advcl:lcl (1; 0% instances), fixed (1; 0% instances)

Children of ADJ nodes belong to 13 different parts of speech: NOUN (4917; 36% instances), ADJ (2482; 18% instances), PART (1663; 12% instances), PRON (1363; 10% instances), VERB (1072; 8% instances), ADV (598; 4% instances), AUX (579; 4% instances), CCONJ (486; 4% instances), NUM (178; 1% instances), SCONJ (112; 1% instances), ADP (92; 1% instances), INTJ (33; 0% instances), DET (26; 0% instances)