home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Telugu_English-TECT: POS Tags: NOUN

There are 1 NOUN lemmas (10%), 83 NOUN types (41%) and 122 NOUN tokens (27%). Out of 10 observed tags, the rank of NOUN is: 5 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: work, book, kòtta, lunch, song, Amerikālō, Iṇṭini, brother, chocolate, cinema

The 10 most frequent ambiguous lemmas: _ (NOUN 122, VERB 99, PUNCT 85, PRON 78, DET 20, PROPN 20, ADV 14, ADJ 9, ADP 6, NUM 3)

The 10 most frequent ambiguous types: telugu (NOUN 2, PROPN 2), ikkaḍiki (ADV 2, NOUN 1), kamala (NOUN 1, PROPN 1), rāmu (PROPN 6, NOUN 1), small (ADJ 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 83.000000 (the average of all parts of speech is 20.300000).

The 1st highest number of forms (83) was observed with the lemma “_”: Amerikālō, Hyderabad, Iṇṭini, Mumbai, Mūḍu, alarm, aunty, book, books, brother, burger, cake, cheque, chocolate, cigarette, cinema, city, coffee, cows, cupboard, curry, dance, days, dinner, dūraṁ, friend, hello, help, hit, husband, ikkaḍiki, inṭlo, iṁṭlo, job, joke, juice, kamala, kòtta, lights, lunch, magazine, meat, milk, mind, money, movie, music, novel, novels, officer, pizza, poem, poetry, pustakāla, pustakālā, pālu, rain, reṇḍu, rice, rooms, rāmu, shop, sister, small, son, song, songs, story, sukhaṁ, sweet, tall, telugu, time, tomorrow, town, tubs, uncle, water, work, yesterday, Āvulu, ūriki, ūḷḷo.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 8 different relations: obj (47; 39% instances), nsubj (19; 16% instances), root (17; 14% instances), nmod (15; 12% instances), obl (15; 12% instances), compound (5; 4% instances), nummod (2; 2% instances), xcomp (2; 2% instances)

Parents of NOUN nodes belong to 4 different parts of speech: VERB (88; 72% instances), (17; 14% instances), NOUN (15; 12% instances), PRON (2; 2% instances)

66 (54%) NOUN nodes are leaves.

33 (27%) NOUN nodes have one child.

11 (9%) NOUN nodes have two children.

12 (10%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.

Children of NOUN nodes are attached using 12 different relations: nmod (25; 27% instances), det (20; 22% instances), punct (12; 13% instances), nsubj (10; 11% instances), advmod (6; 6% instances), case (5; 5% instances), compound (5; 5% instances), acl (2; 2% instances), amod (2; 2% instances), nummod (2; 2% instances), obj (2; 2% instances), xcomp (2; 2% instances)

Children of NOUN nodes belong to 10 different parts of speech: DET (20; 22% instances), PRON (18; 19% instances), NOUN (15; 16% instances), PUNCT (12; 13% instances), VERB (9; 10% instances), ADJ (6; 6% instances), ADP (5; 5% instances), ADV (4; 4% instances), NUM (2; 2% instances), PROPN (2; 2% instances)