home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tamil-MWTT: POS Tags: DET

There are 7 DET lemmas (1%), 11 DET types (1%) and 57 DET tokens (2%). Out of 13 observed tags, the rank of DET is: 9 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: இந்த, அந்த, எல்லாம், இன்னொரு, சில, பல, வேறொரு

The 10 most frequent DET types: இந்த, அந்த, இந்தத், எல்லாம், *இந்த, அந்தப், இந்தப், இன்னொரு, சில, பல

The 10 most frequent ambiguous lemmas: எல்லாம் (DET 3, ADV 1), சில (PRON 2, ADJ 1, DET 1), பல (DET 1, PRON 1)

The 10 most frequent ambiguous types: எல்லாம் (DET 3, ADV 1), சில (ADJ 1, DET 1)

Morphology

The form / lemma ratio of DET is 1.571429 (the average of all parts of speech is 1.743028).

The 1st highest number of forms (4) was observed with the lemma “இந்த”: *இந்த, இந்த, இந்தத், இந்தப்.

The 2nd highest number of forms (2) was observed with the lemma “அந்த”: அந்த, அந்தப்.

The 3rd highest number of forms (1) was observed with the lemma “இன்னொரு”: இன்னொரு.

DET occurs with 3 features: Number (2; 4% instances), NumType (1; 2% instances), Person (1; 2% instances)

DET occurs with 3 feature-value pairs: NumType=Card, Number=Sing, Person=3

DET occurs with 4 feature combinations. The most frequent feature combination is _ (54 tokens). Examples: இந்த, அந்த, இந்தத், எல்லாம், *இந்த, அந்தப், இந்தப், சில, பல

Relations

DET nodes are attached to their parents using 2 different relations: det (54; 95% instances), nsubj (3; 5% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (47; 82% instances), ADJ (4; 7% instances), VERB (3; 5% instances), NUM (2; 4% instances), ADV (1; 2% instances)

56 (98%) DET nodes are leaves.

1 (2%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 1 different relations: case (1; 100% instances)

Children of DET nodes belong to 1 different parts of speech: ADP (1; 100% instances)