Treebank Statistics: UD_Uyghur-UDT: POS Tags: ADV
There are 104 ADV
lemmas (3%), 161 ADV
types (1%) and 982 ADV
tokens (2%).
Out of 16 observed tags, the rank of ADV
is: 4 in number of lemmas, 6 in number of types and 7 in number of tokens.
The 10 most frequent ADV
lemmas: _، يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل
The 10 most frequent ADV
types: يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن
The 10 most frequent ambiguous lemmas: _ (VERB 4560, NOUN 4224, PRON 479, PUNCT 434, ADJ 326, AUX 185, ADV 157, PART 119, NUM 75, CCONJ 72, ADP 51, INTJ 47, DET 28, X 27), شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17), قېتىم (ADV 28, DET 4), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), ئاخىر (NOUN 27, ADV 2), شۇنچىلىك (ADV 2, DET 1), بىللە (ADJ 19, ADV 1), نېرى (ADV 1, NOUN 1)
The 10 most frequent ambiguous types: شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17, PRON 7), قېتىم (ADV 28, DET 4), ئەمەس (AUX 30, ADV 10), كېيىن (NOUN 79, ADV 10), ئەمدى (NOUN 22, ADV 9), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), كىيىن (ADV 5, NOUN 2, VERB 1), بىردەمدىن (NOUN 6, ADV 4)
- شۇنداق
- قانداق
- قېتىم
- ئەمەس
- كېيىن
- ئەمدى
- شۇنچە
- مۇنداق
- كىيىن
- بىردەمدىن
Morphology
The form / lemma ratio of ADV
is 1.548077 (the average of all parts of speech is 4.088599).
The 1st highest number of forms (58) was observed with the lemma “_”: ئاخىرىغىچە, ئانچە-مۇنچە, ئوغرىلىقچە, ئەتىگەنلىكى, ئەجەپ, ئەمدى, بارا-بارا, بارىچە, بالدۇرلا, باياتىن, بىرئاز, بىردىنبىر, بىردەم, بىردەمدىن, بىرقەدەر, بۆلەكچىلا, بۇرۇن, بۇرۇننىڭ, بەكرەك, بەكلا, بەكمۇ, تاراملاپ, تولىمۇ, تېخىمۇ, تېخىچىلا, تەرلەپ, تەستىقلىنىشتا, خېلىلا, دۈم, دەس, راستتىنلا, زادىلا, شۇئان, قىشىچە, كىيىن, كۈنسايىن, كۈنسېرى, كېيىن, كېيىنلا, كېچىلەپ, كەچقۇرۇنلۇقى, لىققىدە, لەپىلدەپ, نېرى-بېرى, نېرىدا, پات-پات, پىيادىلەر, پېتى, ھازىردىن, ھازىرغىچە, ھازىرلا, ھازىرمۇ, ھۈپپىدە, ھېلىلا, ھېلىھەم, ھەدەپ, ھەرگىزمۇ, ھەقىقەتەنمۇ.
The 2nd highest number of forms (1) was observed with the lemma “ئاخىر”: ئاخىر.
The 3rd highest number of forms (1) was observed with the lemma “ئارىلاپ”: ئارىلاپ.
ADV
occurs with 1 features: PronType (48; 5% instances)
ADV
occurs with 1 feature-value pairs: PronType=Int
ADV
occurs with 2 feature combinations.
The most frequent feature combination is _
(934 tokens).
Examples: يەنە، شۇنداق، بەك، شۇڭا، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن، ناھايىتى
Relations
ADV
nodes are attached to their parents using 23 different relations: advmod (663; 68% instances), cc (67; 7% instances), nmod (40; 4% instances), amod (28; 3% instances), mark (21; 2% instances), obl (20; 2% instances), discourse (19; 2% instances), compound:redup (18; 2% instances), compound (17; 2% instances), advmod:emph (14; 1% instances), fixed (12; 1% instances), root (11; 1% instances), case (8; 1% instances), nsubj (8; 1% instances), parataxis (8; 1% instances), nmod:tmod (7; 1% instances), advcl (5; 1% instances), ccomp (5; 1% instances), conj (5; 1% instances), compound:lvc (2; 0% instances), dep (2; 0% instances), flat (1; 0% instances), xcomp (1; 0% instances)
Parents of ADV
nodes belong to 11 different parts of speech: VERB (626; 64% instances), NOUN (163; 17% instances), ADJ (121; 12% instances), NUM (23; 2% instances), ADV (18; 2% instances), PRON (13; 1% instances), (11; 1% instances), ADP (2; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), CCONJ (1; 0% instances)
800 (81%) ADV
nodes are leaves.
140 (14%) ADV
nodes have one child.
22 (2%) ADV
nodes have two children.
20 (2%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 6.
Children of ADV
nodes are attached using 24 different relations: punct (123; 49% instances), nsubj (19; 8% instances), nummod (17; 7% instances), fixed (13; 5% instances), compound (11; 4% instances), compound:redup (8; 3% instances), nmod (8; 3% instances), obl (8; 3% instances), advmod (7; 3% instances), det (7; 3% instances), cop (6; 2% instances), nmod:poss (5; 2% instances), case (4; 2% instances), amod (3; 1% instances), cc (3; 1% instances), advcl (2; 1% instances), discourse (2; 1% instances), acl (1; 0% instances), advmod:emph (1; 0% instances), aux (1; 0% instances), compound:lvc (1; 0% instances), nmod:tmod (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)
Children of ADV
nodes belong to 13 different parts of speech: PUNCT (123; 49% instances), NOUN (36; 14% instances), ADV (18; 7% instances), NUM (16; 6% instances), PRON (16; 6% instances), VERB (15; 6% instances), ADJ (7; 3% instances), AUX (7; 3% instances), ADP (6; 2% instances), DET (6; 2% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)