This page pertains to UD version 2.

Treebank Statistics: UD_Uyghur-UDT: POS Tags: ADV

There are 104 ADV lemmas (3%), 161 ADV types (1%) and 982 ADV tokens (2%). Out of 16 observed tags, the rank of ADV is: 4 in number of lemmas, 6 in number of types and 7 in number of tokens.

The 10 most frequent ADV lemmas: _، يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل

The 10 most frequent ADV types: يەنە، شۇنداق، بەك، شۇڭا، قانداق، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن

The 10 most frequent ambiguous lemmas: _ (VERB 4560, NOUN 4224, PRON 479, PUNCT 434, ADJ 326, AUX 185, ADV 157, PART 119, NUM 75, CCONJ 72, ADP 51, INTJ 47, DET 28, X 27), شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17), قېتىم (ADV 28, DET 4), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), ئاخىر (NOUN 27, ADV 2), شۇنچىلىك (ADV 2, DET 1), بىللە (ADJ 19, ADV 1), نېرى (ADV 1, NOUN 1)

The 10 most frequent ambiguous types: شۇنداق (ADV 47, DET 11), قانداق (ADV 30, DET 17, PRON 7), قېتىم (ADV 28, DET 4), ئەمەس (AUX 30, ADV 10), كېيىن (NOUN 79, ADV 10), ئەمدى (NOUN 22, ADV 9), شۇنچە (ADV 7, DET 3), مۇنداق (ADV 7, DET 6), كىيىن (ADV 5, NOUN 2, VERB 1), بىردەمدىن (NOUN 6, ADV 4)

Morphology

The form / lemma ratio of ADV is 1.548077 (the average of all parts of speech is 4.088599).
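The ratio above appears to be the number of ADV types divided by the number of ADV lemmas, both taken from the counts at the top of this page. A minimal sketch of that arithmetic (the variable names are illustrative, not part of any UD tool):

```python
# Counts reported on this page for ADV in UD_Uyghur-UDT.
adv_types = 161   # distinct word forms tagged ADV
adv_lemmas = 104  # distinct lemmas tagged ADV

# Assumption: the form / lemma ratio is simply types divided by lemmas.
form_lemma_ratio = adv_types / adv_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 1.548077
```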

The 1st highest number of forms (58) was observed with the lemma “_”: ئاخىرىغىچە, ئانچە-مۇنچە, ئوغرىلىقچە, ئەتىگەنلىكى, ئەجەپ, ئەمدى, بارا-بارا, بارىچە, بالدۇرلا, باياتىن, بىرئاز, بىردىنبىر, بىردەم, بىردەمدىن, بىرقەدەر, بۆلەكچىلا, بۇرۇن, بۇرۇننىڭ, بەكرەك, بەكلا, بەكمۇ, تاراملاپ, تولىمۇ, تېخىمۇ, تېخىچىلا, تەرلەپ, تەستىقلىنىشتا, خېلىلا, دۈم, دەس, راستتىنلا, زادىلا, شۇئان, قىشىچە, كىيىن, كۈنسايىن, كۈنسېرى, كېيىن, كېيىنلا, كېچىلەپ, كەچقۇرۇنلۇقى, لىققىدە, لەپىلدەپ, نېرى-بېرى, نېرىدا, پات-پات, پىيادىلەر, پېتى, ھازىردىن, ھازىرغىچە, ھازىرلا, ھازىرمۇ, ھۈپپىدە, ھېلىلا, ھېلىھەم, ھەدەپ, ھەرگىزمۇ, ھەقىقەتەنمۇ.

The 2nd highest number of forms (1) was observed with the lemma “ئاخىر”: ئاخىر.

The 3rd highest number of forms (1) was observed with the lemma “ئارىلاپ”: ئارىلاپ.

ADV occurs with 1 feature: PronType (48; 5% instances)

ADV occurs with 1 feature-value pair: PronType=Int

ADV occurs with 2 feature combinations. The most frequent feature combination is _ (934 tokens). Examples: يەنە، شۇنداق، بەك، شۇڭا، قېتىم، تېخى، دەرھال، دەل، ئىنتايىن، ناھايىتى
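The feature figures above can be cross-checked with a short sketch. It rebuilds the FEATS column of the ADV tokens from the reported counts (934 tokens without features, 48 with PronType=Int) instead of reading the treebank; `parse_feats` is a hypothetical helper written for this illustration.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'PronType=Int' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# FEATS values of ADV tokens, rebuilt from the counts on this page
# (an assumption for illustration; the real values live in the treebank files).
adv_feats = ["_"] * 934 + ["PronType=Int"] * 48

combinations = Counter(adv_feats)
features = Counter(name for f in adv_feats for name in parse_feats(f))

print(len(adv_feats))      # 982 ADV tokens in total
print(len(combinations))   # 2 feature combinations
print(features["PronType"], round(100 * features["PronType"] / len(adv_feats)))  # 48 5
```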

Relations

ADV nodes are attached to their parents using 23 different relations: advmod (663; 68% instances), cc (67; 7% instances), nmod (40; 4% instances), amod (28; 3% instances), mark (21; 2% instances), obl (20; 2% instances), discourse (19; 2% instances), compound:redup (18; 2% instances), compound (17; 2% instances), advmod:emph (14; 1% instances), fixed (12; 1% instances), root (11; 1% instances), case (8; 1% instances), nsubj (8; 1% instances), parataxis (8; 1% instances), nmod:tmod (7; 1% instances), advcl (5; 1% instances), ccomp (5; 1% instances), conj (5; 1% instances), compound:lvc (2; 0% instances), dep (2; 0% instances), flat (1; 0% instances), xcomp (1; 0% instances)

Parents of ADV nodes belong to 11 different parts of speech: VERB (626; 64% instances), NOUN (163; 17% instances), ADJ (121; 12% instances), NUM (23; 2% instances), ADV (18; 2% instances), PRON (13; 1% instances), no parent, i.e. the ADV node is the root (11; 1% instances), ADP (2; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), CCONJ (1; 0% instances)
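Counts like the relation and parent-tag distributions above can be derived from a CoNLL-U file by looking at the UPOS, HEAD and DEPREL columns. The following is a minimal sketch, not the script that generated this page: the two sample sentences are made up, and `read_sentences` and `adv_attachment_stats` are hypothetical helper names.

```python
from collections import Counter

# Made-up CoNLL-U sentences for illustration only; they are not taken from the treebank.
SAMPLE = "\n".join([
    "# text = placeholder",
    "1\tw1\tw1\tADV\t_\t_\t2\tadvmod\t_\t_",
    "2\tw2\tw2\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tw3\tw3\tADV\t_\t_\t4\tadvmod\t_\t_",
    "4\tw4\tw4\tADJ\t_\t_\t2\txcomp\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "1\tw5\tw5\tADV\t_\t_\t0\troot\t_\t_",
    "",
])

def read_sentences(text):
    """Yield each sentence as a list of 10-column token rows."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            if rows:
                yield rows
            rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword token ranges (1-2) and empty nodes (1.1).
            if cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows

def adv_attachment_stats(text, upos="ADV"):
    """Count DEPREL values and parent UPOS tags for all nodes with the given UPOS."""
    relations, parent_tags = Counter(), Counter()
    for rows in read_sentences(text):
        tag_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # HEAD 0 is the artificial root, which has no tag, so it is counted as "".
            parent_tags[tag_by_id.get(cols[6], "")] += 1
    return relations, parent_tags

relations, parent_tags = adv_attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_tags.most_common())
```

In this sketch a node attached to the artificial root gets an empty parent tag, which would explain an untagged entry in a parent distribution.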

800 (81%) ADV nodes are leaves.

140 (14%) ADV nodes have one child.

22 (2%) ADV nodes have two children.

20 (2%) ADV nodes have three or more children.

The highest child degree of an ADV node is 6.
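As a consistency check, the four child-count buckets above should add up to the 982 ADV tokens, and the percentages follow from rounding each share. A small sketch of that arithmetic, using only the numbers reported on this page (the dictionary keys are illustrative labels):

```python
# Child-count buckets reported above for ADV nodes.
child_buckets = {
    "leaves": 800,
    "one child": 140,
    "two children": 22,
    "three or more children": 20,
}

total = sum(child_buckets.values())
# Rounded percentage share of each bucket.
shares = {name: round(100 * count / total) for name, count in child_buckets.items()}

print(total)   # 982, matching the ADV token count
print(shares)  # shares of 81, 14, 2 and 2 percent
```

The rounded shares sum to 99 rather than 100, which is an effect of rounding each bucket separately.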

Children of ADV nodes are attached using 24 different relations: punct (123; 49% instances), nsubj (19; 8% instances), nummod (17; 7% instances), fixed (13; 5% instances), compound (11; 4% instances), compound:redup (8; 3% instances), nmod (8; 3% instances), obl (8; 3% instances), advmod (7; 3% instances), det (7; 3% instances), cop (6; 2% instances), nmod:poss (5; 2% instances), case (4; 2% instances), amod (3; 1% instances), cc (3; 1% instances), advcl (2; 1% instances), discourse (2; 1% instances), acl (1; 0% instances), advmod:emph (1; 0% instances), aux (1; 0% instances), compound:lvc (1; 0% instances), nmod:tmod (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)

Children of ADV nodes belong to 13 different parts of speech: PUNCT (123; 49% instances), NOUN (36; 14% instances), ADV (18; 7% instances), NUM (16; 6% instances), PRON (16; 6% instances), VERB (15; 6% instances), ADJ (7; 3% instances), AUX (7; 3% instances), ADP (6; 2% instances), DET (6; 2% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)