Treebank Statistics: UD_Sinhala-STB: Features: Number
This feature is universal.
It occurs with 3 different values: Plur
, Ptan
, Sing
.
304 tokens (35%) have a non-empty value of Number
.
249 types (50%) occur at least once with a non-empty value of Number
.
215 lemmas (52%) occur at least once with a non-empty value of Number
.
The feature is used with 4 part-of-speech tags: NOUN (242; 28% instances), PRON (29; 3% instances), PROPN (28; 3% instances), VERB (5; 1% instances).
NOUN
242 NOUN tokens (79% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Animacy=EMPTY (182; 75%), Gender=Neut (161; 67%).
NOUN
tokens may have the following values of Number
:
Plur
(59; 24% of non-emptyNumber
): කොටි, අංශ, අභියෝග, අයවැය, අස්සන්, ආණ්ඩුව, ආයතන, ආරාධනා, ආරාමවලට, උපදේශPtan
(10; 4% of non-emptyNumber
): යුද, කලකට, කලක්, ගිනි, දේශපාලන, බදු, විනය, සල්ලි, හමුදාවේSing
(173; 71% of non-emptyNumber
): මහතා, කිරීම, ජනතාව, තත්ත්වය, අයවැය, අවස්ථාව, ආණ්ඩුව, ආර්ථික, ආර්ථිකය, උද්ධමනයEMPTY
(66): ආර්ථික, සිදු, හමුදා, අද, අහෝසි, දේශපාලන, බොහෝ, වෙනස්, අත්අඩංගුවේ, අනාගත
Paradigm දේශපාලන | Plur | Ptan |
---|---|---|
Animacy=Inan | දේශපාලන | |
Gender=Neut | දේශපාලන |
Number
seems to be lexical feature of NOUN
. 97% lemmas (183) occur only with one value of Number
.
PRON
29 PRON tokens (66% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Poss=EMPTY (29; 100%), Animacy=EMPTY (24; 83%), Person=EMPTY (24; 83%), Case=Nom (18; 62%).
PRON
tokens may have the following values of Number
:
Plur
(4; 14% of non-emptyNumber
): අපට, ඔව්හු, ඔවුනට, ඔවුන්Sing
(25; 86% of non-emptyNumber
): ඔහු, එය, එහි, ඒ, ඔහුට, ඉන්, කිහිපයක්, මීට, මෙයEMPTY
(15): ඒ, ඊට, සිය, අප, අපේ, එකිනෙකා, එම, තම, මේ
Paradigm ඔහු | Sing | Plur |
---|---|---|
Animacy=Anim|Case=Nom|Typo=Yes | ඔව්හු | |
Case=Dat|Gender=Masc|Person=3 | ඔහුට | |
Case=Dat|Gender=Masc | ඔහුට | |
Case=Nom|Gender=Masc|Person=3 | ඔහු | |
Case=Nom|Gender=Masc | ඔහු |
PROPN
28 PROPN tokens (74% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Foreign=EMPTY (27; 96%), Animacy=EMPTY (20; 71%), Case=Nom (20; 71%), Definite=EMPTY (16; 57%).
PROPN
tokens may have the following values of Number
:
Sing
(28; 100% of non-emptyNumber
): ලංකාව, මහින්ද, රනිල්, රාජපක්ෂ, වික්රමසිංහ, ෆොන්සේකා, අමෙරිකාවේ, ඉන්දියාව, ඉරානය, චීනයEMPTY
(10): ශ්රී, කොසෝවෝ, යුනෙස්කෝ, ලිප්ටන්, ෂැවොලින්, සර්බියානු
Number
seems to be lexical feature of PROPN
. 100% lemmas (19) occur only with one value of Number
.
VERB
5 VERB tokens (5% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Aspect=EMPTY (5; 100%), Mood=Ind (3; 60%), Tense=Pres (3; 60%), VerbForm=Fin (3; 60%), Voice=Act (3; 60%).
VERB
tokens may have the following values of Number
:
Plur
(2; 40% of non-emptyNumber
): කරමු, ගනිතිSing
(3; 60% of non-emptyNumber
): කිරීමට, දරයි, යැවීමේEMPTY
(102): කර, තිබේ, ඇත්තේ, කළ, කළේ, දී, පාවා, වන්නේ, විය, වී
Paradigm කර | Sing | Plur |
---|---|---|
Definite=Def|VerbForm=Ger | කිරීමට | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | කරමු |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[nsubj]–> PRON (7; 78%),
PROPN –[flat]–> NOUN (7; 88%),
PROPN –[flat]–> PROPN (6; 67%),
NOUN –[acl]–> NOUN (1; 100%),
NOUN –[compound:prt]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod:poss]–> PROPN (1; 100%),
PROPN –[conj]–> PROPN (1; 100%),
VERB –[compound:lvc]–> NOUN (1; 100%).