Treebank Statistics: UD_Buryat-BDT: POS Tags: PROPN
There are 370 PROPN lemmas (14%), 440 PROPN types (10%) and 709 PROPN tokens (7%).
Out of 16 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.
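The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the tiny sample are hypothetical, and the sample reuses forms listed on this page with placeholder HEAD and DEPREL values.

```python
# Word forms reused from this page; the rows themselves are hypothetical.
ROWS = [
    ("Цыпелма", "Цыпелма", "PROPN"),
    ("Цыпелмада", "Цыпелма", "PROPN"),
    ("Байгал", "Байгал", "PROPN"),
    ("Цыпелма", "Цыпелма", "PROPN"),
    (".", ".", "PUNCT"),
]

# Build 10-column CoNLL-U lines: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
# HEAD and DEPREL are placeholders and do not form a valid tree.
SAMPLE = "\n".join(
    "\t".join([str(i), form, lemma, upos, "_", "_", "0", "root", "_", "_"])
    for i, (form, lemma, upos) in enumerate(ROWS, start=1)
)


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct word types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "PROPN"))
```

Run over the full treebank file, the same function would be expected to give the three PROPN figures reported above, assuming types are counted as distinct case-sensitive word forms.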
The 10 most frequent PROPN lemmas: Байгал, Булад, Улаан-Үдэ, Цыпелма, Энэдхэг, Баяр, Матти, Дандар, Хойто, Баянгаза
The 10 most frequent PROPN types: Байгал, Булад, Цыпелма, Баяр, Энэдхэг, Дандар, Матти, Хойто, Екатерина, Магдан
The 10 most frequent ambiguous lemmas: зандин (PROPN 6, NOUN 1), _ (VERB 10, PART 8, ADJ 7, NOUN 7, ADV 4, PROPN 4, PRON 3, DET 2, ADP 1)
The 10 most frequent ambiguous types: Баянгаза (PROPN 5, NOUN 1), Зүүн (PROPN 4, ADJ 1), Буладай (PROPN 2, NOUN 1), Гүрэнэй (NOUN 1, PROPN 1)
Morphology
The form / lemma ratio of PROPN is 1.189189 (the average of all parts of speech is 1.638355).
The 1st highest number of forms (5) was observed with the lemma “Цыпелма”: Цыпелма, Цыпелмаае, Цыпелмагай, Цыпелмада, Цыпелмае.
The 2nd highest number of forms (4) was observed with the lemma “_”: Абагал, Кондратьев, Кондратьевай, Хатареев.
The 3rd highest number of forms (4) was observed with the lemma “Лыгденов”: Лыгденов, Лыгденовтэ, Лыгденовтэнэй, Лыгденовэй.
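The morphology figures above can be illustrated with a short sketch. It groups word forms by lemma, using only the forms listed above for two lemmas, and checks that the reported form / lemma ratio coincides with the number of PROPN types divided by the number of PROPN lemmas (440 / 370). The helper name `forms_per_lemma` is hypothetical, and reading the ratio as types over lemmas is an assumption that happens to match the reported value.

```python
from collections import defaultdict

# (form, lemma) pairs copied from the lists above.
PAIRS = [
    ("Цыпелма", "Цыпелма"),
    ("Цыпелмаае", "Цыпелма"),
    ("Цыпелмагай", "Цыпелма"),
    ("Цыпелмада", "Цыпелма"),
    ("Цыпелмае", "Цыпелма"),
    ("Лыгденов", "Лыгденов"),
    ("Лыгденовтэ", "Лыгденов"),
    ("Лыгденовтэнэй", "Лыгденов"),
    ("Лыгденовэй", "Лыгденов"),
]


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


forms = forms_per_lemma(PAIRS)
for lemma, observed in sorted(forms.items(), key=lambda kv: -len(kv[1])):
    print(lemma, len(observed))

# Assumption: ratio = distinct PROPN types / distinct PROPN lemmas.
ratio_text = f"{440 / 370:.6f}"
print(ratio_text)  # 1.189189
```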
PROPN occurs with 6 features: Case (700; 99% instances), Gender (380; 54% instances), Number (10; 1% instances), Reflex (8; 1% instances), Number[psor] (2; 0% instances), Person[psor] (2; 0% instances)
PROPN occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Com, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number[psor]=Sing, Person[psor]=3, Reflex=Yes
PROPN occurs with 26 feature combinations.
The most frequent feature combination is Case=Nom (194 tokens).
Examples: Байгал, Энэдхэг, Хойто, Булад, Улаан-Үдэ, Баянгаза, Цыпелма, Баяр, Зүдхэлиин, Зүүн
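Feature counts and feature combinations like those above come from the FEATS column of CoNLL-U, where pairs are written as `Name=Value` and joined with `|`. The sketch below shows one way to tally them. The list `FEATS_COLUMN` is a hypothetical sample built from feature values named on this page, and `parse_feats` is an illustrative helper, not part of any UD tool.

```python
from collections import Counter

# Hypothetical FEATS values for five PROPN tokens; "_" means no features.
FEATS_COLUMN = [
    "Case=Nom",
    "Case=Gen|Gender=Masc",
    "Case=Nom",
    "Case=Dat|Gender=Fem",
    "_",
]


def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dictionary."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# How many tokens carry each feature (cf. "Case (700; 99% instances)").
feature_counts = Counter(
    name for feats in FEATS_COLUMN for name in parse_feats(feats)
)

# How often each whole combination occurs (cf. "Case=Nom (194 tokens)").
combination_counts = Counter(FEATS_COLUMN)

print(feature_counts)
print(combination_counts.most_common(1))
```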
Relations
PROPN nodes are attached to their parents using 12 different relations: flat (200; 28% instances), nmod (193; 27% instances), nsubj (91; 13% instances), compound (89; 13% instances), conj (53; 7% instances), appos (38; 5% instances), root (13; 2% instances), list (12; 2% instances), obj (10; 1% instances), vocative (6; 1% instances), iobj (2; 0% instances), obl (2; 0% instances)
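As a quick consistency check, the relation counts above should add up to the 709 PROPN tokens, and each percentage should be the count divided by that total, rounded to a whole number. The sketch below copies the counts from the list above; the variable names are illustrative only.

```python
# Relation counts copied from the list above.
RELATION_COUNTS = {
    "flat": 200, "nmod": 193, "nsubj": 91, "compound": 89,
    "conj": 53, "appos": 38, "root": 13, "list": 12,
    "obj": 10, "vocative": 6, "iobj": 2, "obl": 2,
}

total = sum(RELATION_COUNTS.values())

# Percentage of PROPN tokens per relation, rounded to the nearest integer.
percentages = {
    relation: round(100 * count / total)
    for relation, count in RELATION_COUNTS.items()
}

print(total)
for relation, pct in percentages.items():
    print(f"{relation}: {pct}%")
```

Counts of 2 out of 709 round down to 0%, which explains the "0% instances" entries for iobj and obl.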
Parents of PROPN nodes belong to 6 different parts of speech: NOUN (285; 40% instances), PROPN (256; 36% instances), VERB (132; 19% instances), ADJ (21; 3% instances), no part of speech, i.e. the artificial root (13; 2% instances), ADP (2; 0% instances)
325 (46%) PROPN nodes are leaves.
265 (37%) PROPN nodes have one child.
90 (13%) PROPN nodes have two children.
29 (4%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 9.
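The child-degree breakdown above can be derived from the HEAD column alone: a node's degree is the number of tokens that name it as their head. The following sketch works under stated assumptions. The `HEADS` list is a hypothetical four-token sentence, and the code counts every token, whereas the figures above are restricted to PROPN nodes (which would require an additional filter on UPOS).

```python
from collections import Counter

# Figures copied from the breakdown above.
REPORTED = {"leaves": 325, "one child": 265, "two children": 90, "three or more": 29}


def child_degrees(heads):
    """heads[i] is the 1-based head of token i+1 (0 = root); return child counts."""
    counts = Counter(h for h in heads if h != 0)
    return [counts.get(i, 0) for i in range(1, len(heads) + 1)]


def bucket(degree):
    """Assign a child count to one of the four classes used above."""
    if degree == 0:
        return "leaves"
    if degree == 1:
        return "one child"
    if degree == 2:
        return "two children"
    return "three or more"


# Hypothetical sentence: token 2 is the root, tokens 1 and 3 attach to it,
# and token 4 attaches to token 3.
HEADS = [2, 0, 2, 3]
degrees = child_degrees(HEADS)
histogram = Counter(bucket(d) for d in degrees)
reported_total = sum(REPORTED.values())

print(degrees)
print(dict(histogram))
print(reported_total)
```

The four reported classes sum to 709, so every PROPN token falls into exactly one of them.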
Children of PROPN nodes are attached using 19 different relations: flat (218; 40% instances), punct (153; 28% instances), conj (53; 10% instances), compound (22; 4% instances), nmod (21; 4% instances), cc (19; 3% instances), appos (14; 3% instances), list (12; 2% instances), acl (10; 2% instances), amod (6; 1% instances), discourse (5; 1% instances), case (3; 1% instances), nsubj (3; 1% instances), orphan (3; 1% instances), advmod (2; 0% instances), det (2; 0% instances), aux (1; 0% instances), nummod (1; 0% instances), parataxis (1; 0% instances)
Children of PROPN nodes belong to 11 different parts of speech: PROPN (256; 47% instances), PUNCT (153; 28% instances), NOUN (81; 15% instances), CCONJ (19; 3% instances), VERB (14; 3% instances), ADJ (10; 2% instances), ADV (5; 1% instances), PRON (5; 1% instances), ADP (4; 1% instances), AUX (1; 0% instances), NUM (1; 0% instances)