Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: NUM
There are 166 NUM
lemmas (2%), 374 NUM
types (2%) and 1089 NUM
tokens (1%).
Out of 17 observed tags, the rank of NUM
is: 6 in number of lemmas, 7 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: два, три, оба, одинъ, четыри, пять, сто, 10, шесть, двадцать
The 10 most frequent NUM
types: два, три, ѡбѣ, две, чотыри, 10, 5, шесть, сто, 4
The 10 most frequent ambiguous lemmas: одинъ (NUM 65, DET 20), 10 (NUM 22, ADJ 17), 5 (NUM 20, ADJ 16), 4 (NUM 17, ADJ 13), много (NUM 17, ADV 3), 3 (NUM 15, ADJ 8), 6 (ADJ 17, NUM 12), 7 (NUM 12, ADJ 10), 20 (ADJ 11, NUM 10), 8 (ADJ 15, NUM 10)
The 10 most frequent ambiguous types: 10 (NUM 20, ADJ 13), 5 (NUM 17, ADJ 16), 4 (NUM 14, ADJ 11), 3 (NUM 11, ADJ 7), 6 (ADJ 17, NUM 10), 7 (NUM 10, ADJ 8), семъ (NUM 10, DET 5, PRON 4, ADV 2), 20 (ADJ 11, NUM 9), много (NUM 9, ADV 3), 12 (ADJ 25, NUM 8)
- 10
- 5
- 4
- 3
- 6
- 7
- семъ
- NUM 10: Венку обыскати семъ чоловеков против Кричова , а Кричов Анъдреиковичу .
- DET 5: Бо есмо им , ѡкромъ ѡдно сел(ь)ских путниковъ , в том праве ничого не рꙋшили и все писаное в перъвомъ нашомъ привил(ь)ю и в семъ нашом листе , што естъ вышеи выписано , потверъжаемъ симъ нашим листомъ вечно имъ и напотомъ бꙋдꙋчымъ их счадкомъ .
- PRON 4: На семъ же целуите ко мнѣ кр(е)стъ по любьви и в правду , без всѧкого извѣта .
- ADV 2: А хто тобѣ продал , даи семъ того » .
- 20
- много
- 12
Morphology
The form / lemma ratio of NUM
is 2.253012 (the average of all parts of speech is 2.589846).
The 1st highest number of forms (28) was observed with the lemma “одинъ”: оди(н), один, одино, одинъ, одно, одног(о), одного, однои, одномъ, одною, однъ, одным, одінъ, ѡдин, ѡдинъ, ѡдины, ѡдна, ѡдно, ѡдног(о), ѡдного, ѡднои, ѡдному, ѡдномꙋ, ѡдною, ѡдным, ѡднымъ, ѡднымь, ѡдъномꙋ.
The 2nd highest number of forms (15) was observed with the lemma “оба”: абею, абою, обею, обо(х), обою, обу, обѣ, обꙋ, ѡба, ѡбаи, ѡбе, ѡбема, ѡбою, ѡбу, ѡбѣ.
The 3rd highest number of forms (11) was observed with the lemma “два”: два, две, двема, двоу, двохъ, дву, двух, двѣ, двѣма, двꙋ, двꙋхъ.
NUM
occurs with 7 features: NumForm (1089; 100% instances), NumType (1089; 100% instances), Case (1087; 100% instances), Gender (495; 45% instances), Number (87; 8% instances), Animacy (13; 1% instances), Degree (2; 0% instances)
NUM
occurs with 21 feature-value pairs: Animacy=Anim
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Degree=Cmp
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumForm=Combi
, NumForm=Cyril
, NumForm=Digit
, NumForm=Word
, NumType=Card
, NumType=Frac
, NumType=Sets
, Number=Dual
, Number=Plur
, Number=Sing
NUM
occurs with 75 feature combinations.
The most frequent feature combination is Case=Nom|NumForm=Word|NumType=Card
(174 tokens).
Examples: чотыри, двадцать, пять, много, семъ, сто, шесть, двадцат(ь), тридцать, четыриста
Relations
NUM
nodes are attached to their parents using 21 different relations: nummod:gov (456; 42% instances), nummod (399; 37% instances), parataxis (66; 6% instances), compound (37; 3% instances), conj (36; 3% instances), dep (30; 3% instances), obl (14; 1% instances), root (14; 1% instances), obj (8; 1% instances), nmod (6; 1% instances), orphan (5; 0% instances), advcl (3; 0% instances), nsubj (3; 0% instances), appos (2; 0% instances), iobj (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances)
Parents of NUM
nodes belong to 9 different parts of speech: NOUN (877; 81% instances), VERB (114; 10% instances), ADJ (41; 4% instances), NUM (26; 2% instances), (14; 1% instances), PROPN (7; 1% instances), PRON (6; 1% instances), DET (3; 0% instances), INTJ (1; 0% instances)
948 (87%) NUM
nodes are leaves.
82 (8%) NUM
nodes have one child.
24 (2%) NUM
nodes have two children.
35 (3%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 10.
Children of NUM
nodes are attached using 26 different relations: punct (68; 25% instances), cc (46; 17% instances), case (25; 9% instances), conj (22; 8% instances), advmod (16; 6% instances), nmod (16; 6% instances), compound (15; 6% instances), nsubj (13; 5% instances), cop (8; 3% instances), orphan (7; 3% instances), obl (6; 2% instances), det (5; 2% instances), acl:relcl (3; 1% instances), appos (2; 1% instances), iobj (2; 1% instances), mark (2; 1% instances), nummod (2; 1% instances), nummod:gov (2; 1% instances), amod (1; 0% instances), dep (1; 0% instances), discourse (1; 0% instances), expl (1; 0% instances), fixed (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: PUNCT (68; 25% instances), CCONJ (46; 17% instances), NOUN (37; 14% instances), NUM (26; 10% instances), ADP (25; 9% instances), PART (14; 5% instances), AUX (9; 3% instances), SYM (9; 3% instances), VERB (9; 3% instances), DET (8; 3% instances), PRON (5; 2% instances), PROPN (4; 1% instances), ADJ (3; 1% instances), ADV (3; 1% instances), SCONJ (2; 1% instances)