Treebank Statistics: UD_Serbian-SET: POS Tags: NUM
There are 314 NUM lemmas (3%), 325 NUM types (2%) and 1278 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
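The lemma, type and token counts above can be reproduced in spirit with a short script. The following is a minimal sketch, assuming the input is in CoNLL-U format; the `SAMPLE` fragment and the `pos_counts` helper are hypothetical illustrations, not data or code from UD_Serbian-SET, and the sketch does not apply case folding (whether the published statistics do is not stated here).

```python
from collections import defaultdict

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Serbian-SET).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = (
    "1\tdve\tdva\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_\n"
    "2\tknjige\tknjiga\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "3\ti\ti\tCCONJ\t_\t_\t5\tcc\t_\t_\n"
    "4\tdva\tdva\tNUM\t_\tNumType=Card\t5\tnummod\t_\t_\n"
    "5\tpisma\tpismo\tNOUN\t_\t_\t2\tconj\t_\t_\n"
)


def pos_counts(conllu_text):
    """Return {UPOS: (distinct lemmas, distinct types, tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


counts = pos_counts(SAMPLE)
print(counts["NUM"])  # (1, 2, 2): one lemma (dva), two types (dve, dva), two tokens
```

Ranking the tags by any of the three numbers is then a matter of sorting the resulting dictionary by the corresponding tuple position.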
The 10 most frequent NUM lemmas: jedan, dva, tri, pet, četiri, 20, deset, šest, 50, oba
The 10 most frequent NUM types: jedan, tri, dve, dva, pet, četiri, 20, jedna, deset, šest
The 10 most frequent ambiguous lemmas: jedan (NUM 204, ADJ 2), dva (NUM 108, NOUN 1), oba (NUM 20, DET 1), 18 (NUM 4, ADJ 1), 19 (NUM 4, ADJ 1), 2008 (ADJ 4, NUM 2), 2009 (NUM 2, ADJ 1), jednom (ADV 4, NUM 2), 2000 (ADJ 1, NUM 1), 2005 (ADJ 3, NUM 1)
The 10 most frequent ambiguous types: dve (NUM 55, NOUN 1), jednog (NUM 17, ADJ 1), obe (NUM 14, DET 1), jednom (NUM 16, ADV 4), jednoj (NUM 11, ADJ 1), 18 (NUM 4, ADJ 1), 2008 (ADJ 4, NUM 2), 2009 (NUM 2, ADJ 1), 2000 (ADJ 1, NUM 1), 2005 (ADJ 3, NUM 1)
- dve
- jednog
  - NUM 17: ” Dobro je da se postigne konsenzus i da imamo jednog kandidata .
  - ADJ 1: Veliku odgovornost za ovo često snose sami korisnici jer radi pogodnosti da sve mogu da rade lako , brzo i sa jednog mesta , često ne ulažu dovoljno truda u zaštitu svojih pametnih sistema , čak ni kad je u pitanju najjednostavniji oblik zaštite poput duple lozinke .
- obe
- jednom
- jednoj
- 18
- 2008
- 2009
- 2000
  - ADJ 1: Kao direktorka za životnu sredinu u periodu 2000 - 2004 , ona je vodila pripreme za prvu strategiju banke za životnu sredinu i nadgledala budžet od 11 milijardi dolara ekoloških zajmova .
  - NUM 1: Grupa je dobila podršku EU , koja je nedavno odlučila da finansijski pomogne nastojanja u Alianiju usmerena na očuvanje grada , u okviru svog programa “ Kultura 2000 “ .
- 2005
  - ADJ 3: Nekoliko meseci pre zvaničnog početka pregovora o članstvu sa Briselom u oktobru 2005 , Ankara je potpisala protokol o proširivanju carinskog sporazuma sa EU na sve države članice , uključujući Kipar .
  - NUM 1: Pregovori su počeli u oktobru 2005 , a u međuvremenu je otvoreno 16 od 35 poglavlja , od kojih su dva privremeno zatvorena .
Morphology
The form / lemma ratio of NUM is 1.035032 (the average of all parts of speech is 1.845258).
The 1st highest number of forms (10) was observed with the lemma “jedan”: jedan, jedna, jedne, jedni, jednim, jedno, jednog, jednoj, jednom, jednu.
The 2nd highest number of forms (3) was observed with the lemma “dva”: dva, dve, dveju.
The 3rd highest number of forms (3) was observed with the lemma “oba”: oba, obe, obeju.
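The reported form / lemma ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given at the top of this page. The short check below is a sketch under that assumption; the variable names are illustrative only.

```python
# Counts reported on this page for NUM.
num_types = 325   # distinct word forms
num_lemmas = 314  # distinct lemmas

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.035032
```

A value this close to 1 reflects that most numerals occur in a single form; only a few lemmas such as “jedan”, “dva” and “oba” contribute additional forms.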
NUM occurs with 5 features: NumType (1278; 100% instances), Case (304; 24% instances), Gender (302; 24% instances), Number (279; 22% instances), Animacy (21; 2% instances)
NUM occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, NumType=Mult, Number=Plur, Number=Sing
NUM occurs with 34 feature combinations.
The most frequent feature combination is NumType=Card (923 tokens).
Examples: tri, dva, jedan, pet, dve, četiri, 20, deset, šest, 50
Relations
NUM nodes are attached to their parents using 13 different relations: nummod:gov (642; 50% instances), nummod (443; 35% instances), obl (53; 4% instances), conj (37; 3% instances), flat (34; 3% instances), nsubj (23; 2% instances), nmod (21; 2% instances), parataxis (11; 1% instances), amod (6; 0% instances), orphan (3; 0% instances), appos (2; 0% instances), obj (2; 0% instances), xcomp (1; 0% instances)
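The percentages in the relation list are rounded to whole numbers, which is why relations with only a handful of instances show as 0%. The sketch below recomputes them from the counts listed above, assuming ordinary rounding to the nearest integer; the dictionary name is illustrative.

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod:gov": 642, "nummod": 443, "obl": 53, "conj": 37, "flat": 34,
    "nsubj": 23, "nmod": 21, "parataxis": 11, "amod": 6, "orphan": 3,
    "appos": 2, "obj": 2, "xcomp": 1,
}

# Every NUM token has exactly one incoming relation, so the counts
# should add up to the total number of NUM tokens (1278).
total = sum(relation_counts.values())

# Whole-number percentages (assumed rounding to the nearest integer).
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
for rel, pct in percentages.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

For example, amod accounts for 6 of 1278 instances, roughly 0.5%, which rounds down to the 0% shown in the list.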
Parents of NUM nodes belong to 10 different parts of speech: NOUN (1056; 83% instances), VERB (65; 5% instances), NUM (48; 4% instances), ADV (41; 3% instances), PROPN (28; 2% instances), ADJ (25; 2% instances), DET (6; 0% instances), PRON (5; 0% instances), SYM (3; 0% instances), AUX (1; 0% instances)
801 (63%) NUM nodes are leaves.
311 (24%) NUM nodes have one child.
122 (10%) NUM nodes have two children.
44 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
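The leaf and child-degree figures above count, for each NUM node, how many other nodes name it as their head. A minimal sketch of that computation follows; the `TOKENS` list is a hypothetical toy sentence rather than treebank data, and `child_degree_histogram` is an illustrative helper name.

```python
from collections import Counter

# Hypothetical toy sentence as (id, upos, head) triples; head 0 marks the root.
TOKENS = [
    (1, "ADV", 2),   # adverb attached to the first numeral
    (2, "NUM", 3),   # numeral with one child
    (3, "NOUN", 0),  # root
    (4, "NUM", 3),   # numeral with no children (a leaf)
]


def child_degree_histogram(tokens, upos):
    """Map child degree -> number of nodes with the given UPOS tag."""
    # Number of children per node id: count how often each id appears as a head.
    children = Counter(head for _, _, head in tokens)
    # A Counter returns 0 for ids that never appear as a head, i.e. leaves.
    return Counter(children[i] for i, pos, _ in tokens if pos == upos)


hist = child_degree_histogram(TOKENS, "NUM")
print(hist[0])    # number of NUM leaves in the toy sentence: 1
print(max(hist))  # highest child degree of a NUM node in the toy sentence: 1
```

Run over a whole treebank, one sentence at a time, the same histogram would yield the leaf, one-child, two-children and three-or-more buckets.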
Children of NUM nodes are attached using 17 different relations: flat (244; 35% instances), advmod (197; 28% instances), case (100; 14% instances), punct (50; 7% instances), conj (32; 5% instances), cc (24; 3% instances), amod (15; 2% instances), nmod (12; 2% instances), det (8; 1% instances), acl (6; 1% instances), orphan (6; 1% instances), obl (5; 1% instances), cop (2; 0% instances), advcl (1; 0% instances), aux (1; 0% instances), discourse (1; 0% instances), parataxis (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: ADV (274; 39% instances), NOUN (152; 22% instances), ADP (108; 15% instances), PUNCT (50; 7% instances), NUM (48; 7% instances), CCONJ (24; 3% instances), ADJ (22; 3% instances), DET (9; 1% instances), PART (6; 1% instances), AUX (4; 1% instances), SCONJ (3; 0% instances), VERB (2; 0% instances), PRON (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)