Treebank Statistics: UD_Serbian-SET: POS Tags: DET
There are 51 DET lemmas (0%), 218 DET types (1%) and 3639 DET tokens (4%).
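The three counts above can be reproduced from any annotated corpus by collecting distinct lemmas, distinct word forms (types) and total occurrences (tokens) for one tag. The following is a minimal sketch, not the script that generated this page: the `ROWS` data is a hypothetical toy sample rather than treebank content, `count_tag` is an illustrative helper name, and forms are compared exactly as written (whether the original statistics fold case is an assumption left open here).

```python
# Toy (form, lemma, upos) triples; hypothetical data, NOT taken from the treebank.
ROWS = [
    ("to", "taj", "DET"),
    ("toga", "taj", "DET"),
    ("koji", "koji", "DET"),
    ("koje", "koji", "DET"),
    ("to", "taj", "DET"),
    ("ideja", "ideja", "NOUN"),
]

def count_tag(rows, tag):
    """Return (lemmas, types, tokens) for one part-of-speech tag."""
    selected = [(form, lemma) for form, lemma, upos in rows if upos == tag]
    lemmas = {lemma for _, lemma in selected}   # distinct dictionary forms
    types = {form for form, _ in selected}      # distinct surface forms
    tokens = len(selected)                      # every occurrence counts
    return len(lemmas), len(types), tokens

det_counts = count_tag(ROWS, "DET")
print(det_counts)
```

On real data the rows would come from the FORM, LEMMA and UPOS columns of the treebank's CoNLL-U files.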
Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 8 in number of tokens.
The 10 most frequent DET lemmas: koji, taj, svoj, ovaj, sav, njegov, neki, njihov, onaj, svaki
The 10 most frequent DET types: koji, to, koje, koja, svoje, ove, sve, toga, nekoliko, koju
The 10 most frequent ambiguous lemmas: taj (DET 732, ADJ 1), svoj (DET 347, ADJ 1), sav (DET 138, ADJ 24), njegov (DET 135, ADJ 1), neki (DET 124, ADJ 2, SCONJ 1), nekoliko (DET 69, ADV 6), takav (DET 66, ADJ 1), njen (DET 63, ADJ 3), mnogo (ADV 119, DET 38), sve (DET 29, PART 22, ADV 7, ADJ 2)
The 10 most frequent ambiguous types: to (DET 192, PART 1), sve (DET 70, PART 21, ADJ 11, ADV 6), nekoliko (DET 65, ADV 1), svoj (DET 50, ADJ 1), te (DET 40, CCONJ 4, NOUN 1), više (ADV 90, DET 29, ADJ 3), svi (DET 13, ADJ 5), svih (DET 25, ADJ 1), tim (DET 24, NOUN 9), svojim (DET 23, ADJ 1)
- to
  - DET 192: Bar na papiru , to izgleda kao odlična ideja .
  - PART 1: S tim u vezi , i to ne samo povodom ovog slučaja , već , nažalost , povodom velikog broja sličnih , u mnoštvu različitih oblasti , treba se , u uslovima ( hiper ) produkcije novih zakona , suočiti s činjenicom da naš glavni problem nije kvalitet naših zakona ( kakav god da je ) , već način na koji se „ primenjuju ” .
- sve
  - DET 70: A ako je sve u redu , mogu da otpočnu projekat “ , kaže Božinov .
  - PART 21: I sve mi nešto ta slika poznata , kao da sam je negde video .
  - ADJ 11: To će biti korisno za sve ljude na Kosovu . “
  - ADV 6: S druge strane imamo i sve više aplikacija za pametne telefone , koje omogućavaju umrežavanje pametnih uređaja .
- nekoliko
- svoj
- te
  - DET 40: A , čak će i te dugove biti teško otplatiti .
  - CCONJ 4: Vasilis Zotos , istaknuti arhitekta i urbani planer , te predsednik lokalnog Društva britanskih diplomaca , ne slaže se i kritikuje sistem .
  - NOUN 1: Sportisti iz 11 zemalja takmičili su se u gađanju lukom i strelom , košarci , gimnastici , fudbalu , plivanju , te kvon dou , atletici , odbojci i rvanju .
- više
- svi
- svih
- tim
- svojim
  - DET 23: ” Deci u selu bilo je rečeno da ostanu u svojim kućama “ , izvestio je Tajms onlajn .
  - ADJ 1: ” Preuranjeno pokretanje intenzivnog novog procesa ne bi bilo preporučljivo “ , rekao je podsekretar UN-a za politička pitanja Kieran Prendergast informišući Savet bezbednosti o svojim posetama Kipru , Grčkoj i Turskoj ranije ovog meseca .
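An ambiguous type, as used in the list above, is a word form that appears with more than one part-of-speech tag. The sketch below shows one way such a list could be derived. It is an illustration under assumptions: `TAGGED` is hypothetical toy data, and `ambiguous_types` is an invented helper name, not part of any UD tool.

```python
from collections import Counter, defaultdict

# Toy (form, upos) pairs; hypothetical data, NOT the treebank's actual counts.
TAGGED = [
    ("te", "DET"), ("te", "DET"), ("te", "DET"),
    ("te", "CCONJ"),
    ("koji", "DET"), ("koji", "DET"),
]

def ambiguous_types(pairs):
    """Map each form seen with 2+ tags to its (tag, count) list, most frequent first."""
    tags_by_form = defaultdict(Counter)
    for form, upos in pairs:
        tags_by_form[form][upos] += 1
    return {
        form: counts.most_common()
        for form, counts in tags_by_form.items()
        if len(counts) > 1          # keep only forms with more than one tag
    }

ambiguous = ambiguous_types(TAGGED)
print(ambiguous)
```

Applying the same grouping to lemmas instead of forms would give the ambiguous-lemma list.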
Morphology
The form / lemma ratio of DET is 4.274510 (the average of all parts of speech is 1.845258).
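The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas given at the top of this page (218 and 51). The short check below assumes that this is how the figure was obtained; the variable names are illustrative.

```python
# Counts reported on this page for DET.
det_types = 218    # distinct word forms
det_lemmas = 51    # distinct lemmas

# Assumption: form / lemma ratio = types / lemmas.
ratio = det_types / det_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 4.274510
```

A ratio well above the all-tag average of 1.845258 reflects that determiners here inflect for case, gender and number, so each lemma surfaces in several forms.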
The highest number of forms (15) was observed with the lemma “taj”: ove, ta, taj, te, ti, tih, tim, time, to, tog, toga, toj, tom, tome, tu.
The second highest number of forms (14) was observed with the lemma “koji”: kog, koja, koje, kojeg, kojem, koji, kojih, kojim, kojima, kojoj, kojom, koju, kom, kome.
The third highest number of forms (12) was observed with the lemma “onaj”: ona, onaj, one, oni, onih, onima, ono, onog, onoga, onoj, onom, onome.
DET occurs with 11 features: PronType (3574; 98% instances), Case (3490; 96% instances), Gender (3490; 96% instances), Number (3486; 96% instances), Poss (762; 21% instances), Person (372; 10% instances), Number[psor] (371; 10% instances), Reflex (347; 10% instances), Gender[psor] (198; 5% instances), Animacy (160; 4% instances), Degree (149; 4% instances)
DET occurs with 31 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc,Neut, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Tot, Reflex=Yes
DET occurs with 287 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|PronType=Int,Rel (259 tokens).
Examples: koji, kakav, ko, nijedan, takođe
Relations
DET nodes are attached to their parents using 17 different relations: det (1725; 47% instances), nsubj (1023; 28% instances), obl (367; 10% instances), obj (236; 6% instances), det:numgov (147; 4% instances), nmod (42; 1% instances), amod (22; 1% instances), conj (18; 0% instances), discourse (17; 0% instances), root (17; 0% instances), fixed (7; 0% instances), acl (6; 0% instances), ccomp (4; 0% instances), advmod (3; 0% instances), parataxis (3; 0% instances), advcl (1; 0% instances), appos (1; 0% instances)
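The percentages in the relation list are each count's share of all 3639 DET tokens, rounded to a whole percent, which is why relations with fewer than about 18 instances show as 0%. The sketch below recomputes the shares from the counts listed above; the dictionary name and the use of Python's built-in `round` are illustrative choices, not necessarily the rounding used by the original generator.

```python
# Incoming-relation counts for DET nodes, copied from the list above.
RELATION_COUNTS = {
    "det": 1725, "nsubj": 1023, "obl": 367, "obj": 236, "det:numgov": 147,
    "nmod": 42, "amod": 22, "conj": 18, "discourse": 17, "root": 17,
    "fixed": 7, "acl": 6, "ccomp": 4, "advmod": 3, "parataxis": 3,
    "advcl": 1, "appos": 1,
}

total = sum(RELATION_COUNTS.values())   # should equal the DET token count

# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in RELATION_COUNTS.items()}

for rel, n in RELATION_COUNTS.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The counts sum to 3639, so every DET token is covered by exactly one of the 17 relations.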
Parents of DET nodes belong to 13 different parts of speech: NOUN (2056; 56% instances), VERB (1213; 33% instances), ADJ (262; 7% instances), ADV (24; 1% instances), (17; 0% instances), AUX (16; 0% instances), DET (14; 0% instances), PRON (11; 0% instances), NUM (9; 0% instances), PROPN (8; 0% instances), ADP (4; 0% instances), PART (3; 0% instances), CCONJ (2; 0% instances)
3144 (86%) DET nodes are leaves.
367 (10%) DET nodes have one child.
88 (2%) DET nodes have two children.
40 (1%) DET nodes have three or more children.
The highest child degree of a DET node is 8.
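The child-degree figures above count, for each DET node, how many other nodes name it as their head. A minimal sketch of that computation for a single sentence follows. The `SENTENCE` data is a hypothetical toy tree, not treebank content, and `child_degree_buckets` is an invented helper name.

```python
from collections import Counter

# Toy sentence as (id, upos, head) triples; head 0 marks the root.
# Hypothetical data, NOT taken from the treebank.
SENTENCE = [
    (1, "ADP", 2),
    (2, "DET", 3),
    (3, "VERB", 0),
    (4, "DET", 5),
    (5, "NOUN", 3),
]

def child_degree_buckets(tokens, tag="DET"):
    """Bucket nodes with the given tag by their number of children."""
    children = Counter(head for _, _, head in tokens)   # head id -> number of dependents
    buckets = Counter()
    for tok_id, upos, _ in tokens:
        if upos != tag:
            continue
        k = children[tok_id]                             # 0 if nothing attaches here
        label = "leaf" if k == 0 else "one" if k == 1 else "two" if k == 2 else "three+"
        buckets[label] += 1
    return buckets

buckets = child_degree_buckets(SENTENCE)
print(dict(buckets))

# Sanity check on the page's own figures: the four buckets cover all DET tokens.
reported_total = 3144 + 367 + 88 + 40
print(reported_total)  # 3639
```

Summing such buckets over every sentence would give treebank-wide figures of the kind listed above.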
Children of DET nodes are attached using 23 different relations: case (309; 43% instances), acl (115; 16% instances), punct (77; 11% instances), advmod (42; 6% instances), cop (29; 4% instances), nsubj (25; 3% instances), discourse (22; 3% instances), nmod (21; 3% instances), cc (16; 2% instances), amod (13; 2% instances), conj (12; 2% instances), obl (12; 2% instances), det (5; 1% instances), fixed (5; 1% instances), parataxis (5; 1% instances), aux (4; 1% instances), nummod (3; 0% instances), advcl (2; 0% instances), csubj (2; 0% instances), det:numgov (2; 0% instances), mark (2; 0% instances), flat (1; 0% instances), orphan (1; 0% instances)
Children of DET nodes belong to 15 different parts of speech: ADP (308; 42% instances), VERB (104; 14% instances), PUNCT (77; 11% instances), NOUN (58; 8% instances), ADV (39; 5% instances), AUX (35; 5% instances), ADJ (26; 4% instances), PART (26; 4% instances), CCONJ (16; 2% instances), DET (14; 2% instances), SCONJ (9; 1% instances), NUM (6; 1% instances), PROPN (4; 1% instances), PRON (2; 0% instances), X (1; 0% instances)