Treebank Statistics: UD_Estonian-EDT: POS Tags: ADJ
There are 7249 ADJ
lemmas (17%), 13841 ADJ
types (17%) and 36870 ADJ
tokens (8%).
Out of 17 observed tags, the rank of ADJ
is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent ADJ
lemmas: suur, uus, esimene, suurem, viimane, hea, erinev, võimalik, väike, oluline
The 10 most frequent ADJ
types: suur, hea, võimalik, eesti, suurem, uue, suure, raske, esimene, oluline
The 10 most frequent ambiguous lemmas: suur (ADJ 714, ADV 5, NOUN 4, PROPN 1), uus (ADJ 614, NOUN 2, PROPN 1), esimene (ADJ 493, DET 31, PRON 21, NUM 1), suurem (ADJ 453, ADV 1), viimane (ADJ 444, NOUN 20), hea (ADJ 389, NOUN 26), väike (ADJ 250, NOUN 4), teine (DET 538, PRON 339, ADJ 218, NUM 2, PROPN 1), keskmine (ADJ 204, NOUN 3), järgmine (ADJ 201, NOUN 1)
The 10 most frequent ambiguous types: hea (ADJ 198, NOUN 14), eesti (ADJ 158, PROPN 3), esimene (ADJ 90, DET 8, PRON 3), vana (ADJ 71, NOUN 10), seotud (ADJ 83, VERB 68), esimest (ADJ 65, PRON 2, NUM 1), nn (ADJ 77, ADV 1), teatud (ADJ 71, VERB 2), viimane (ADJ 47, NOUN 5), tehtud (ADJ 80, VERB 63, NOUN 1)
- hea
- eesti
- esimene
- vana
- seotud
- esimest
- ADJ 65: Neli esimest uue klassi allveelaeva on juba tellitud .
- PRON 2: Iga maalitud klaasi välisküljel oli teine , valge klaasosa , mis esimest kaitses .
- NUM 1: Sarnaseks mahukamate tekstide analüüsinäiteks võib tuua veel Dr. Charles Nicholas’ [ 19 ] piibliteksti sügavamad uurimused ( seda enam , et piibel on kirjutatud heebrea , kreeka ja aramea keeles ning kaks esimest on on-line-versioonidena Internetis levinud ) , kui ta proovis vastata teoloogidele kaua arutlusainet pakkunud küsimustele :
- nn
- teatud
- viimane
- tehtud
- ADJ 80: Täna kummardatakse tehtud töö numbreid .
- VERB 63: Jälle oli mõeldud ja tehtud .
- NOUN 1: Pärnu-Jaagupi keskkooli abituriendid Heli Künnapas ( 17 ) ja Liliana Vorobjova ( 17 ) rääkisid Postimehele , et neil on valik , kuhu edasi õppima minna , tehtud , kuid pidasid “ Teeviida ” külastamist sellest hoolimata enesestmõistetavaks .
Morphology
The form / lemma ratio of ADJ
is 1.909367 (the average of all parts of speech is 1.912964).
The 1st highest number of forms (27) was observed with the lemma “väike”: Väikegi, väike, väike-, väikese, väikesed, väikeseid, väikeseks, väikesel, väikesele, väikesena, väikeses, väikesesse, väikesest, väikesi, väikest, väikeste, väikesteks, väikestel, väikestele, väikestes, väikestesse, väikestest, väikse, väiksed, väikseid, väiksel, väikses.
The 2nd highest number of forms (19) was observed with the lemma “järgmine”: järgmine, järgmise, järgmised, järgmiseid, järgmiseks, järgmisel, järgmisele, järgmisena, järgmises, järgmisesse, järgmisest, järgmisi, järgmisse, järgmist, järgmiste, järgmisteks, järgmistel, järgmistesse, järgmistest.
The 3rd highest number of forms (19) was observed with the lemma “suur”: suur, suurde, suure, suured, suureks, suurel, suurele, suurelt, suures, suurest, suuri, suurt, suurte, suurteks, suurtel, suurtele, suurtes, suurtesse, suurtest.
ADJ
occurs with 15 features: Degree (33137; 90% instances), Case (30584; 83% instances), Number (30477; 83% instances), VerbForm (7934; 22% instances), Voice (7927; 21% instances), Tense (7759; 21% instances), NumType (2659; 7% instances), NumForm (2484; 7% instances), PronType (277; 1% instances), Abbr (119; 0% instances), Hyph (34; 0% instances), Typo (14; 0% instances), Foreign (12; 0% instances), Gender (2; 0% instances), Definite (1; 0% instances)
ADJ
occurs with 44 feature-value pairs: Abbr=Yes
, Case=Abe
, Case=Abl
, Case=Add
, Case=Ade
, Case=All
, Case=Com
, Case=Ela
, Case=Ess
, Case=Gen
, Case=Ill
, Case=Ine
, Case=Nom
, Case=Par
, Case=Ter
, Case=Tra
, Definite=Ind
, Degree=Cmp
, Degree=Pos
, Degree=Sup
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Hyph=Yes
, NumForm=Digit
, NumForm=Roman
, NumForm=Word
, NumType=Card
, NumType=Ord
, Number=Plur
, Number=Sing
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Int,Rel
, PronType=Rel
, PronType=Tot
, Tense=Past
, Tense=Pres
, Typo=Yes
, VerbForm=Part
, VerbForm=Sup
, Voice=Act
, Voice=Pass
ADJ
occurs with 284 feature combinations.
The most frequent feature combination is Case=Nom|Degree=Pos|Number=Sing
(6910 tokens).
Examples: suur, võimalik, hea, oluline, raske, uus, väike, viimane, keskmine, selge
Relations
ADJ
nodes are attached to their parents using 25 different relations: amod (22010; 60% instances), acl (6843; 19% instances), root (2373; 6% instances), conj (2290; 6% instances), xcomp (999; 3% instances), ccomp (425; 1% instances), obl (422; 1% instances), advcl (391; 1% instances), parataxis (371; 1% instances), acl:relcl (213; 1% instances), obj (136; 0% instances), nsubj (122; 0% instances), nmod (78; 0% instances), nsubj:cop (73; 0% instances), csubj (33; 0% instances), flat (26; 0% instances), csubj:cop (22; 0% instances), orphan (18; 0% instances), appos (12; 0% instances), compound (5; 0% instances), advmod (4; 0% instances), compound:prt (1; 0% instances), dep (1; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances)
Parents of ADJ
nodes belong to 14 different parts of speech: NOUN (28525; 77% instances), VERB (2774; 8% instances), (2373; 6% instances), ADJ (1937; 5% instances), PROPN (711; 2% instances), PRON (268; 1% instances), NUM (157; 0% instances), ADV (103; 0% instances), SYM (9; 0% instances), DET (5; 0% instances), INTJ (3; 0% instances), X (3; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)
23360 (63%) ADJ
nodes are leaves.
7253 (20%) ADJ
nodes have one child.
1730 (5%) ADJ
nodes have two children.
4527 (12%) ADJ
nodes have three or more children.
The highest child degree of a ADJ
node is 13.
Children of ADJ
nodes are attached using 36 different relations: obl (5695; 17% instances), advmod (5400; 17% instances), punct (5291; 16% instances), cop (3900; 12% instances), nsubj:cop (3146; 10% instances), conj (2430; 7% instances), cc (1777; 5% instances), advcl (898; 3% instances), mark (857; 3% instances), obj (737; 2% instances), csubj:cop (664; 2% instances), aux (500; 2% instances), parataxis (353; 1% instances), xcomp (261; 1% instances), compound:prt (90; 0% instances), case (88; 0% instances), cc:preconj (72; 0% instances), nmod (69; 0% instances), det (67; 0% instances), nummod (40; 0% instances), ccomp (37; 0% instances), acl:relcl (36; 0% instances), amod (35; 0% instances), orphan (30; 0% instances), acl (28; 0% instances), nsubj (26; 0% instances), csubj (25; 0% instances), obl:agent (23; 0% instances), discourse (19; 0% instances), appos (15; 0% instances), goeswith (14; 0% instances), vocative (10; 0% instances), flat (9; 0% instances), compound (6; 0% instances), fixed (4; 0% instances), dep (1; 0% instances)
Children of ADJ
nodes belong to 16 different parts of speech: NOUN (8051; 25% instances), ADV (5806; 18% instances), PUNCT (5291; 16% instances), AUX (4400; 13% instances), VERB (2040; 6% instances), ADJ (1937; 6% instances), CCONJ (1774; 5% instances), PRON (1360; 4% instances), PROPN (850; 3% instances), SCONJ (729; 2% instances), NUM (196; 1% instances), ADP (89; 0% instances), DET (68; 0% instances), SYM (22; 0% instances), X (22; 0% instances), INTJ (18; 0% instances)