Treebank Statistics: UD_Croatian-SET: POS Tags: X
There are 522 X
lemmas (3%), 528 X
types (1%) and 772 X
tokens (0%).
Out of 17 observed tags, the rank of X
is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: online, of, the, A1, de, times, international, and, company, world
The 10 most frequent X
types: online, of, the, A1, de, Company, and, world, European, International
The 10 most frequent ambiguous lemmas: online (X 14, ADJ 2, PROPN 1), de (X 9, PROPN 5), a (CCONJ 921, X 4, INTJ 1, NOUN 1), da (SCONJ 1807, PART 11, X 5), house (X 2, ADJ 1), in (X 5, ADJ 3), gay (X 4, ADJ 1), x (X 4, PUNCT 2), Club (PROPN 3, X 3), Directors (X 3, PROPN 1)
The 10 most frequent ambiguous types: online (X 8, ADJ 2), de (X 9, PROPN 5), Company (X 7, PROPN 1), International (X 6, PROPN 2), a (CCONJ 883, X 3), da (SCONJ 1783, PART 3, X 2), gay (X 4, ADJ 1), x (X 4, PUNCT 2), Bio (AUX 7, X 3, ADJ 1), Club (X 3, PROPN 2)
- online
- de
- Company
- International
- a
- da
- SCONJ 1783: ” Desetljeće kasnije , vidimo da je gospodarstvo užasno nestrukturirano .
- PART 3: O da .
- X 2: Na tu je snimku u montaži dodan njegov glas koji kaže : Uzoriti prvi ministre , Eduardo da Silva , broj 22 , najbolje zabija , na što premijer Sanader odgovara : Gospodine zastupniče , točno je ovo što ste kazali .
- gay
- x
- X 4: Zidar Mato Luladžić popravio je na opće zadovoljstvo kapelu svetog Luke u Šagovini za 65. f . i 25 x . Listopad 1888. .
- PUNCT 2: Uređaj bi trebao imati procesor s taktom od 1 GHz i rezoluciju ekrana od 800 x 400 piksela , što ga čini svojevrsnom uvećanom verzijom Samsungovog mobilnog telefona Galaxy S .
- Bio
- AUX 7: ” Bio sam u školi na dan izbora – imamo veliki televizor u dvorani .
- X 3: Bio Out proizvode možeš pronaći u trgovinama ili web shopu Gardena , glavnog distributera Bio Out proizvoda .
- ADJ 1: Međutim , kako bi se navedeni pozitivni trendovi odrazili i na inozemna tržišta , udruženja Bio Austria i Agrarmarkt Austria pokrenula su izvoznu ofenzivu ekoloških proizvoda pod motom “ Bio aus Österreich “ , kako bi se austrijski ekološki proizvodi diljem svijeta promovirao kao vrhunski brend .
- Club
- X 3: Za radove su dobile brojna međunarodna priznanja , primjerice , Type Directors Club Tokyo , I.D. Magazine , Type Directors Club New York , Art Directors Club New York , ICOGRADA Exellence Award i druga .
- PROPN 2: Za radove su dobile brojna međunarodna priznanja , primjerice , Type Directors Club Tokyo , I.D. Magazine , Type Directors Club New York , Art Directors Club New York , ICOGRADA Exellence Award i druga .
Morphology
The form / lemma ratio of X
is 1.011494 (the average of all parts of speech is 1.848309).
The 1st highest number of forms (4) was observed with the lemma “Times”: Times, Times-u, Timesa, Timesom.
The 2nd highest number of forms (2) was observed with the lemma “House”: House, Housea.
The 3rd highest number of forms (2) was observed with the lemma “International”: International, Internationala.
X
occurs with 1 features: Foreign (732; 95% instances)
X
occurs with 1 feature-value pairs: Foreign=Yes
X
occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes
(732 tokens).
Examples: online, of, the, de, Company, and, world, European, International, Freedom
Relations
X
nodes are attached to their parents using 20 different relations: flat (365; 47% instances), nmod (102; 13% instances), amod (65; 8% instances), flat:foreign (54; 7% instances), nsubj (45; 6% instances), appos (44; 6% instances), conj (27; 3% instances), parataxis (16; 2% instances), obl (15; 2% instances), obj (11; 1% instances), fixed (9; 1% instances), case (6; 1% instances), root (5; 1% instances), discourse (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), cc (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances), xcomp (1; 0% instances)
Parents of X
nodes belong to 9 different parts of speech: X (333; 43% instances), NOUN (271; 35% instances), PROPN (79; 10% instances), VERB (57; 7% instances), ADJ (16; 2% instances), NUM (5; 1% instances), (5; 1% instances), ADV (3; 0% instances), AUX (3; 0% instances)
479 (62%) X
nodes are leaves.
110 (14%) X
nodes have one child.
72 (9%) X
nodes have two children.
111 (14%) X
nodes have three or more children.
The highest child degree of a X
node is 9.
Children of X
nodes are attached using 22 different relations: flat (257; 36% instances), punct (187; 26% instances), flat:foreign (55; 8% instances), conj (50; 7% instances), nmod (46; 6% instances), case (29; 4% instances), amod (20; 3% instances), appos (15; 2% instances), cc (12; 2% instances), advmod (9; 1% instances), parataxis (7; 1% instances), fixed (5; 1% instances), cop (4; 1% instances), discourse (4; 1% instances), acl (3; 0% instances), det (3; 0% instances), nsubj (3; 0% instances), nummod (2; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances), nummod:gov (1; 0% instances)
Children of X
nodes belong to 15 different parts of speech: X (333; 47% instances), PUNCT (187; 26% instances), NOUN (54; 8% instances), PROPN (46; 6% instances), ADP (27; 4% instances), ADJ (21; 3% instances), CCONJ (13; 2% instances), ADV (11; 2% instances), AUX (5; 1% instances), VERB (5; 1% instances), DET (4; 1% instances), NUM (4; 1% instances), PART (2; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)