This page pertains to UD version 2.

Treebank Statistics: UD_Croatian-SET: POS Tags: X

There are 522 X lemmas (3%), 528 X types (1%) and 772 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: online, of, the, A1, de, times, international, and, company, world

The 10 most frequent X types: online, of, the, A1, de, Company, and, world, European, International

The 10 most frequent ambiguous lemmas: online (X 14, ADJ 2, PROPN 1), de (X 9, PROPN 5), a (CCONJ 921, X 4, INTJ 1, NOUN 1), da (SCONJ 1807, PART 11, X 5), house (X 2, ADJ 1), in (X 5, ADJ 3), gay (X 4, ADJ 1), x (X 4, PUNCT 2), Club (PROPN 3, X 3), Directors (X 3, PROPN 1)

The 10 most frequent ambiguous types: online (X 8, ADJ 2), de (X 9, PROPN 5), Company (X 7, PROPN 1), International (X 6, PROPN 2), a (CCONJ 883, X 3), da (SCONJ 1783, PART 3, X 2), gay (X 4, ADJ 1), x (X 4, PUNCT 2), Bio (AUX 7, X 3, ADJ 1), Club (X 3, PROPN 2)

Morphology

The form / lemma ratio of X is 1.011494 (the average of all parts of speech is 1.848309).
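The reported ratio is consistent with dividing the number of X types by the number of X lemmas given in the summary above. The following is a minimal sketch of that arithmetic; the variable names are illustrative, and the assumption that the ratio is computed as types over lemmas is inferred from the numbers rather than stated on this page.

```python
# Counts quoted in the summary paragraph of this page.
x_lemmas = 522
x_types = 528

# Assumption: form / lemma ratio = distinct word forms (types) / distinct lemmas.
form_lemma_ratio = x_types / x_lemmas

print(f"{form_lemma_ratio:.6f}")  # prints 1.011494
```

A value this close to 1 means that almost every X lemma was observed in a single form, which fits a tag dominated by uninflected foreign words.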

The 1st highest number of forms (4) was observed with the lemma “Times”: Times, Times-u, Timesa, Timesom.

The 2nd highest number of forms (2) was observed with the lemma “House”: House, Housea.

The 3rd highest number of forms (2) was observed with the lemma “International”: International, Internationala.

X occurs with 1 feature: Foreign (732; 95% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (732 tokens). Examples: online, of, the, de, Company, and, world, European, International, Freedom

Relations

X nodes are attached to their parents using 20 different relations: flat (365; 47% instances), nmod (102; 13% instances), amod (65; 8% instances), flat:foreign (54; 7% instances), nsubj (45; 6% instances), appos (44; 6% instances), conj (27; 3% instances), parataxis (16; 2% instances), obl (15; 2% instances), obj (11; 1% instances), fixed (9; 1% instances), case (6; 1% instances), root (5; 1% instances), discourse (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), cc (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances), xcomp (1; 0% instances)
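A tally like the one above can be reproduced from a CoNLL-U file by counting the DEPREL column for every token whose UPOS is X. The sketch below is one possible way to do this, not the script that generated this page. The `SAMPLE` fragment and the `deprel_counts` helper are hypothetical and are not taken from UD_Croatian-SET.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment for illustration only; not from the treebank.
SAMPLE = """\
# text = New York Times .
1\tNew\tNew\tX\t_\tForeign=Yes\t0\troot\t_\t_
2\tYork\tYork\tX\t_\tForeign=Yes\t1\tflat\t_\t_
3\tTimes\tTimes\tX\t_\tForeign=Yes\t1\tflat\t_\t_
4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""


def deprel_counts(conllu_text, upos="X"):
    """Count dependency relations of all tokens with the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence boundaries) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:      # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts


counts = deprel_counts(SAMPLE)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

To run this on real data, the text of a treebank file would be passed in place of `SAMPLE`.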

Parents of X nodes belong to 9 different parts of speech: X (333; 43% instances), NOUN (271; 35% instances), PROPN (79; 10% instances), VERB (57; 7% instances), ADJ (16; 2% instances), NUM (5; 1% instances), no parent, i.e. the X node is the root (5; 1% instances), ADV (3; 0% instances), AUX (3; 0% instances)

479 (62%) X nodes are leaves.

110 (14%) X nodes have one child.

72 (9%) X nodes have two children.

111 (14%) X nodes have three or more children.

The highest child degree of an X node is 9.
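The four child-count figures above partition the X tokens, so they should sum to the 772 X tokens reported at the top of the page. The short check below confirms this and recomputes the rounded percentages; the dictionary keys are illustrative labels only.

```python
# Child-count buckets for X nodes, as reported on this page.
buckets = {
    "leaf": 479,
    "one child": 110,
    "two children": 72,
    "three or more children": 111,
}

total = sum(buckets.values())

# Percentages rounded to whole numbers, as on this page.
shares = {name: round(100 * n / total) for name, n in buckets.items()}

print(total)   # prints 772
print(shares)
```

Because each share is rounded independently, the percentages add up to 99 rather than 100.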

Children of X nodes are attached using 22 different relations: flat (257; 36% instances), punct (187; 26% instances), flat:foreign (55; 8% instances), conj (50; 7% instances), nmod (46; 6% instances), case (29; 4% instances), amod (20; 3% instances), appos (15; 2% instances), cc (12; 2% instances), advmod (9; 1% instances), parataxis (7; 1% instances), fixed (5; 1% instances), cop (4; 1% instances), discourse (4; 1% instances), acl (3; 0% instances), det (3; 0% instances), nsubj (3; 0% instances), nummod (2; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances), nummod:gov (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (333; 47% instances), PUNCT (187; 26% instances), NOUN (54; 8% instances), PROPN (46; 6% instances), ADP (27; 4% instances), ADJ (21; 3% instances), CCONJ (13; 2% instances), ADV (11; 2% instances), AUX (5; 1% instances), VERB (5; 1% instances), DET (4; 1% instances), NUM (4; 1% instances), PART (2; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)