home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: ADP

There are 58 ADP lemmas (0%), 69 ADP types (0%) and 31943 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 9 in number of lemmas, 11 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: v, na, s, z, o, do, k, pro, za, po

The 10 most frequent ADP types: v, na, o, s, z, do, ve, k, pro, za

The 10 most frequent ambiguous lemmas: v (ADP 7919, NOUN 3), s (ADP 2504, NOUN 27, X 10, PART 6), o (ADP 2329, NOUN 33), do (ADP 1605, PROPN 2, X 2), k (ADP 1541, NOUN 4), u (ADP 500, NOUN 3), nad (ADP 199, ADJ 1), kolem (ADP 102, ADV 3), mimo (ADP 71, ADV 3), místo (NOUN 222, ADP 45, ADV 6)

The 10 most frequent ambiguous types: v (ADP 5786, NOUN 3), o (ADP 2182, NOUN 33, ADJ 6), s (ADP 1960, NOUN 72, X 10, PART 6), do (ADP 1513, PROPN 2, X 2), k (ADP 1196, NOUN 4), po (ADP 730, NOUN 1), u (ADP 417, NOUN 3), se (PRON 4768, ADP 430), nad (ADP 182, ADJ 1), kolem (ADP 96, ADV 3, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.189655 (the average of all parts of speech is 1.961704).

The 1st highest number of forms (3) was observed with the lemma “k”: k, ke, ku.

The 2nd highest number of forms (3) was observed with the lemma “nad”: n, nad, nade.

The 3rd highest number of forms (3) was observed with the lemma “před”: př, před, přede.

ADP occurs with 4 features: AdpType (31943; 100% instances), Case (31866; 100% instances), Abbr (10; 0% instances), Style (1; 0% instances)

ADP occurs with 11 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Style=Coll

ADP occurs with 15 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (11172 tokens). Examples: v, na, o, po, při

Relations

ADP nodes are attached to their parents using 10 different relations: case (31571; 99% instances), fixed (339; 1% instances), mark (7; 0% instances), nmod (7; 0% instances), obl (7; 0% instances), conj (6; 0% instances), dep (2; 0% instances), root (2; 0% instances), advcl (1; 0% instances), obj (1; 0% instances)

Parents of ADP nodes belong to 14 different parts of speech: NOUN (24868; 78% instances), PROPN (3202; 10% instances), PRON (1350; 4% instances), DET (1047; 3% instances), NUM (511; 2% instances), ADJ (417; 1% instances), ADP (249; 1% instances), X (134; 0% instances), ADV (118; 0% instances), SYM (24; 0% instances), VERB (13; 0% instances), PART (7; 0% instances), (2; 0% instances), INTJ (1; 0% instances)

31363 (98%) ADP nodes are leaves.

401 (1%) ADP nodes have one child.

177 (1%) ADP nodes have two children.

2 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 6.

Children of ADP nodes are attached using 15 different relations: fixed (737; 96% instances), advmod (3; 0% instances), cc (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), punct (3; 0% instances), cop (2; 0% instances), det (2; 0% instances), nsubj (2; 0% instances), acl (1; 0% instances), aux (1; 0% instances), dep (1; 0% instances), mark (1; 0% instances), obl (1; 0% instances), obl:arg (1; 0% instances)

Children of ADP nodes belong to 10 different parts of speech: NOUN (497; 65% instances), ADP (249; 33% instances), ADV (3; 0% instances), AUX (3; 0% instances), CCONJ (3; 0% instances), PUNCT (3; 0% instances), DET (2; 0% instances), PROPN (2; 0% instances), SCONJ (1; 0% instances), VERB (1; 0% instances)