Treebank Statistics: UD_Czech-PDT: POS Tags: X
There are 962 X lemmas (4%), 972 X types (2%) and 1794 X tokens (1%).
Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
The 10 most frequent X lemmas: de, New, the, of, Open, play, off, York, Nature, and
The 10 most frequent X types: de, New, the, of, play, Open, off, Nature, York, and
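Counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that a "type" is a distinct case-sensitive word form and that multiword-token ranges and empty nodes are skipped. The function names and the tiny sample are hypothetical and not taken from the treebank tooling.

```python
from collections import Counter


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, tag="X"):
    """Return (lemma counter, type counter, token count) for one UPOS tag."""
    lemmas, types, tokens = Counter(), Counter(), 0
    for form, lemma, upos in read_tokens(conllu_text):
        if upos == tag:
            lemmas[lemma] += 1
            types[form] += 1
            tokens += 1
    return lemmas, types, tokens


def make_row(idx, form, lemma, upos):
    """Hypothetical helper: build a 10-column CoNLL-U line with dummy syntax."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "0", "dep", "_", "_"])


# Invented illustration data, not a sentence from UD_Czech-PDT.
SAMPLE = "\n".join([
    "# text = New York a New Jersey",
    make_row(1, "New", "New", "X"),
    make_row(2, "York", "York", "X"),
    make_row(3, "a", "a", "CCONJ"),
    make_row(4, "New", "New", "X"),
    make_row(5, "Jersey", "Jersey", "X"),
])

lemmas, types, tokens = tag_counts(SAMPLE)
print(len(lemmas), len(types), tokens)
print(lemmas.most_common(1))
```

Calling `lemmas.most_common(10)` or `types.most_common(10)` on real data would give frequency lists in the spirit of the two top-10 lines above.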
The 10 most frequent ambiguous lemmas: York (X 20, PROPN 16), s (ADP 2504, NOUN 27, X 10, PART 6), Gondwana (X 8, PROPN 2), a (CCONJ 7162, NOUN 17, X 6), O (NOUN 14, X 6), cup (NOUN 5, X 1), Telecom (PROPN 7, X 5), set (NOUN 17, X 4), to (PART 97, X 2), Jersey (PROPN 5, X 4)
The 10 most frequent ambiguous types: York (X 19, PROPN 1), s (ADP 1960, NOUN 72, X 10, PART 6), a (CCONJ 6945, ADJ 32, NOUN 17, X 6), O (ADP 147, NOUN 14, X 6), Telecom (PROPN 5, X 5), set (NUM 20, X 4, ADJ 1, NOUN 1), to (DET 1275, PART 93, X 2), Jersey (PROPN 5, X 4), Palace (X 4, PROPN 3), ad (PROPN 3, X 3, ADJ 1, ADP 1)
- York
  - X 19: New York -
  - PROPN 1: Chemik Phaedon Avouris a fyzik Robert E . Walkup z Thomas J . Watson Research Center firmy IBM v Yorktown Hights ( stát New York ) a Gerald Dujardin ( fyzik z univerzity Paříž - jih , Orsay , Francie ) předvedli tuto novou možnost s použitím molekul dekaboranu ( 14 ) ( B 10 H 14 ) adsorbovaných na křemíkovém povrchu .
- s
- a
  - CCONJ 6945: Je naprosto bezbřehý a nevypočitatelný .
  - ADJ 32: a . s . Malostranské nám . 2 118 00 Praha 1 Tel . / fax : 684 62 55
  - NOUN 17: Ušetříte téměř 90 % proti variantě a ) .
  - X 6: V gigantickém výstavním komplexu Porte de Versailles začal včera v Paříži jeden z největších a nejprestižnějších světových veletrhů módy Pret a Porter .
- O
- Telecom
- set
  - NUM 20: Zájem o jeho recitál třikrát převýšil nabídku asi sedmi set míst .
  - X 4: Z literatury je známo , že u laboratorních živočichů noradrenalin skutečně ovlivňuje termoregulační “ set point “ .
  - ADJ 1: V Havířově včera mohlo přijímat programy a . s . Kabelová televize šest set domácností .
  - NOUN 1: Když jsem vyhrála set , druhý často soupeřka vypustila , to profesionálka neudělá , už jsem prohrála i z mečbolu . “
- to
- Jersey
- Palace
- ad
  - PROPN 3: ( ad ) New York
  - X 3: ad MS hokej . juniorů
  - ADJ 1: V kontextu zmíněných racionálních argumentů i názorů většiny ostatních ( Pavel Dostál , Milan Schulz , Ladislav Chudík , Oldřich Daněk ad . ) působilo ideologizované volání Petra Pavlovského po zákazech a trezorech paradoxně jako ozvěna právě oněch dob , s jejichž mrtvolou se rozhodl tak obřadně zúčtovat .
  - ADP 1: ad hokej
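The ambiguity lists above pair each word form with the tags it was observed under. A minimal sketch of how such a table might be built follows. It assumes the input is a stream of (form, tag) pairs and ranks forms by their X count, which is an assumption, since the ordering criterion of the lists above is not stated. The sample pairs are invented.

```python
from collections import Counter, defaultdict


def ambiguous_types(rows, tag="X"):
    """Map each form seen with `tag` and at least one other tag to its tag counts.

    rows: iterable of (form, upos) pairs.
    """
    by_form = defaultdict(Counter)
    for form, upos in rows:
        by_form[form][upos] += 1
    return {form: tags for form, tags in by_form.items()
            if tag in tags and len(tags) > 1}


# Invented illustration data, not counts from UD_Czech-PDT.
rows = [
    ("York", "X"), ("York", "X"), ("York", "PROPN"),
    ("set", "NUM"), ("set", "X"),
    ("de", "X"),        # only ever X, so not ambiguous
    ("a", "CCONJ"),     # never X, so not listed
]

amb = ambiguous_types(rows)
# Assumed ordering: most frequent as X first.
ranked = sorted(amb, key=lambda form: amb[form]["X"], reverse=True)
for form in ranked:
    print(form, dict(amb[form]))
```

Passing (lemma, tag) pairs instead of (form, tag) pairs would give the lemma-level variant of the same table.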
Morphology
The form / lemma ratio of X is 1.010395 (the average of all parts of speech is 1.961704).
The 1st highest number of forms (2) was observed with the lemma “And”: AND, And.
The 2nd highest number of forms (2) was observed with the lemma “Banshees”: BANSHEES, Banshees.
The 3rd highest number of forms (2) was observed with the lemma “Cup”: CUP, Cup.
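The reported ratio agrees with dividing the 972 X types by the 962 X lemmas given at the top of the page. The sketch below checks that arithmetic and shows one way the ratio and the lemmas with the most forms could be derived from (form, lemma) pairs. It assumes forms are compared case-sensitively; the function names and the sample pairs are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas for (form, lemma) pairs."""
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


def richest_lemmas(pairs, top=3):
    """Return the `top` lemmas with the most distinct forms (ties broken by lemma)."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))
    return ranked[:top]


# Arithmetic check against the figures on this page.
reported = round(972 / 962, 6)
print(reported)

# Invented illustration data.
pairs = [("AND", "And"), ("And", "And"), ("New", "New"), ("York", "York")]
print(form_lemma_ratio(pairs))
print(richest_lemmas(pairs, top=1))
```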
X occurs with 1 feature: Foreign (1794; 100% instances)
X occurs with 1 feature-value pair: Foreign=Yes
X occurs with 1 feature combination.
The most frequent feature combination is Foreign=Yes (1794 tokens).
Examples: de, New, the, of, play, Open, off, Nature, York, and
Relations
X nodes are attached to their parents using 17 different relations: flat:foreign (827; 46% instances), nmod (552; 31% instances), conj (77; 4% instances), obl (76; 4% instances), nsubj (57; 3% instances), dep (53; 3% instances), root (53; 3% instances), appos (51; 3% instances), obj (20; 1% instances), case (14; 1% instances), cc (7; 0% instances), obl:arg (2; 0% instances), advcl (1; 0% instances), advmod:emph (1; 0% instances), amod (1; 0% instances), fixed (1; 0% instances), xcomp (1; 0% instances)
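The percentages in this relation list are each count divided by the 1794 X tokens, rounded to a whole percent. A small sketch of that calculation, using the counts listed above, is given below. The function name is hypothetical, and Python's built-in `round` is an assumed stand-in for whatever rounding the statistics generator applies.

```python
from collections import Counter


def distribution(counts):
    """Return (label, count, whole-percent share) tuples, most frequent first."""
    total = sum(counts.values())
    return [(label, n, round(100 * n / total))
            for label, n in counts.most_common()]


# Relation counts for X nodes as listed on this page.
relations = Counter({
    "flat:foreign": 827, "nmod": 552, "conj": 77, "obl": 76, "nsubj": 57,
    "dep": 53, "root": 53, "appos": 51, "obj": 20, "case": 14, "cc": 7,
    "obl:arg": 2, "advcl": 1, "advmod:emph": 1, "amod": 1, "fixed": 1,
    "xcomp": 1,
})

table = distribution(relations)
print(sum(relations.values()), len(relations))
for label, n, pct in table[:3]:
    print(f"{label} ({n}; {pct}% instances)")
```

The counts sum to 1794, so every X token is accounted for by exactly one incoming relation.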
Parents of X nodes belong to 9 different parts of speech: X (888; 49% instances), NOUN (510; 28% instances), PROPN (148; 8% instances), VERB (144; 8% instances), no part of speech, i.e. the artificial root (53; 3% instances), ADJ (22; 1% instances), NUM (22; 1% instances), ADV (6; 0% instances), AUX (1; 0% instances)
1012 (56%) X nodes are leaves.
307 (17%) X nodes have one child.
232 (13%) X nodes have two children.
243 (14%) X nodes have three or more children.
The highest child degree of an X node is 11.
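The leaf and child-degree figures above can be derived from the HEAD column of a dependency tree: a node's child degree is the number of tokens whose head is that node. The following is a minimal sketch under that assumption, working on one sentence given as (id, upos, head) tuples. The function name and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="X"):
    """Map child count -> number of `tag` nodes with that many children.

    sentence: list of (token id, upos, head id) tuples; head 0 is the root.
    """
    n_children = Counter(head for _, _, head in sentence)
    # Counter returns 0 for ids that never occur as a head, i.e. leaves.
    return Counter(n_children[tok_id]
                   for tok_id, upos, _ in sentence if upos == tag)


# Invented illustration tree: "New York a Jersey", headed by token 1.
sentence = [
    (1, "X", 0),
    (2, "X", 1),
    (3, "CCONJ", 4),
    (4, "X", 1),
]

hist = child_degree_histogram(sentence)
print(sorted(hist.items()))
print(max(hist))

# Consistency check on the page's own figures: the four buckets cover all X tokens.
buckets_total = 1012 + 307 + 232 + 243
print(buckets_total)
```

Summing histograms over all sentences and grouping degrees of three or more would give the four buckets reported above, and the largest key would be the highest child degree.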
Children of X nodes are attached using 23 different relations: flat:foreign (821; 46% instances), punct (361; 20% instances), case (143; 8% instances), nmod (116; 6% instances), nummod (73; 4% instances), amod (63; 4% instances), conj (63; 4% instances), appos (44; 2% instances), cc (32; 2% instances), dep (26; 1% instances), acl:relcl (13; 1% instances), det (7; 0% instances), cop (6; 0% instances), advmod:emph (4; 0% instances), mark (4; 0% instances), nsubj (3; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), advmod (2; 0% instances), orphan (2; 0% instances), aux (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances)
Children of X nodes belong to 16 different parts of speech: X (888; 50% instances), PUNCT (361; 20% instances), ADP (134; 7% instances), NOUN (116; 6% instances), NUM (87; 5% instances), PROPN (69; 4% instances), ADJ (64; 4% instances), VERB (22; 1% instances), CCONJ (20; 1% instances), ADV (8; 0% instances), DET (8; 0% instances), AUX (7; 0% instances), SCONJ (4; 0% instances), SYM (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)