Treebank Statistics: UD_Czech-CAC: Features: NameType
This feature is language-specific.
It occurs with 7 different values: Com
, Geo
, Giv
, Nat
, Oth
, Pro
, Sur
.
Some words have combined values of the feature; 12 combinations have been observed: Com|Geo
, Com|Giv
, Com|Pro
, Com|Sur
, Geo|Giv
, Geo|Oth
, Geo|Sur
, Giv|Oth
, Giv|Pro
, Giv|Sur
, Nat|Sur
, Pro|Sur
.
10592 tokens (2%) have a non-empty value of NameType
.
4886 types (8%) occur at least once with a non-empty value of NameType
.
3816 lemmas (13%) occur at least once with a non-empty value of NameType
.
The feature is used with 6 part-of-speech tags: PROPN (9814; 2% instances), ADJ (768; 0% instances), ADP (4; 0% instances), PART (4; 0% instances), CCONJ (1; 0% instances), PRON (1; 0% instances).
PROPN
9814 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of NameType
.
The most frequent other feature values with which PROPN
and NameType
co-occurred: Abbr=EMPTY (7936; 81%), Number=Sing (7187; 73%), Gender=Masc (5431; 55%).
PROPN
tokens may have the following values of NameType
:
Com
(1750; 18% of non-emptyNameType
): KSČ, ROH, ÚJČ, SSM, ČSAV, TIBA, NF, OV, GŘ, VÚMCom,Geo
(2; 0% of non-emptyNameType
): Böhmen, PragCom,Giv
(2; 0% of non-emptyNameType
): KonstruktivaCom,Pro
(3; 0% of non-emptyNameType
): Dermacol, Gambrinus, PrazdrojCom,Sur
(4; 0% of non-emptyNameType
): Bell, Bramsch, VenturaGeo
(3418; 35% of non-emptyNameType
): Praze, SSSR, Praha, ČSSR, Prahy, ČSR, Československa, NDR, Země, USAGeo,Giv
(16; 0% of non-emptyNameType
): Boleslav, Mořici, Kuby, Louis, Lucie, MořiceGeo,Oth
(1; 0% of non-emptyNameType
): ÖsterreichGeo,Sur
(26; 0% of non-emptyNameType
): Pavlov, Hejná, Tigridem, Blatná, Hlinka, Hracholusk, Janského, Lachaise, Lhota, LhotuGiv
(1221; 12% of non-emptyNameType
): Karel, Julius, Václav, Jana, Karla, Jaroslav, Josef, Jiří, Zdeněk, KlementGiv,Oth
(1; 0% of non-emptyNameType
): LuciiGiv,Pro
(8; 0% of non-emptyNameType
): Claudio, Othelo, Pascal, Pascalu, Radka, Raduna, Sandy, ZoraGiv,Sur
(24; 0% of non-emptyNameType
): Ariadna, Kosmy, Perry, Joy, Dante, Dantem, Eliot, Figara, James, JameseNat
(129; 1% of non-emptyNameType
): Adygejci, Pražané, Angličan, Egypťané, Keltové, Afričanů, Američana, Američané, Američanů, AsyřanéNat,Sur
(1; 0% of non-emptyNameType
): SrbaOth
(21; 0% of non-emptyNameType
): Opeplatis, SNP, Plastex, Rena, Erotissimo, Gaudeamus, Intermóda, Invex, Luna, MusikbuchPro
(290; 3% of non-emptyNameType
): Škoda, Merkur, Duha, Octavia, FSČ, Klad, Romatic, SAPO, SaS, TatramatPro,Sur
(3; 0% of non-emptyNameType
): Baracchi, Burda, MendozaSur
(2894; 29% of non-emptyNameType
): Fučík, Erben, Horálek, Knappová, Němec, Těšitelová, Lenin, Záveský, Kraus, Fučíka
Paradigm Svoboda | Com | Geo | Pro | Sur |
---|---|---|---|---|
Animacy=Anim|Case=Acc|Gender=Masc | Svobodu | |||
Animacy=Anim|Case=Gen|Gender=Masc | Svobody | |||
Animacy=Anim|Case=Nom|Gender=Masc | Svoboda | |||
Case=Loc|Gender=Fem | Svobodě | |||
Case=Nom|Gender=Fem | Svoboda | Svoboda |
NameType
seems to be lexical feature of PROPN
. 98% lemmas (3400) occur only with one value of NameType
.
ADJ
768 ADJ tokens (1% of all ADJ
tokens) have a non-empty value of NameType
.
The most frequent other feature values with which ADJ
and NameType
co-occurred: VerbForm=EMPTY (768; 100%), Voice=EMPTY (768; 100%), Degree=EMPTY (565; 74%), Polarity=EMPTY (565; 74%), Number=Sing (520; 68%), Animacy=EMPTY (489; 64%).
ADJ
tokens may have the following values of NameType
:
Com
(18; 2% of non-emptyNameType
): Koh, i, Telephone, Tonkünstler, Červeného, Deutscher, Jazykovedným, Povážské, Pražská, UnitedCom,Pro
(1; 0% of non-emptyNameType
): VereinGeo
(165; 21% of non-emptyNameType
): Králové, Kutná, Kašperských, Lužických, České, Mariánských, Nové, Vrátné, Bassa, JanskéGeo,Giv
(16; 2% of non-emptyNameType
): Karlovy, Josefův, Karlových, Františkových, Jindřichova, KonstantinovyGeo,Sur
(28; 4% of non-emptyNameType
): Gottwaldově, Vančurově, Jiráskova, Melantrichova, Alšově, Chotkově, Engelsových, Fučíkovy, Fučíkově, GottwaldovyGiv
(51; 7% of non-emptyNameType
): Karlovy, Karlův, Karlově, Angelino, Anglický, Ariadnin, Bozděchovy, Božetěchově, Buridanův, ChefrenovyOth
(1; 0% of non-emptyNameType
): SantaPro
(15; 2% of non-emptyNameType
): Rudého, Illustrierte, Mladé, Prague, Rouge, Schweizer, Super, Touring, Tourist, linguistiqueSur
(473; 62% of non-emptyNameType
): Erbenových, Erbenovy, Fučíkova, Mohorovičićovy, Bohrův, Erbenova, Erbenově, Fučíkovy, Fučíkův, Fučíkovo
Paradigm Karlův | Geo,Giv | Geo,Sur | Giv |
---|---|---|---|
Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing | Karlův | ||
Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing | Karlův | ||
Animacy=Inan|Case=Acc|Gender=Masc|Number=Plur | Karlovy | ||
Animacy=Inan|Case=Loc|Gender=Masc|Number=Plur | Karlových | ||
Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing | Karlův | ||
Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur | Karlovy | Karlovy | |
Case=Gen|Gender=Fem|Number=Sing | Karlovy | ||
Case=Gen|Gender=Fem|Number=Plur | Karlových | ||
Case=Loc|Gender=Fem|Number=Sing | Karlově | ||
Case=Nom|Gender=Neut|Number=Sing | Karlovo |
NameType
seems to be lexical feature of ADJ
. 97% lemmas (346) occur only with one value of NameType
.
ADP
4 ADP tokens (0% of all ADP
tokens) have a non-empty value of NameType
.
The most frequent other feature values with which ADP
and NameType
co-occurred: AdpType=Prep (4; 100%), Case=EMPTY (3; 75%).
ADP
tokens may have the following values of NameType
:
Com
(2; 50% of non-emptyNameType
): Pro, pourOth
(1; 25% of non-emptyNameType
): ausPro
(1; 25% of non-emptyNameType
): della
PART
4 PART tokens (0% of all PART
tokens) have a non-empty value of NameType
.
PART
tokens may have the following values of NameType
:
Geo
(3; 75% of non-emptyNameType
): el, LaOth
(1; 25% of non-emptyNameType
): Al
CCONJ
1 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of NameType
.
CCONJ
tokens may have the following values of NameType
:
Com
(1; 100% of non-emptyNameType
): and
PRON
1 PRON tokens (0% of all PRON
tokens) have a non-empty value of NameType
.
The most frequent other feature values with which PRON
and NameType
co-occurred: Case=Acc (1; 100%), Gender=Masc (1; 100%), Number=Plur (1; 100%), Person=EMPTY (1; 100%), PrepCase=EMPTY (1; 100%), PronType=Tot (1; 100%), Reflex=EMPTY (1; 100%), Variant=EMPTY (1; 100%).
PRON
tokens may have the following values of NameType
:
Com
(1; 100% of non-emptyNameType
): Tous
Relations with Agreement in NameType
The 10 most frequent relations where parent and child node agree in NameType
:
PROPN –[conj]–> PROPN (1114; 94%),
ADJ –[conj]–> ADJ (24; 89%),
PROPN –[dep]–> PROPN (7; 54%),
PROPN –[xcomp]–> PROPN (7; 100%),
ADJ –[flat:foreign]–> PROPN (4; 57%),
PROPN –[appos]–> PROPN (3; 60%),
PROPN –[obl]–> PROPN (3; 100%),
PROPN –[flat:foreign]–> ADJ (2; 100%),
PROPN –[flat]–> ADJ (2; 67%),
PROPN –[nsubj]–> PROPN (2; 67%).