Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes
.
192 tokens (2%) have a non-empty value of Abbr
.
37 types (1%) occur at least once with a non-empty value of Abbr
.
44 lemmas (1%) occur at least once with a non-empty value of Abbr
.
The feature is used with 11 part-of-speech tags: NOUN (74; 1% instances), ADP (38; 0% instances), DET (37; 0% instances), PROPN (10; 0% instances), X (8; 0% instances), NUM (7; 0% instances), ADV (6; 0% instances), ADJ (4; 0% instances), PRON (3; 0% instances), VERB (3; 0% instances), SYM (2; 0% instances).
NOUN
74 NOUN tokens (3% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Number=Sing (61; 82%), Animacy=EMPTY (51; 69%).
NOUN
tokens may have the following values of Abbr
:
Yes
(74; 100% of non-emptyAbbr
): l, př, km, m, CEST, hodź, jan, dr, nakł, přirEMPTY
(2469): město, rěč, woda, rěčow, lěta, stolica, lěće, kilometrow, mócnarstwo, nastawki
Abbr
seems to be lexical feature of NOUN
. 100% lemmas (12) occur only with one value of Abbr
.
ADP
38 ADP tokens (3% of all ADP
tokens) have a non-empty value of Abbr
.
ADP
tokens may have the following values of Abbr
:
Yes
(38; 100% of non-emptyAbbr
): př, nEMPTY
(1060): w, na, z, wot, za, do, k, přez, po, při
DET
37 DET tokens (11% of all DET
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which DET
and Abbr
co-occurred: Case=Ins (36; 97%), Number=Sing (36; 97%), Number[psor]=Plur (36; 97%), Person=1 (36; 97%), Poss=Yes (36; 97%), PronType=Prs (36; 97%), Animacy=EMPTY (33; 89%), Gender=Fem (31; 84%).
DET
tokens may have the following values of Abbr
:
Yes
(37; 100% of non-emptyAbbr
): nEMPTY
(290): kotrež, tute, jeho, jich, kotryž, wjele, kotraž, tutón, swoje, tuta
PROPN
10 PROPN tokens (2% of all PROPN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PROPN
and Abbr
co-occurred: Animacy=EMPTY (10; 100%), Gender=EMPTY (7; 70%), Number=EMPTY (7; 70%), Case=EMPTY (6; 60%).
PROPN
tokens may have the following values of Abbr
:
Yes
(10; 100% of non-emptyAbbr
): C, GNU, CET, KPD, OZN, H, ISGV, NDREMPTY
(586): Mezopotamiskeje, Aššur, Mezopotamiska, Mezopotamiskej, Sumeričanow, Wikimedia, Łužicy, Europje, Assur, Assyriska
X
8 X tokens (4% of all X
tokens) have a non-empty value of Abbr
.
X
tokens may have the following values of Abbr
:
Yes
(8; 100% of non-emptyAbbr
): APG, DDR, PD, SORBISCHES, dr, m, mjEMPTY
(191): a, i, Vitis, al, backen, o, H, au, b, c
NUM
7 NUM tokens (2% of all NUM
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NUM
and Abbr
co-occurred: NumType=Card (7; 100%).
NUM
tokens may have the following values of Abbr
:
Yes
(7; 100% of non-emptyAbbr
): III, Mio, 02625EMPTY
(375): 2, 1, 6, 4, 3, jedyn, 5, 7, I, 000
ADV
6 ADV tokens (1% of all ADV
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADV
and Abbr
co-occurred: Degree=EMPTY (5; 83%), PronType=EMPTY (5; 83%).
ADV
tokens may have the following values of Abbr
:
Yes
(6; 100% of non-emptyAbbr
): resp, atd, łać, jendź, tEMPTY
(529): tež, tak, hišće, zwjetša, hač, něhdźe, hižo, tu, wjace, najprjedy
ADJ
4 ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Animacy=EMPTY (4; 100%), Degree=EMPTY (3; 75%), Number=Sing (3; 75%).
ADJ
tokens may have the following values of Abbr
:
Yes
(4; 100% of non-emptyAbbr
): d, jendź, mj, zEMPTY
(1417): serbski, druhe, druhich, najwjetše, prěni, prěnje, serbskeje, Serbskeho, wulke, wulki
PRON
3 PRON tokens (1% of all PRON
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PRON
and Abbr
co-occurred: Gender=Neut (3; 100%), Number=Sing (3; 100%), Person=EMPTY (3; 100%), Reflex=EMPTY (3; 100%), Case=EMPTY (2; 67%), PronType=EMPTY (2; 67%).
PRON
tokens may have the following values of Abbr
:
Yes
(3; 100% of non-emptyAbbr
): tEMPTY
(335): so, to, toho, tym, wona, wón, kiž, je, wone, wono
VERB
3 VERB tokens (0% of all VERB
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which VERB
and Abbr
co-occurred: Mood=Ind (3; 100%), Number=Sing (3; 100%), Person=3 (3; 100%), Tense=Pres (3; 100%), VerbForm=Fin (3; 100%).
VERB
tokens may have the following values of Abbr
:
Yes
(3; 100% of non-emptyAbbr
): mj, rEMPTY
(819): ma, leži, móže, wobsahuje, móžeš, su, hlej, maja, rěči, běchu
SYM
2 SYM tokens (6% of all SYM
tokens) have a non-empty value of Abbr
.
SYM
tokens may have the following values of Abbr
:
Yes
(2; 100% of non-emptyAbbr
): O2, O3EMPTY
(30): °, %, ‘, *, ², ³, †, :, =, ~
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
NOUN –[det]–> DET (36; 97%),
NOUN –[case]–> ADP (35; 90%),
PRON –[fixed]–> VERB (3; 100%),
ADV –[fixed]–> ADJ (1; 100%),
SYM –[conj]–> SYM (1; 100%),
X –[fixed]–> X (1; 100%).