Treebank Statistics: UD_Catalan-AnCora: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
261785 tokens (48%) have a non-empty value of Number
.
20436 types (63%) occur at least once with a non-empty value of Number
.
11481 lemmas (49%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (89723; 16% instances), DET (87168; 16% instances), ADJ (29667; 5% instances), VERB (25265; 5% instances), AUX (20479; 4% instances), PRON (6820; 1% instances), NUM (2655; 0% instances), PROPN (8; 0% instances).
NOUN
89723 NOUN tokens (91% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Plur
(28005; 31% of non-emptyNumber
): anys, milions, pessetes, persones, obres, mesos, joves, dies, euros, empresesSing
(61718; 69% of non-emptyNumber
): any, president, part, terme, grup, projecte, cap, lloc, cas, portaveuEMPTY
(8923): any, través, temps, juny, partir, dia, fa, tal, maig, mes
Paradigm any | Sing | Plur |
---|---|---|
any | anys |
DET
87168 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Art (68418; 78%), Definite=Def (63638; 73%).
DET
tokens may have the following values of Number
:
Plur
(18491; 21% of non-emptyNumber
): els, les, seus, altres, aquests, seves, tots, aquestes, uns, diferentsSing
(68677; 79% of non-emptyNumber
): el, la, l’, un, una, aquest, seva, aquesta, seu, totEMPTY
(99): meva, prou, gaire, massa, meves, cada, força, teus, teva, Que
Paradigm el | Sing | Plur |
---|---|---|
Definite=Def|Foreign=Yes|Gender=Masc|PronType=Art | el | |
Definite=Def|Gender=Masc|PronType=Art | el | els |
Definite=Def|Gender=Fem|PronType=Art | la, L' | les |
Definite=Def|PronType=Art | l' | |
Definite=Ind|Gender=Fem|PronType=Art | la | |
Definite=Ind|PronType=Art | l' | |
Gender=Masc|PronType=Art | el | els |
Gender=Fem|Person=3|Poss=Yes|PronType=Prs | les | |
Gender=Fem|PronType=Art | la | les |
PronType=Art | l' |
ADJ
29667 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: VerbForm=EMPTY (24078; 81%).
ADJ
tokens may have the following values of Number
:
Plur
(9066; 31% of non-emptyNumber
): grans, principals, importants, municipals, noves, nous, socials, locals, últims, culturalsSing
(20601; 69% of non-emptyNumber
): gran, general, passat, primer, nou, primera, actual, nova, important, socialEMPTY
(415): baix, gran, clau, especial, directe, nord, pilot, límit, xàrter, sud
Paradigm nou | Sing | Plur |
---|---|---|
Gender=Masc | nou | nous |
Gender=Fem | nova | noves |
VERB
25265 VERB tokens (60% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Gender=EMPTY (18450; 73%), VerbForm=Fin (18450; 73%), Person=3 (17604; 70%), Mood=Ind (16136; 64%), Tense=Pres (13043; 52%).
VERB
tokens may have the following values of Number
:
Plur
(5346; 21% of non-emptyNumber
): tenen, fan, tenim, faran, volen, van, formen, consideren, destaquen, volemSing
(19919; 79% of non-emptyNumber
): té, ha, fa, fet, explicat, dit, considera, cal, farà, volEMPTY
(16633): fer, dir, tenir, donar, arribar, aconseguir, veure, passar, presentar, deixar
Paradigm fer | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | fet | |
Gender=Fem|Tense=Past|VerbForm=Part | fetes | |
Mood=Cnd|Person=1|VerbForm=Fin | faria | faríem |
Mood=Cnd|Person=3|VerbForm=Fin | faria | farien |
Mood=Imp|Person=1|VerbForm=Fin | fem | |
Mood=Imp|Person=3|VerbForm=Fin | Faci | |
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | faré | farem |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | feia | fèiem |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | faig | fem |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | fas | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | farà | faran |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | feia | feien |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | fa | fan |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | faci | |
Mood=Sub|Person=2|Tense=Pres|VerbForm=Fin | feu | |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fes | fessin |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | faci | facin |
AUX
20479 AUX tokens (93% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (19829; 97%), Person=3 (19284; 94%), Mood=Ind (18754; 92%), Tense=Pres (17562; 86%).
AUX
tokens may have the following values of Number
:
Plur
(4881; 24% of non-emptyNumber
): van, han, són, estan, hem, poden, havien, seran, podran, erenSing
(15598; 76% of non-emptyNumber
): va, ha, és, estat, està, havia, pot, serà, era, siguiEMPTY
(1573): ser, haver, poder, estar, sent, saber, anar, essent, estant, havent
Paradigm haver | Sing | Plur |
---|---|---|
Gender=Masc|Tense=Past|VerbForm=Part | hagut | |
Mood=Cnd|Person=1|VerbForm=Fin | hauríem | |
Mood=Cnd|Person=3|VerbForm=Fin | hauria | haurien |
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | hauré | haurem |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | havia | havíem |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | he | hem |
Mood=Ind|Person=2|Tense=Fut|VerbForm=Fin | haureu | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | has | heu |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | haurà | hauran |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | havia | havien |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | ha | han |
Mood=Sub|Person=1|Tense=Imp|VerbForm=Fin | haguéssim | |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | haguem, hàgim | |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | hagués | haguessin, haguéssin |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | hagi | hagin |
PRON
6820 PRON tokens (29% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (6820; 100%), PrepCase=EMPTY (6594; 97%), Case=EMPTY (4112; 60%), Person=EMPTY (3657; 54%).
PRON
tokens may have the following values of Number
:
Plur
(1873; 27% of non-emptyNumber
): els, ens, quals, altres, uns, ells, les, los, alguns, nosaltresSing
(4947; 73% of non-emptyNumber
): un, li, tot, això, ho, qual, la, el, l’, ellEMPTY
(16634): que, es, s’, hi, se, on, què, qui, n’, en
Paradigm ell | Sing | Plur |
---|---|---|
Case=Acc,Dat | els, los, 'ls | |
Case=Acc|Gender=Masc | el, lo, 'l, -lo, l | |
Case=Acc|Gender=Fem,Masc | l' | |
Case=Acc|Gender=Fem | la, -la | les |
Case=Acc|Gender=Neut | ho, -ho | |
Case=Acc | els, 'ls | |
Case=Dat | li | els |
Gender=Masc | ell | ells |
Gender=Fem | ella | elles |
NUM
2655 NUM tokens (27% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (2655; 100%), NumForm=Word (2653; 100%), Gender=EMPTY (1345; 51%).
NUM
tokens may have the following values of Number
:
Plur
(2219; 84% of non-emptyNumber
): dos, tres, dues, quatre, cinc, sis, set, vuit, deu, nouSing
(436; 16% of non-emptyNumber
): un, una, mig, mitja, doble, quart, triple, desena, X, cinquenaEMPTY
(7307): cent, 10, 15, 30, 5, 20, 2, 12, 4, 50
Paradigm dos | Sing | Plur |
---|---|---|
Gender=Masc | dos | dos |
Gender=Fem | dues | |
dos |
Number
seems to be lexical feature of NUM
. 95% lemmas (62) occur only with one value of Number
.
PROPN
8 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Sing
(8; 100% of non-emptyNumber
): Seu, Cobain, Companyia, Font, Justícia, Kurt, PlaEMPTY
(46582): Catalunya, Barcelona, Generalitat, Govern, sant, Ajuntament, Girona, Josep, CiU, PP
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[det]–> DET (65326; 96%),
NOUN –[amod]–> ADJ (22825; 97%),
NOUN –[nmod]–> NOUN (13623; 54%),
VERB –[nsubj]–> NOUN (8233; 66%),
NOUN –[conj]–> NOUN (4055; 78%),
NOUN –[acl]–> VERB (3985; 52%),
ADJ –[cop]–> AUX (1672; 86%),
VERB –[conj]–> VERB (1589; 68%),
DET –[det]–> DET (1493; 98%),
NOUN –[cop]–> AUX (1486; 70%).