Treebank Statistics: UD_Portuguese-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
13912 tokens (59%) have a non-empty value of Number
.
5371 types (91%) occur at least once with a non-empty value of Number
.
3632 lemmas (96%) occur at least once with a non-empty value of Number
.
The feature is used with 9 part-of-speech tags: NOUN (4600; 20% instances), DET (3540; 15% instances), ADJ (1550; 7% instances), VERB (1473; 6% instances), PROPN (1393; 6% instances), AUX (681; 3% instances), PRON (640; 3% instances), NUM (24; 0% instances), ADP (11; 0% instances).
NOUN
4600 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Masc (2528; 55%).
NOUN
tokens may have the following values of Number
:
Plur
(1340; 29% of non-emptyNumber
): anos, pessoas, vezes, estados, meses, ações, dados, partes, terras, diasSing
(3260; 71% of non-emptyNumber
): vez, guerra, ano, parte, governo, cidade, estado, mundo, acordo, século
DET
3540 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Art (3220; 91%), Definite=EMPTY (3040; 86%), Gender=Masc (1879; 53%).
DET
tokens may have the following values of Number
:
Plur
(769; 22% of non-emptyNumber
): os, as, muitos, várias, outras, muitas, outros, vários, alguns, estesSing
(2771; 78% of non-emptyNumber
): o, a, um, uma, esta, este, cada, isso, outro, mesmoEMPTY
(1): uma
Paradigm o | Sing | Plur |
---|---|---|
Case=Acc|Definite=Def|Person=1 | os | |
Case=Dat|Definite=Def|Person=1 | os | |
Definite=Def|Gender=Masc | o | os |
Definite=Def|Gender=Fem | a | as, os |
Gender=Masc | o | os |
Gender=Fem | a | as |
ADJ
1550 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Gender=Masc (843; 54%).
ADJ
tokens may have the following values of Number
:
Plur
(477; 31% of non-emptyNumber
): novos, grandes, últimos, mais, Unidos, agrícolas, indígenas, políticos, Olímpicos, americanosSing
(1073; 69% of non-emptyNumber
): grande, primeira, maior, nova, mais, primeiro, nacional, novo, melhor, segunda
VERB
1473 VERB tokens (72% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Gender=EMPTY (1116; 76%), Person=3 (1063; 72%), Mood=Ind (1042; 71%).
VERB
tokens may have the following values of Number
:
Plur
(423; 29% of non-emptyNumber
): têm, estão, incluem, começaram, dizem, tinham, conquistaram, decidiram, fornecem, ocorreramSing
(1050; 71% of non-emptyNumber
): disse, há, tem, começou, diz, é, está, fez, tornou, devidoEMPTY
(559): fazer, ter, partir, incluindo, manter, ajudar, criar, levar, deixar, encontrar
PROPN
1393 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Foreign=EMPTY (1229; 88%), Gender=Masc (826; 59%).
PROPN
tokens may have the following values of Number
:
Plur
(36; 3% of non-emptyNumber
): EUA, Alpes, Andes, Balcãs, Kitai, Américas, Antillas, Caribs, Estados, FilipinasSing
(1357; 97% of non-emptyNumber
): China, Trump, Mediterrâneo, América, the, Austrália, Europa, França, Grécia, Hong
Number
seems to be lexical feature of PROPN
. 100% lemmas (995) occur only with one value of Number
.
AUX
681 AUX tokens (84% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Person=3 (644; 95%), Mood=Ind (602; 88%).
AUX
tokens may have the following values of Number
:
Plur
(215; 32% of non-emptyNumber
): foram, são, estão, podem, tinham, eram, estavam, têm, poderiam, serãoSing
(466; 68% of non-emptyNumber
): é, foi, está, pode, tinha, estava, era, tem, seria, poderiaEMPTY
(126): ser, sido, ter, estar, sendo, tendo, tornar, tornado, tornando, começado
PRON
640 PRON tokens (69% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Person=3 (461; 72%), Number[psor]=EMPTY (406; 63%), PronType=EMPTY (394; 62%), Case=EMPTY (389; 61%), Gender=Masc (367; 57%).
PRON
tokens may have the following values of Number
:
Plur
(156; 24% of non-emptyNumber
): eles, suas, seus, quais, nós, estes, aqueles, elas, os, vocêsSing
(484; 76% of non-emptyNumber
): ele, sua, seu, ela, o, eu, qual, isso, isto, loEMPTY
(294): que, se, quem, si
NUM
24 NUM tokens (5% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: Gender=EMPTY (21; 88%).
NUM
tokens may have the following values of Number
:
Plur
(18; 75% of non-emptyNumber
): milhões, bilhões, bnSing
(6; 25% of non-emptyNumber
): bilhão, bn, milhão, um, CincoEMPTY
(445): dois, um, três, duas, quatro, uma, 10, 3, seis, 1
ADP
11 ADP tokens (0% of all ADP
tokens) have a non-empty value of Number
.
ADP
tokens may have the following values of Number
:
Plur
(2; 18% of non-emptyNumber
): Aqueles, nestesSing
(9; 82% of non-emptyNumber
): a, nessa, nesse, consigo, daquelaEMPTY
(3806): de, em, a, para, por, com, como, que, durante, entre
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[det]–> DET (3040; 100%),
NOUN –[amod]–> ADJ (1277; 100%),
NOUN –[nmod]–> NOUN (728; 60%),
VERB –[nsubj]–> NOUN (431; 83%),
PROPN –[det]–> DET (355; 99%),
NOUN –[nmod]–> PROPN (249; 71%),
NOUN –[det]–> PRON (232; 100%),
NOUN –[conj]–> NOUN (201; 77%),
VERB –[aux:pass]–> AUX (183; 80%),
VERB –[nsubj]–> PRON (169; 53%).