Treebank Statistics: UD_Portuguese-PetroGold: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
148375 tokens (59%) have a non-empty value of Number
.
14142 types (93%) occur at least once with a non-empty value of Number
.
9311 lemmas (89%) occur at least once with a non-empty value of Number
.
The feature is used with 10 part-of-speech tags: NOUN (57549; 23% instances), DET (36346; 15% instances), ADJ (17069; 7% instances), VERB (16246; 6% instances), PROPN (11944; 5% instances), AUX (5434; 2% instances), PRON (3513; 1% instances), ADV (217; 0% instances), NUM (56; 0% instances), X (1; 0% instances).
NOUN
57549 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Masc (28798; 50%).
NOUN
tokens may have the following values of Number
:
Plur
(16037; 28% of non-emptyNumber
): fluidos, dados, resultados, valores, fácies, propriedades, emissões, custos, poços, característicasSing
(41512; 72% of non-emptyNumber
): óleo, água, figura, fluido, petróleo, gás, produção, área, argila, processoEMPTY
(13): place, ,, cima, cm, d’água, e, hk, Å
Paradigm óleo | Sing | Plur |
---|---|---|
_ | Óleo | |
Gender=Masc | óleo | óleos |
Gender=Fem | óleo |
DET
36346 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Art (31768; 87%), Definite=Def (29026; 80%), Gender=Fem (18204; 50%).
DET
tokens may have the following values of Number
:
Plur
(8217; 23% of non-emptyNumber
): os, as, estes, estas, suas, esses, todos, tais, essas, outrosSing
(28129; 77% of non-emptyNumber
): a, o, um, uma, este, esta, sua, esse, cada, seu
Paradigm o | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc|PronType=Art | o | os |
Definite=Def|Gender=Fem|PronType=Art | a, á | as, A |
Gender=Masc | o | |
Gender=Masc|PronType=Art | os |
ADJ
17069 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Gender=Fem (8545; 50%).
ADJ
tokens may have the following values of Number
:
Plur
(5977; 35% of non-emptyNumber
): diferentes, principais, grandes, maiores, presentes, magnéticos, magnéticas, sedimentares, químicos, altasSing
(11092; 65% of non-emptyNumber
): maior, grande, menor, possível, magnético, total, natural, magnética, presente, necessárioEMPTY
(11): subsea, primeira, próximo
Paradigm maior | Sing | Plur |
---|---|---|
Gender=Masc | maior | maiores |
Gender=Fem | maior | maiores |
VERB
16246 VERB tokens (80% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Voice=EMPTY (12245; 75%), Tense=EMPTY (8939; 55%), Mood=EMPTY (8854; 54%), Person=EMPTY (8784; 54%), VerbForm=Part (8776; 54%).
VERB
tokens may have the following values of Number
:
Plur
(5911; 36% of non-emptyNumber
): podem, apresentam, utilizados, obtidos, apresentados, possuem, realizados, associados, ocorrem, preparadosSing
(10335; 64% of non-emptyNumber
): pode, devido, apresenta, utilizado, tem, deve, mostra, ocorre, possui, sejaEMPTY
(4113): partir, utilizando, observar, seguir, aumentar, podendo, obter, formando, apresentar, contendo
Paradigm poder | Sing | Plur |
---|---|---|
Mood=Cnd|Person=3|VerbForm=Fin | poderia | poderiam |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | podemos | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | poderá | poderão |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | podiam | |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | pôde | puderam |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | pode | podem |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | pudesse | pudessem |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | possa | possam |
Person=1|VerbForm=Inf | podermos | |
Person=3|VerbForm=Inf | poderem |
PROPN
11944 PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Plur
(219; 2% of non-emptyNumber
): RCEs, GPM, estados, ARGILAS, Formações, MW, Barras, Camadas, Campos, CartasSing
(11725; 98% of non-emptyNumber
): et, al., CO2, Bacia, Cabo, Frio, Santos, Campos, &, grandeEMPTY
(69): ., ,, -, /, Quiricó, +, E, ;, =, cm2/g=
Paradigm Santos | Sing | Plur |
---|---|---|
_ | Santos | |
Gender=Masc | Santos, SANTOS | Santos |
Number
seems to be lexical feature of PROPN
. 98% lemmas (3116) occur only with one value of Number
.
AUX
5434 AUX tokens (83% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Person=3 (5428; 100%), VerbForm=Fin (5342; 98%), Mood=Ind (5079; 93%), Tense=Pres (3483; 64%).
AUX
tokens may have the following values of Number
:
Plur
(2032; 37% of non-emptyNumber
): são, foram, estão, serão, serem, eram, sejam, têm, seriam, estejamSing
(3402; 63% of non-emptyNumber
): é, foi, está, será, era, seja, seria, tem, for, iráEMPTY
(1140): ser, sendo, estar, sido, ter, tendo, estando, é, e, estado
Paradigm ser | Sing | Plur |
---|---|---|
Gender=Masc|VerbForm=Part | sido | |
Mood=Cnd|Person=3|VerbForm=Fin | seria | seriam |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | será | serão |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | eram |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | foi | |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | é, È, Ǻ | são |
Mood=Ind|Person=3|VerbForm=Fin | foram | |
Mood=Sub|Person=3|Tense=Fut|VerbForm=Fin | for, sera | forem |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fosse | fossem |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | seja | sejam |
Person=3|VerbForm=Inf | ser | serem |
Tense=Pres|VerbForm=Fin | é | |
VerbForm=Part | sido |
PRON
3513 PRON tokens (65% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Gender=Masc (2288; 65%), PronType=Rel (1988; 57%).
PRON
tokens may have the following values of Number
:
Plur
(1116; 32% of non-emptyNumber
): que, eles, estes, elas, os, quais, outros, as, estas, aquelesSing
(2397; 68% of non-emptyNumber
): que, o, isso, a, isto, este, qual, um, uma, estaEMPTY
(1886): se, um
Paradigm que | Sing | Plur |
---|---|---|
Gender=Masc | que | que |
Gender=Fem | que | que |
que |
ADV
217 ADV tokens (3% of all ADV
tokens) have a non-empty value of Number
.
ADV
tokens may have the following values of Number
:
Plur
(44; 20% of non-emptyNumber
): ondeSing
(173; 80% of non-emptyNumber
): onde, Antes, SIM, melhorEMPTY
(6225): mais, não, também, através, já, muito, assim, bem, ainda, além
Paradigm onde | Sing | Plur |
---|---|---|
Gender=Masc | Onde | |
Gender=Masc|PronType=Rel | onde | onde |
Gender=Fem|PronType=Rel | onde | onde |
NUM
56 NUM tokens (1% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=EMPTY (36; 64%).
NUM
tokens may have the following values of Number
:
Sing
(56; 100% of non-emptyNumber
): 1, 19, 2.3, 4, 8, II.7, III.2, ii, 36º, 43ºEMPTY
(7234): dois, 1, 3, 2, 5, 10, duas, três, 4, 2005
Number
seems to be lexical feature of NUM
. 100% lemmas (54) occur only with one value of Number
.
X
1 X tokens (0% of all X
tokens) have a non-empty value of Number
.
The most frequent other feature values with which X
and Number
co-occurred: Foreign=EMPTY (1; 100%).
X
tokens may have the following values of Number
:
Plur
(1; 100% of non-emptyNumber
): drill-inEMPTY
(215): in, drill, n, flow, core, ., booster, pin, situ, stripe
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[det]–> DET (32950; 100%),
NOUN –[amod]–> ADJ (14486; 99%),
NOUN –[nmod]–> NOUN (13126; 64%),
VERB –[obl]–> NOUN (4218; 53%),
NOUN –[acl]–> VERB (4088; 93%),
PROPN –[flat:name]–> PROPN (3671; 96%),
VERB –[nsubj]–> NOUN (3651; 93%),
NOUN –[conj]–> NOUN (3363; 77%),
VERB –[aux:pass]–> AUX (2703; 80%),
VERB –[nsubj:pass]–> NOUN (2492; 90%).