Treebank Statistics: UD_Portuguese-GSD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
60313 tokens (19%) have a non-empty value of Number.
9505 types (31%) occur at least once with a non-empty value of Number.
7576 lemmas (53%) occur at least once with a non-empty value of Number.
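The token, type, and lemma counts above can be reproduced in spirit from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names `parse_conllu` and `feature_coverage` are hypothetical, "types" are assumed to be distinct word forms exactly as written, and the four-token sample is invented for illustration.

```python
def parse_conllu(text):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        feats = {}
        if cols[5] != "_":
            for pair in cols[5].split("|"):
                name, _, value = pair.partition("=")
                feats[name] = value
        yield cols[1], cols[2], cols[3], feats


def feature_coverage(tokens, feature="Number"):
    """Return (with_feature, total) pairs for tokens, types, and lemmas."""
    tokens = list(tokens)
    with_feat = [t for t in tokens if feature in t[3]]
    return {
        "tokens": (len(with_feat), len(tokens)),
        "types": (len({t[0] for t in with_feat}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_feat}), len({t[1] for t in tokens})),
    }


# Invented sample sentence; real input would be the treebank's .conllu files.
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "Os", "o", "DET", "_",
     "Definite=Def|Gender=Masc|Number=Plur|PronType=Art", "2", "det", "_", "_"),
    ("2", "anos", "ano", "NOUN", "_", "Gender=Masc|Number=Plur",
     "3", "nsubj", "_", "_"),
    ("3", "passam", "passar", "VERB", "_",
     "Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"),
    ("4", "devagarinho", "devagarinho", "ADV", "_", "_", "3", "advmod", "_", "_"),
])

stats = feature_coverage(parse_conllu(SAMPLE))
for level, (hit, total) in stats.items():
    print(f"{hit} {level} ({100 * hit / total:.0f}%) have a non-empty value")
```

Whether forms are case-folded before counting types is a design choice that changes the type count; the sketch leaves forms untouched.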
The feature is used with 12 part-of-speech tags: DET (38837; 12% instances), NOUN (8351; 3% instances), PROPN (5452; 2% instances), VERB (3154; 1% instances), ADJ (2077; 1% instances), PRON (1562; 0% instances), AUX (847; 0% instances), ADV (10; 0% instances), NUM (8; 0% instances), ADP (7; 0% instances), X (6; 0% instances), CCONJ (2; 0% instances).
DET
38837 DET tokens (82% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (38013; 98%), Definite=Def (37476; 96%), Gender=Masc (21023; 54%).
DET tokens may have the following values of Number:
Plur (5302; 14% of non-empty Number): as, os, seus, outros, suas, alguns, todos, todas, outras, vários
Sing (33535; 86% of non-empty Number): o, a, um, uma, sua, seu, esta, este, essa, esse
EMPTY (8765): os, um, uma, sua, seu, o, seus, cada, a, suas
Paradigm o | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc|PronType=Art | o, a | os |
Definite=Def|Gender=Fem|PronType=Art | a | as |
Gender=Masc|PronType=Art | o | os |
Gender=Masc|PronType=Dem | o | |
Gender=Fem|PronType=Art | a | as |
NOUN
8351 NOUN tokens (15% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (4644; 56%).
NOUN tokens may have the following values of Number:
Plur (2046; 25% of non-empty Number): anos, km, dias, pessoas, empresas, países, vezes, meses, pontos, minutos
Sing (6305; 75% of non-empty Number): feira, dia, ano, estado, presidente, acordo, país, governo, tempo, área
EMPTY (48239): anos, ano, dia, r, pessoas, presidente, cidade, acordo, governo, parte
Paradigm ano | Sing | Plur |
---|---|---|
| | ano | anos |
PROPN
5452 PROPN tokens (17% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur (75; 1% of non-empty Number): Estados, EUA, Jogos, Beatles, Set, APPs, Abid, Aflitos, Agentes, Ancestrais
Sing (5377; 99% of non-empty Number): the, Paulo, Brasil, São, Federal, of, &, Sul, Rio, Santos
EMPTY (26827): feira, Brasil, São, Paulo, rio, Nacional, Estado, janeiro, Federal, quinta
Paradigm R | Sing | Plur |
---|---|---|
| | R | R |
Number seems to be a lexical feature of PROPN. 100% of lemmas (3386) occur with only one value of Number.
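The "lexical feature" observation rests on counting, per lemma, how many distinct values of Number it takes. A minimal sketch of that check follows; the function name `single_value_lemma_share` and the sample tokens are hypothetical, and it assumes that only lemmas with at least one non-empty value are counted.

```python
from collections import defaultdict


def single_value_lemma_share(tokens, upos, feature="Number"):
    """Return (lemmas with exactly one value, lemmas with any value) for one UPOS tag.

    `tokens` is an iterable of (lemma, upos, feats) triples, where feats is a dict.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Invented sample: every PROPN lemma keeps one value, the NOUN lemma varies.
tokens = [
    ("Brasil", "PROPN", {"Number": "Sing"}),
    ("Brasil", "PROPN", {"Number": "Sing"}),
    ("EUA", "PROPN", {"Number": "Plur"}),
    ("Paulo", "PROPN", {"Number": "Sing"}),
    ("Rio", "PROPN", {}),  # empty value: ignored under the stated assumption
    ("ano", "NOUN", {"Number": "Sing"}),
    ("ano", "NOUN", {"Number": "Plur"}),
]

propn_single, propn_total = single_value_lemma_share(tokens, "PROPN")
noun_single, noun_total = single_value_lemma_share(tokens, "NOUN")
print(f"PROPN: {propn_single} of {propn_total} lemmas occur with only one value")
print(f"NOUN: {noun_single} of {noun_total} lemmas occur with only one value")
```

A share close to 100% suggests the value is a property of the lemma rather than of the inflected form, which is the sense in which the feature is called lexical here.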
VERB
3154 VERB tokens (11% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (2258; 72%).
VERB tokens may have the following values of Number:
Plur (729; 23% of non-empty Number): cobertos, podem, começaram, passam, têm, voltaram, considerados, continuam, devem, ficaram
Sing (2425; 77% of non-empty Number): disse, tem, acabou, chegou, começou, tornou, passou, afirmou, pode, voltou
EMPTY (24819): é, tem, disse, está, há, fazer, foi, estão, afirmou, partir
Paradigm ter | Sing | Plur |
---|---|---|
ExtPos=AUX|Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | Temos | |
ExtPos=AUX|Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | terá | |
ExtPos=AUX|Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | teve | |
ExtPos=AUX|Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | tem | |
Gender=Masc|VerbForm=Part | tido | |
Mood=Cnd|Person=3|VerbForm=Fin | teria | |
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | teremos | |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | tínhamos | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | tive | tivemos |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | tenho | Temos |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | terá | |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | tinha | tinham |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | teve | tiveram |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | tem | têm |
Mood=Ind|Person=3|VerbForm=Fin | tiveram | |
Mood=Sub|Person=1|Tense=Fut|VerbForm=Fin | tivermos | |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | tenha | tenham |
ADJ
2077 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (1155; 56%).
ADJ tokens may have the following values of Number:
Plur (477; 23% of non-empty Number): últimos, novos, principais, novas, primeiros, diferentes, maiores, pequenos, tropicais, anteriores
Sing (1600; 77% of non-empty Number): maior, grande, primeira, ex, primeiro, novo, segunda, última, último, melhor
EMPTY (12962): maior, grande, primeiro, primeira, novo, segundo, última, segunda, mesmo, nova
Paradigm primeiro | Sing | Plur |
---|---|---|
Gender=Masc | primeiro | primeiros |
Gender=Fem | primeira | primeiras |
PRON
1562 PRON tokens (20% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Gender=Masc (1103; 71%).
PRON tokens may have the following values of Number:
Plur (334; 21% of non-empty Number): que, se, eles, os, elas, quais, nós, outros, todos, los
Sing (1228; 79% of non-empty Number): que, se, o, ele, ela, isso, onde, a, quem, eu
EMPTY (6158): que, se, ele, isso, o, eu, um, ela, eles, quem
Paradigm que | Sing | Plur |
---|---|---|
Gender=Masc|PronType=Dem | que | |
Gender=Masc|PronType=Ind | que | |
Gender=Masc|PronType=Int | que | |
Gender=Masc|PronType=Rel | que | que |
Gender=Fem|PronType=Rel | que | que |
AUX
847 AUX tokens (12% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (821; 97%), Person=3 (818; 97%), Mood=Ind (751; 89%).
AUX tokens may have the following values of Number:
Plur (207; 24% of non-empty Number): foram, são, estão, serão, estavam, serem, eram, têm, vamos, irão
Sing (640; 76% of non-empty Number): é, foi, está, era, será, vai, estava, havia, tem, ter
EMPTY (6089): é, foi, ser, foram, será, são, vai, pode, sendo, deve
Paradigm ser | Sing | Plur |
---|---|---|
_ | ser | |
Gender=Masc|VerbForm=Part | sido | |
Mood=Cnd|Person=3|VerbForm=Fin | seria | seriam |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | fui | fomos |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | Sou | |
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | será | serão |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | eram |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | foi | foram |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | é | são |
Mood=Ind|Person=3|VerbForm=Fin | foram | |
Mood=Sub|Person=3|Tense=Fut|VerbForm=Fin | for | forem |
Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fosse | fossem |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | seja | sejam |
Person=3|VerbForm=Inf | ser | serem |
ADV
10 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Polarity=EMPTY (10; 100%).
ADV tokens may have the following values of Number:
Plur (2; 20% of non-empty Number): juntos
Sing (8; 80% of non-empty Number): Mal, Nada, caro, devagarinho, entanto, independente, pouco, quanto
EMPTY (9760): não, mais, também, já, ainda, muito, depois, onde, além, apenas
NUM
8 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (6; 75%).
NUM tokens may have the following values of Number:
Plur (1; 13% of non-empty Number): centenas
Sing (7; 88% of non-empty Number): 2012, 3, 470, cento, cem, sessenta
EMPTY (8504): dois, três, mil, duas, milhões, um, 1, quatro, 2012, 2
ADP
7 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Sing (7; 100% of non-empty Number): in, Contra, Pra, at, que
EMPTY (51219): de, em, a, para, por, com, como, entre, sobre, até
X
6 X tokens (1% of all X tokens) have a non-empty value of Number.
X tokens may have the following values of Number:
Sing (6; 100% of non-empty Number): \epsilon=\epsilon_{0}, \kappa, center, market, on, spin
EMPTY (397): disso, deles, delas, dele, do, +, etc, @, comigo, nele
CCONJ
2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Number.
CCONJ tokens may have the following values of Number:
Sing (2; 100% of non-empty Number): &, E
EMPTY (10424): e, que, mas, ou, se, quando, como, porque, enquanto, pois
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
PROPN –[flat:name]–> PROPN (1877; 98%),
NOUN –[amod]–> ADJ (1662; 98%),
NOUN –[nmod]–> NOUN (1481; 66%),
VERB –[obl]–> NOUN (824; 53%),
VERB –[nsubj]–> NOUN (744; 91%),
NOUN –[nmod]–> PROPN (567; 77%),
VERB –[nsubj]–> PRON (484; 92%),
PROPN –[conj]–> PROPN (460; 98%),
PROPN –[nmod]–> PROPN (443; 98%),
NOUN –[appos]–> PROPN (392; 88%).
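
Agreement counts of this kind can be approximated by walking every dependency edge and comparing the Number value of the parent and the child. The sketch below is illustrative only: the function name `agreement_counts`, the token dictionaries, and the two sample sentences are hypothetical, and it assumes that each percentage is the share of agreeing pairs among pairs where both nodes have a non-empty value, which this page does not state explicitly.

```python
from collections import Counter


def agreement_counts(sentences, feature="Number"):
    """Count agreeing and comparable parent-child pairs per relation pattern.

    Each sentence is a list of dicts with keys: id, head, upos, deprel, feats.
    Keys of the returned counters are (parent_upos, deprel, child_upos).
    """
    agree, comparable = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # the root has head 0, which is not a token
            parent_value = parent["feats"].get(feature)
            child_value = child["feats"].get(feature)
            if parent_value is None or child_value is None:
                continue  # assumption: pairs with an empty value are not counted
            key = (parent["upos"], child["deprel"], child["upos"])
            comparable[key] += 1
            if parent_value == child_value:
                agree[key] += 1
    return agree, comparable


# Invented sample: one agreeing amod pair and one disagreeing nmod pair.
sentences = [
    [
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Number": "Plur"}},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Number": "Plur"}},
    ],
    [
        {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Number": "Sing"}},
        {"id": 2, "head": 1, "upos": "NOUN", "deprel": "nmod", "feats": {"Number": "Plur"}},
    ],
]

agree, comparable = agreement_counts(sentences)
for (parent_upos, deprel, child_upos), total in comparable.items():
    hits = agree[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({hits}; {100 * hits / total:.0f}%)")
```

Sorting the agreeing counts in descending order and keeping the first ten entries would give a list shaped like the one above.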