Treebank Statistics: UD_Irish-IDT: POS Tags: VERB
There are 445 VERB
lemmas (5%), 1709 VERB
types (11%) and 8775 VERB
tokens (8%).
Out of 17 observed tags, the rank of VERB
is: 5 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent VERB
lemmas: bí, cuir, déan, tabhair, tar, bain, abair, féad, faigh, téigh
The 10 most frequent VERB
types: tá, bhí, atá, bhfuil, raibh, beidh, bheidh, mbeadh, níl, mbeidh
The 10 most frequent ambiguous lemmas: bí (VERB 3697, INTJ 1), déan (VERB 381, NOUN 1), tar (VERB 191, ADP 96), meas (VERB 40, NOUN 15), ceap (VERB 32, NOUN 1), úsáid (NOUN 65, VERB 29), scríobh (VERB 26, NOUN 12), dar (VERB 25, ADP 6, SCONJ 1), ar (ADP 3229, PART 42, ADV 35, VERB 17, PRON 6, SCONJ 4), inis (VERB 16, PROPN 7, NOUN 1)
The 10 most frequent ambiguous types: bhfuil (VERB 381, NOUN 1), cuireadh (VERB 43, NOUN 13), dar (VERB 13, AUX 5, ADP 2, SCONJ 1), scríobh (VERB 16, NOUN 11), ar (ADP 2733, AUX 43, PART 35, ADV 33, VERB 15, PRON 6), thosaigh (VERB 11, NOUN 1), bronnadh (VERB 6, NOUN 3), cailleadh (VERB 6, NOUN 1), rith (NOUN 43, VERB 6), ceapadh (VERB 6, NOUN 4)
- bhfuil
- cuireadh
- dar
- VERB 13: Botún mór atá ansin , dar liom .
- AUX 5: Bhíodh colún rialta ag Máire dar theideal ‘ An Fáinne ‘ ar an Irish Independent sna blianta 1927 agus 1928 .
- ADP 2: An mhaidin dar gcionn , dhúisigh mé go tobann agus d’ amharc mé timpeall orm .
- SCONJ 1: Ba í Dearbhla Ní Bhrolacháin a léirigh an seó agus dar ndóigh tá aithne mhaith ar Dhearbhla mar amhránaí den scoth agus is cuimhin linn ar fad an dlúthdhiosca blasta a d’ eisigh sí roinnt blianta ó shin .
- scríobh
- ar
- ADP 2733: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
- AUX 43: Tá cúpla rud eile sa leabhar seo ar mhaith liom tagairt dóibh .
- PART 35: De ribeoga olla ar chuir mé casadh iontu a rinne mé na buaiceacha .
- ADV 33: Scairt mé ar ais air .
- VERB 15: ’ Tá téagar sna páirteanna a fhaigheann tú i scannán , ‘ ar sise .
- PRON 6: Bhí cloch mhór chuimhne , dúradh leis , ar na mbóthar go Páras a raibh ainm gach ar thit is nach dtángthas ar an gcorpán greanta uirthi .
- thosaigh
- bronnadh
- cailleadh
- rith
- ceapadh
Morphology
The form / lemma ratio of VERB
is 3.840449 (the average of all parts of speech is 1.648496).
The 1st highest number of forms (71) was observed with the lemma “bí”: Bheinn, Bhíomar, Bím, Bítear, Nílim, Táimse, ata, atá, atáid, atáim, atáimse, atáthar, beadh, beidh, beifear, beimid, bheadh, bheas, bheidh, bheidís, bheifeá, bheifí, bheimid, bheimís, bheitheá, bhfuil, bhfuilid, bhfuilimid, bhfuilimíd, bhfuiltear, bhéas, bhéidh, bhí, bhídís, bhímid, bhíodar, bhíodh, bhíonn, bhíos, bhíothas, bígí, bímid, bíodh, bíonn, fuil, mbeadh, mbeidh, mbeidís, mbeifear, mbeifeá, mbeimid, mbeimis, mbeinn, mbímid, mbímis, mbínn, mbínnse, mbíodh, mbíonn, níl, nílirse, rabhadar, rabhamar, rabhas, rabhthas, raibh, tá, táid, táim, táimid, táthar.
The 2nd highest number of forms (43) was observed with the lemma “déan”: Déanaimid, Rinneadar, deineadh, dhearna, dhein, dheánann, dhineann, dhéanadh, dhéanann, dhéananna, dhéanfadh, dhéanfaidh, dhéanfaidís, dhéanfainn, dhéanfar, dhéanfaí, dhéanfá, dhéantar, dineadh, dintar, déan, déanadh, déanaim, déanann, déanfaidh, déanfaimid, déanfar, déanfidh, déantar, ndearna, ndearnadh, ndintar, ndéanadh, ndéanann, ndéanfadh, ndéanfaidh, ndéanfaidís, ndéanfar, ndéanfaí, ndéantar, rinne, rinneadh, rinneamar.
The 3rd highest number of forms (35) was observed with the lemma “tabhair”: ‘thug, Tabharfadsa, Thugamar, Tugaim, dtabharfadh, dtabharfaidh, dtabharfar, dtabharfaí, dtug, dtugadh, dtugann, dtugtar, dtugtaí, tabhair, tabharfaidh, tabharfar, thabharfadh, thabharfaidh, thabharfar, thabharfas, thabharfaídh, thabharfá, thug, thugadar, thugadh, thugaidís, thugaimid, thugainn, thugann, thugtar, thugtaí, tugadh, tugaimid, tugann, tugtar.
VERB
occurs with 10 features: Mood (8650; 99% instances), Tense (8122; 93% instances), Form (4511; 51% instances), Person (1918; 22% instances), Polarity (659; 8% instances), PronType (598; 7% instances), Number (518; 6% instances), Aspect (299; 3% instances), Typo (22; 0% instances), Dialect (20; 0% instances)
VERB
occurs with 28 feature-value pairs: Aspect=Hab
, Aspect=Imp
, Dialect=Munster
, Form=Direct
, Form=Direct,Emp
, Form=Ecl
, Form=Ecl,Emp
, Form=Emp
, Form=Emp,Len
, Form=HPref
, Form=Len
, Mood=Cnd
, Mood=Cnd,Int
, Mood=Imp
, Mood=Ind
, Mood=Sub
, Number=Plur
, Number=Sing
, Person=0
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, PronType=Rel
, Tense=Fut
, Tense=Past
, Tense=Pres
, Typo=Yes
VERB
occurs with 191 feature combinations.
The most frequent feature combination is Form=Len|Mood=Ind|Tense=Past
(1496 tokens).
Examples: bhí, tháinig, thug, chuir, chuaigh, bhain, tharla, chaith, chonaic, fhág
Relations
VERB
nodes are attached to their parents using 17 different relations: root (3480; 40% instances), acl:relcl (2051; 23% instances), advcl (917; 10% instances), conj (767; 9% instances), ccomp (630; 7% instances), csubj:cleft (342; 4% instances), parataxis (340; 4% instances), csubj:cop (162; 2% instances), acl (44; 1% instances), xcomp (23; 0% instances), dislocated (6; 0% instances), obl (5; 0% instances), nmod (3; 0% instances), appos (2; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances), xcomp:pred (1; 0% instances)
Parents of VERB
nodes belong to 15 different parts of speech: (3480; 40% instances), NOUN (2553; 29% instances), VERB (2117; 24% instances), ADJ (224; 3% instances), PRON (182; 2% instances), PROPN (120; 1% instances), ADP (39; 0% instances), ADV (23; 0% instances), NUM (12; 0% instances), SCONJ (9; 0% instances), PART (5; 0% instances), AUX (4; 0% instances), X (4; 0% instances), DET (2; 0% instances), SYM (1; 0% instances)
11 (0%) VERB
nodes are leaves.
401 (5%) VERB
nodes have one child.
1202 (14%) VERB
nodes have two children.
7161 (82%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 14.
Children of VERB
nodes are attached using 35 different relations: nsubj (5877; 17% instances), obl (5391; 16% instances), punct (4948; 15% instances), obj (2718; 8% instances), mark:prt (2301; 7% instances), xcomp (2231; 7% instances), xcomp:pred (1749; 5% instances), advmod (1582; 5% instances), obl:prep (1140; 3% instances), mark (1050; 3% instances), advcl (1007; 3% instances), conj (863; 3% instances), cc (757; 2% instances), ccomp (668; 2% instances), obl:tmod (600; 2% instances), parataxis (439; 1% instances), case (125; 0% instances), nmod (66; 0% instances), amod (36; 0% instances), dislocated (35; 0% instances), list (34; 0% instances), vocative (34; 0% instances), discourse (23; 0% instances), acl:relcl (22; 0% instances), compound:prt (12; 0% instances), cop (9; 0% instances), csubj:cop (5; 0% instances), orphan (5; 0% instances), appos (3; 0% instances), det (3; 0% instances), nsubj:outer (3; 0% instances), acl (2; 0% instances), csubj:cleft (1; 0% instances), flat:name (1; 0% instances), nmod:poss (1; 0% instances)
Children of VERB
nodes belong to 17 different parts of speech: NOUN (13369; 40% instances), PUNCT (4948; 15% instances), PART (3974; 12% instances), VERB (2117; 6% instances), PRON (2005; 6% instances), ADP (1601; 5% instances), ADJ (1414; 4% instances), PROPN (1106; 3% instances), SCONJ (1023; 3% instances), ADV (1004; 3% instances), CCONJ (830; 2% instances), NUM (273; 1% instances), X (31; 0% instances), INTJ (17; 0% instances), AUX (12; 0% instances), DET (10; 0% instances), SYM (7; 0% instances)