Treebank Statistics: UD_Irish-IDT: POS Tags: NOUN
There are 4681 NOUN lemmas (49%), 8430 NOUN types (53%) and 33373 NOUN tokens (29%).
Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: duine, cur, bliain, cuid, fáil, déanamh, ceann, féidir, pobal, bheith
The 10 most frequent NOUN types: chur, dhéanamh, fáil, bheith, duine, féidir, chuid, chéile, réir, daoine
The 10 most frequent ambiguous lemmas: ceann (NOUN 280, PROPN 4), bheith (NOUN 220, ADV 6), céile (NOUN 185, PROPN 1), clár (NOUN 165, PROPN 10), cathair (NOUN 162, PROPN 18), lá (NOUN 126, PROPN 1), réir (NOUN 125, SCONJ 2), cúirt (NOUN 116, PROPN 2), leath (NOUN 116, ADJ 2, VERB 2), áit (NOUN 116, SCONJ 1)
The 10 most frequent ambiguous types: chur (NOUN 286, VERB 1), bheith (NOUN 220, ADV 6), réir (NOUN 123, SCONJ 2), dtí (NOUN 91, ADP 1), áit (NOUN 77, SCONJ 1), linn (NOUN 71, ADP 22), láthair (NOUN 62, ADP 1), deireadh (NOUN 57, VERB 1), forbartha (NOUN 28, ADJ 5), rith (NOUN 43, VERB 6)
- chur
- bheith
- réir
- NOUN 123: Meascadh agus díol cógas de réir nós na Meánaoise .
- SCONJ 2: (4) D’ fhonn amhras a sheachaint , aon achtachán a ndéantar leasú air le halt den Acht seo a scoireann de bheith i ngníomh amhail ar an agus ón lá dá dtagraítear i bhfo-alt (1) nó , de réir mar a bheidh , amhail ar agus ó dháta éagtha na tréimhse ar lena linn a choimeádfar i ngníomh é faoi fho-alt (2) ( ‘ an t-éag ‘ ) , beidh feidhm aige agus beidh éifeacht leis amhail ar an agus ón lá sin nó , de réir mar a bheidh , amhail ar an agus ón éag , mar a bhí feidhm aige agus éifeacht leis díreach roimh dháta an Achta seo a rith ach sin faoi réir aon leasuithe a dhéanfar le haon Acht eile den Oireachtas tar éis an dáta rite sin .
- dtí
- áit
- linn
- láthair
- deireadh
- NOUN 57: ’ Tá deireadh an domhain ag teacht !
- VERB 1: Bím ag obair , mar sin , leis na coistí sin ina gceann agus ina gceann , agus i gcomhar chomh maith faoi bhrat an chomhchoiste , sé sin Chomhchoiste Ghaeltachtaí Chiarraí Theas agus Comharchumann Naomh Fhionáin Teo. B’ in é an t-aon lá sa mbliain a bhféadfá do chuid siúil a thaispeáint , mar a deireadh sé .
- forbartha
- rith
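The ambiguity lists above can be reproduced in outline by grouping tokens by lemma (or by word form) and keeping the entries that occur with NOUN and with at least one other tag. The following is a minimal sketch, not the script that produced this page: the function name `ambiguous_lemmas`, the `(lemma, upos)` input shape and the toy data are assumptions for illustration, and sorting by the NOUN count is only a guess at the ordering used here.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, tag="NOUN"):
    """Return (lemma, tag_counts) pairs for lemmas seen with `tag` and another tag.

    `tokens` is an iterable of (lemma, upos) pairs (hypothetical input shape).
    """
    by_lemma = defaultdict(Counter)
    for lemma, upos in tokens:
        by_lemma[lemma][upos] += 1
    hits = [
        (lemma, counts)
        for lemma, counts in by_lemma.items()
        if tag in counts and len(counts) > 1
    ]
    # Assumed ordering: most frequent under `tag` first, ties broken alphabetically.
    hits.sort(key=lambda item: (-item[1][tag], item[0]))
    return hits

# Toy data, not taken from the treebank.
toy = [
    ("ceann", "NOUN"), ("ceann", "NOUN"), ("ceann", "PROPN"),
    ("duine", "NOUN"),
    ("réir", "NOUN"), ("réir", "SCONJ"),
]
result = ambiguous_lemmas(toy)
for lemma, counts in result:
    print(lemma, dict(counts))
```

Passing `(form, upos)` pairs instead of `(lemma, upos)` pairs gives the corresponding list of ambiguous types.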
Morphology
The form / lemma ratio of NOUN is 1.800897 (the average of all parts of speech is 1.648496).
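The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page. A short check, assuming that "forms" here means distinct word types (the helper name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(num_types, num_lemmas):
    """Average number of distinct word forms per lemma."""
    return num_types / num_lemmas

# Counts quoted on this page: 8430 NOUN types, 4681 NOUN lemmas.
noun_ratio = form_lemma_ratio(8430, 4681)
print(f"{noun_ratio:.6f}")
```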
The 1st highest number of forms (13) was observed with the lemma “teach”: TI, Tighe, Títhe, dteach, dtigh, dtithe, dtí, teach, theach, thigh, tigh, tithe, tí.
The 2nd highest number of forms (11) was observed with the lemma “údarás”: hÚdarás, húdaráis, nÚdaráis, nÚdarás, t-údarás, tArd-Údarás, tÚdaras, tÚdarás, údarais, údaráis, údarás.
The 3rd highest number of forms (10) was observed with the lemma “bás”: b(h)ás, bas, bháis, bhás, bhásanna, báis, bás, básanna, mbáis, mbás.
NOUN occurs with 12 features: Number (29004; 87% instances), Case (28903; 87% instances), Gender (28277; 85% instances), Definite (13360; 40% instances), Form (10043; 30% instances), VerbForm (4083; 12% instances), NounType (1212; 4% instances), PrepForm (977; 3% instances), Typo (159; 0% instances), Abbr (101; 0% instances), Foreign (62; 0% instances), Dialect (13; 0% instances)
NOUN occurs with 25 feature-value pairs: Abbr=Yes, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Definite=Def, Dialect=Connaught, Dialect=Munster, Dialect=Ulster, Foreign=Yes, Form=Ecl, Form=Emp, Form=Emp,Len, Form=HPref, Form=Len, Gender=Fem, Gender=Masc, NounType=Strong, NounType=Weak, Number=Plur, Number=Sing, PrepForm=Cmpd, Typo=Yes, VerbForm=Inf, VerbForm=Vnoun
NOUN occurs with 261 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (3964 tokens).
Examples: duine, rud, bith, gá, fad, ábhar, bás, iarratas, lá, ceann
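Feature combinations such as Case=Nom|Gender=Masc|Number=Sing follow the CoNLL-U FEATS convention: name=value pairs separated by `|`, with `_` for a token that has no features. The per-feature counts and percentages above can be approximated by splitting that column and counting feature names. This is a minimal sketch; `parse_feats`, `feature_coverage` and the toy strings are assumptions for illustration, not the code behind this page.

```python
from collections import Counter

def parse_feats(feats):
    """Split a FEATS string into a {name: value} dict; '_' means no features."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_coverage(feats_column):
    """Map each feature name to (token count, rounded percentage of tokens)."""
    counts = Counter()
    total = 0
    for feats in feats_column:
        total += 1
        counts.update(parse_feats(feats).keys())
    return {name: (n, round(100 * n / total)) for name, n in counts.items()}

# Toy FEATS strings, not taken from the treebank.
toy_feats = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Gen|Definite=Def|Gender=Fem|Number=Sing",
    "_",
    "Form=Emp,Len|Number=Plur",
]
coverage = feature_coverage(toy_feats)
print(coverage)
```

A value containing a comma, as in Form=Emp,Len, is kept as a single string, which matches how that pair is listed above.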
Relations
NOUN nodes are attached to their parents using 33 different relations: nmod (8454; 25% instances), obl (7116; 21% instances), obj (3626; 11% instances), nsubj (3571; 11% instances), xcomp (2935; 9% instances), conj (2844; 9% instances), fixed (1169; 4% instances), root (812; 2% instances), xcomp:pred (583; 2% instances), obl:tmod (503; 2% instances), csubj:cop (424; 1% instances), parataxis (276; 1% instances), advcl (270; 1% instances), appos (241; 1% instances), compound (157; 0% instances), ccomp (109; 0% instances), acl:relcl (91; 0% instances), acl (39; 0% instances), dislocated (38; 0% instances), vocative (28; 0% instances), list (24; 0% instances), flat:foreign (13; 0% instances), csubj:cleft (10; 0% instances), advmod (7; 0% instances), nsubj:outer (7; 0% instances), discourse (6; 0% instances), orphan (6; 0% instances), flat (3; 0% instances), mark (3; 0% instances), nummod (3; 0% instances), case (2; 0% instances), flat:name (2; 0% instances), cc (1; 0% instances)
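Counts like these can be gathered from the treebank's CoNLL-U files, in which every token line has ten tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The sketch below reads the UPOS and DEPREL columns directly. The function name `noun_relations` and the toy rows are hypothetical, and the sketch makes no claim about how the page itself was generated.

```python
from collections import Counter

def noun_relations(conllu_text, tag="NOUN"):
    """Count the DEPREL of every token whose UPOS equals `tag`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        if cols[3] == tag:
            counts[cols[7]] += 1
    return counts

# Toy rows with placeholder forms, not taken from the treebank.
rows = [
    ["1", "a", "a", "VERB", "_", "_", "0", "root", "_", "_"],
    ["2", "b", "b", "NOUN", "_", "_", "1", "nsubj", "_", "_"],
    ["3", "c", "c", "NOUN", "_", "_", "2", "nmod", "_", "_"],
    ["4", "d", "d", "NOUN", "_", "_", "3", "nmod", "_", "_"],
]
toy_conllu = "# sent_id = toy\n" + "\n".join("\t".join(r) for r in rows) + "\n"
relations = noun_relations(toy_conllu)
print(relations.most_common())
```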
Parents of NOUN nodes fall into 17 different categories (16 parts of speech, plus NOUN nodes that are the sentence root and so have no parent): NOUN (15991; 48% instances), VERB (13369; 40% instances), ADP (1199; 4% instances), ADJ (896; 3% instances), no parent, i.e. root (812; 2% instances), PROPN (470; 1% instances), PRON (384; 1% instances), NUM (119; 0% instances), ADV (66; 0% instances), X (20; 0% instances), DET (12; 0% instances), PART (12; 0% instances), AUX (10; 0% instances), SCONJ (7; 0% instances), SYM (4; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
5717 (17%) NOUN nodes are leaves.
9153 (27%) NOUN nodes have one child.
9753 (29%) NOUN nodes have two children.
8750 (26%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 19.
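The four counts above add up to the 33373 NOUN tokens, so every NOUN node falls into exactly one bucket. A minimal sketch of the bucketing for a single sentence, assuming tokens are given as `(id, upos, head)` triples with head 0 for the root (the function name and the toy sentence are hypothetical):

```python
from collections import Counter

def child_degree_buckets(sentence, tag="NOUN"):
    """Bucket nodes tagged `tag` by number of children: 0, 1, 2, or 3 (= three or more)."""
    children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for tok_id, upos, _ in sentence:
        if upos == tag:
            buckets[min(children[tok_id], 3)] += 1
    return buckets

# Toy sentence structure, not taken from the treebank.
toy_sentence = [
    (1, "VERB", 0),
    (2, "NOUN", 1),   # one child: token 4
    (3, "DET", 4),
    (4, "NOUN", 2),   # one child: token 3
    (5, "NOUN", 1),   # leaf
    (6, "PUNCT", 1),
]
buckets = child_degree_buckets(toy_sentence)
print(dict(buckets))

# Consistency check on the figures quoted on this page.
page_total = 5717 + 9153 + 9753 + 8750
print(page_total)
```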
Children of NOUN nodes are attached using 42 different relations: case (13518; 22% instances), nmod (10601; 18% instances), det (8186; 14% instances), amod (3939; 7% instances), punct (3576; 6% instances), conj (2734; 5% instances), obl (2254; 4% instances), cc (2200; 4% instances), mark (2189; 4% instances), acl:relcl (1943; 3% instances), obj (1828; 3% instances), nmod:poss (1068; 2% instances), cop (830; 1% instances), advmod (656; 1% instances), obl:prep (616; 1% instances), nummod (611; 1% instances), xcomp (610; 1% instances), nsubj (349; 1% instances), csubj:cop (339; 1% instances), appos (327; 1% instances), parataxis (216; 0% instances), csubj:cleft (210; 0% instances), advcl (203; 0% instances), xcomp:pred (202; 0% instances), mark:prt (178; 0% instances), compound (157; 0% instances), ccomp (145; 0% instances), obl:tmod (125; 0% instances), acl (86; 0% instances), flat (86; 0% instances), flat:name (65; 0% instances), list (56; 0% instances), fixed (49; 0% instances), case:voc (30; 0% instances), compound:prt (21; 0% instances), dislocated (16; 0% instances), flat:foreign (15; 0% instances), vocative (11; 0% instances), discourse (5; 0% instances), nsubj:outer (4; 0% instances), orphan (4; 0% instances), goeswith (3; 0% instances)
Children of NOUN nodes belong to 17 different parts of speech: NOUN (15991; 27% instances), ADP (14358; 24% instances), DET (9166; 15% instances), ADJ (4137; 7% instances), PUNCT (3576; 6% instances), VERB (2553; 4% instances), CCONJ (2251; 4% instances), PROPN (2149; 4% instances), PART (2018; 3% instances), NUM (1355; 2% instances), PRON (854; 1% instances), AUX (834; 1% instances), ADV (541; 1% instances), SCONJ (383; 1% instances), X (79; 0% instances), SYM (14; 0% instances), INTJ (2; 0% instances)