Treebank Statistics: UD_Irish-Cadhan: POS Tags: NOUN
There are 614 NOUN lemmas (52%), 891 NOUN types (45%) and 1146 NOUN tokens (24%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
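The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the three-token sample is hypothetical, the function name pos_counts is invented for illustration, and treating word forms case-sensitively is an assumption.

```python
from collections import defaultdict

# Hypothetical three-token CoNLL-U fragment (not taken from the treebank).
SAMPLE = """\
1\tlá\tlá\tNOUN\t_\t_\t0\troot\t_\t_
2\tlaethanta\tlá\tNOUN\t_\t_\t1\tconj\t_\t_
3\tmaith\tmaith\tADJ\t_\t_\t1\tamod\t_\t_
"""

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

counts = pos_counts(SAMPLE)
print(counts["NOUN"])  # (1, 2, 2)
```

Ranking the tags by any of the three numbers is then a matter of sorting the dictionary values.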
The 10 most frequent NOUN lemmas: duine, lá, saol, rí, cur, fear, bheith, lámh, bliain, ceann
The 10 most frequent NOUN types: lá, bheith, duine, fhios, chur, dhéanamh, saoghal, tan, beatha, bith
The 10 most frequent ambiguous lemmas: mac (NOUN 9, PART 4), leath (NOUN 7, VERB 1), ní (PART 24, NOUN 7), athair (NOUN 5, PROPN 1), croch (NOUN 4, VERB 1), céanna (NOUN 4, ADJ 1), meas (NOUN 4, VERB 2), Gall (NOUN 3, PROPN 2), ais (NOUN 3, ADV 1), beag (ADJ 8, NOUN 2)
The 10 most frequent ambiguous types: mac (NOUN 4, PART 1), ais (NOUN 3, ADV 1), linn (NOUN 3, ADP 2), ní (PART 17, AUX 3, NOUN 3), ré (NOUN 2, ADP 1, PART 1), beag (ADJ 5, NOUN 2), fhuil (NOUN 2, VERB 2), fuil (NOUN 2, VERB 1), mic (NOUN 2, PART 1), Gall (PROPN 2, NOUN 1)
- mac
- NOUN 4: Siodhmall mac Cairbre chruim , mic Ealcmhair , mic Dealbhaoith .
- PART 1: Is breugach a deir , mar an gceudna , go raibhe Murchadh mac Cochlain ‘na rígh ar Éirinn an tan fá haois do’n Tighearna sé bliadhna ar thrí fhichid ar chéad ar mhíle , óir is dearbh gurab é Ruaidhrí Ua Conchubhair do bhí ag gabháil cheannais Éireann re a ais an tan soin , agus gurab cheithre bliadhna ria ngabháltas Gall an uair sin .
- ais
- NOUN 3: Bhíos ag cuimilt mo shúl agus ag ceapadh néal beag eile bheith agam , nuair a buaileadh an fhuinneóg a bhí le m’ ais go haireach le slait .
- ADV 1: An treas easbaidh , do bhí sé uaillmhianach , agus d’á réir sin , do bhí súil aige le meudughadh d’ fhaghbháil ó’n droing lér’ gríosadh é le scríobhadh go holc ar Éirinn : agus fós , re linn bheith ‘na shagart ‘na dhiaidh sin dó , do gheall gairm tar ais do dhéanamh ar mhórán do na neithibh maslaightheacha do scríobh ar Éirinn , agus do chluinim go bhfuil sé i gclódh anois re n-a thaisbéanadh i n-Éirinn .
- linn
- ní
- PART 17: Ach ní bhíonn sé leis féin mar sin i gcomhnaidhe .
- AUX 3: Adubhairt Símón Peadar ris , a thighearna , ní hiád mó chosa amhaín , achd mo lámha agus mó cheann mar an gcédna .
- NOUN 3: Fuair tusa ‘s as cuirthe a suim a mheic Cathbharr Í Dhomhnuill ní ó Dhia et re fa rath ni dlighe bheith dhe diomdhach .
- ré
- NOUN 2: Do chosain tu íad amhlaidh re ré an chogaidh chatharmaigh ceithre bliaghna doirbhe dég nar léigis faill ‘na ccoimhéd .
- ADP 1: An beagán d’ Fhearaibh Bolg teurna as an gcath so , do chuadar ar teitheadh ré Tuathaibh Dé Danann , gur háitigheadh riu Árainn , Íle , Reachrainn , Inse Gall , agus iomad oiléan ar cheana , agus do chomhnuigh siad ionnta go haimsir na gcúigeadhach do bheith i bhflaitheas Éireann , gur dhíbirsiod na Cruithnigh , eadhon ‘ Picti , ’ as na hoiléanaibh sin iad , go dtángadar d’ fhios Chairbre Niadhfir , rí Laighean , go bhfuairsiod fearann ar ghabháltas uaidh .
- PART 1: Na trí ríogha so do bhíodh i bhflaitheas Éireann gach ré mbliadhain ; agus is é ainm mná gach fir díobh do bhíodh ar an oiléan an bhliadhain do bhíodh féin ‘na rígh .
- beag
- fhuil
- fuil
- NOUN 2: Tanaig Iosa trenar ccoir do nimh a-nuas on Athoir ‘s do dhoirt fuil a chuirp uile air ar ngradh tre thrócuire .
- VERB 1: Achd an mhéid do ghabh chuca hé , tug sé na cumhachda so dhóibh bheith ana gcloinn ag Dia , eadhón don droing chreideas ann a ainmsean : Nách fuil arna ngeineamhain ó fhuil , na ó ainmián ná colla , ná ó thoil fhir , achd ó Dhía .
- mic
- Gall
- PROPN 2: Tuig , a léaghthóir , gurab mó , fa dhó nó fa thrí , acra do thomhas na nGaedheal , ioná acra do roinn Gall anois .
- NOUN 1: Is breugach a deir , mar an gceudna , go raibhe Murchadh mac Cochlain ‘na rígh ar Éirinn an tan fá haois do’n Tighearna sé bliadhna ar thrí fhichid ar chéad ar mhíle , óir is dearbh gurab é Ruaidhrí Ua Conchubhair do bhí ag gabháil cheannais Éireann re a ais an tan soin , agus gurab cheithre bliadhna ria ngabháltas Gall an uair sin .
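A type counts as ambiguous here when the same word form is tagged with more than one part of speech. A minimal sketch of that tally follows; the (form, tag) pairs are hypothetical stand-ins for the FORM and UPOS columns, and the helper name ambiguous is invented for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) pairs; real input would come from the treebank.
tagged = [
    ("ní", "PART"), ("ní", "PART"), ("ní", "NOUN"),
    ("lá", "NOUN"),
    ("ais", "NOUN"), ("ais", "ADV"),
]

def ambiguous(pairs):
    """Map each form seen with more than one tag to its (tag, count) list."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    # Keep only forms with at least two distinct tags, most frequent tag first.
    return {f: c.most_common() for f, c in by_form.items() if len(c) > 1}

result = ambiguous(tagged)
print(result["ní"])  # [('PART', 2), ('NOUN', 1)]
```

Passing (lemma, tag) pairs instead of (form, tag) pairs gives the corresponding list of ambiguous lemmas.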
Morphology
The form / lemma ratio of NOUN is 1.451140 (the average of all parts of speech is 1.662997).
The 1st highest number of forms (9) was observed with the lemma “rí”: Rig, Righ, Riogha, righe, riogh, rí, rígh, ríogha, ríoghaibh.
The 2nd highest number of forms (7) was observed with the lemma “bliain”: bhliadhain, bliadhna, bliaghain, bliaghan, bliaghna, mbliadhain, mbliaghan.
The 3rd highest number of forms (7) was observed with the lemma “ceann”: cceann, ccionn, ceann, cheann, cinn, gcinn, gcionn.
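The reported ratio is reproduced by dividing the number of NOUN types (891) by the number of NOUN lemmas (614). The sketch below checks that arithmetic and ranks the three lemmas listed above by their number of distinct forms. The variable names are illustrative, and breaking ties alphabetically is an assumption that happens to match the order shown.

```python
# Figures reported on this page for NOUN.
noun_types = 891
noun_lemmas = 614

ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.451140

# Forms listed above for the three lemmas with the most forms.
forms_by_lemma = {
    "ceann": ["cceann", "ccionn", "ceann", "cheann", "cinn", "gcinn", "gcionn"],
    "rí": ["Rig", "Righ", "Riogha", "righe", "riogh", "rí", "rígh", "ríogha", "ríoghaibh"],
    "bliain": ["bhliadhain", "bliadhna", "bliaghain", "bliaghan", "bliaghna",
               "mbliadhain", "mbliaghan"],
}

# Sort by number of distinct forms, descending; ties broken by lemma (assumption).
ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(set(kv[1])), kv[0]))
for lemma, forms in ranked:
    print(lemma, len(set(forms)))
```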
NOUN occurs with 9 features: Number (1024; 89% instances), Case (1023; 89% instances), Gender (985; 86% instances), Definite (570; 50% instances), Form (378; 33% instances), VerbForm (119; 10% instances), NounType (35; 3% instances), PrepForm (24; 2% instances), Typo (1; 0% instances)
NOUN occurs with 21 feature-value pairs: Case=Dat, Case=Gen, Case=Nom, Case=Voc, Definite=Def, Form=Ecl, Form=Ecl,Emp, Form=Emp, Form=Emp,Len, Form=HPref, Form=Len, Gender=Fem, Gender=Masc, NounType=Strong, NounType=Weak, Number=Plur, Number=Sing, PrepForm=Cmpd, Typo=Yes, VerbForm=Inf, VerbForm=Vnoun
NOUN occurs with 105 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (158 tokens).
Examples: duine, lá, bith, ainm, ní, saoghal, acra, beag, bás, ceann
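The three feature statistics above (features, feature-value pairs, feature combinations) all derive from the FEATS column, where pairs are separated by "|" and a multi-valued feature such as Form=Ecl,Emp stays a single pair. A minimal sketch under those assumptions follows; the sample strings are hypothetical and parse_feats is an illustrative helper, not part of any UD tool.

```python
from collections import Counter

# Hypothetical FEATS values for three NOUN tokens; "_" means no features.
feats_column = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Gen|Definite=Def|Form=Len|Gender=Fem|Number=Sing",
    "_",
]

def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

feature_counts = Counter()  # how many tokens carry each feature
pair_counts = Counter()     # how many tokens carry each feature-value pair
combo_counts = Counter()    # how many tokens carry each full combination

for feats in feats_column:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{k}={v}" for k, v in parsed.items())
    combo_counts[feats] += 1

print(feature_counts["Case"])        # 2
print(combo_counts.most_common(1))
```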
Relations
NOUN nodes are attached to their parents using 22 different relations: nmod (209; 18% instances), obl (207; 18% instances), obj (146; 13% instances), nsubj (134; 12% instances), conj (95; 8% instances), xcomp (78; 7% instances), xcomp:pred (41; 4% instances), obl:tmod (38; 3% instances), root (38; 3% instances), fixed (34; 3% instances), appos (28; 2% instances), parataxis (25; 2% instances), advcl (16; 1% instances), vocative (16; 1% instances), acl (10; 1% instances), csubj:cop (8; 1% instances), acl:relcl (7; 1% instances), ccomp (7; 1% instances), dislocated (3; 0% instances), flat:name (3; 0% instances), csubj:cleft (2; 0% instances), case (1; 0% instances)
Parents of NOUN nodes belong to 9 different parts of speech: VERB (499; 44% instances), NOUN (452; 39% instances), ADJ (59; 5% instances), [no parent: root] (38; 3% instances), ADP (35; 3% instances), PROPN (33; 3% instances), PRON (22; 2% instances), NUM (4; 0% instances), PART (4; 0% instances)
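Both tallies above come from the HEAD and DEPREL columns: the relation is read off the NOUN node itself, and the parent's part of speech is looked up through its head index. A minimal sketch follows, assuming a single hypothetical sentence given as (id, UPOS, head, deprel) rows; the function name noun_attachments and the "ROOT" placeholder for head 0 are illustrative choices.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head, deprel) rows; head 0 is the root.
rows = [
    (1, "VERB", 0, "root"),
    (2, "NOUN", 1, "nsubj"),
    (3, "ADP", 4, "case"),
    (4, "NOUN", 1, "obl"),
    (5, "NOUN", 4, "nmod"),
]

def noun_attachments(rows, upos="NOUN"):
    """Count incoming relations and parent tags for nodes with the given tag."""
    pos_by_id = {node_id: pos for node_id, pos, _, _ in rows}
    relations = Counter()
    parent_pos = Counter()
    for _, pos, head, deprel in rows:
        if pos != upos:
            continue
        relations[deprel] += 1
        # Head 0 has no part of speech; "ROOT" is a placeholder label.
        parent_pos[pos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

relations, parent_pos = noun_attachments(rows)
print(relations)   # one each of nsubj, obl, nmod
print(parent_pos)  # VERB twice, NOUN once
```

For a whole treebank the lookup table has to be rebuilt per sentence, since token ids restart at 1.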
146 (13%) NOUN nodes are leaves.
349 (30%) NOUN nodes have one child.
356 (31%) NOUN nodes have two children.
295 (26%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 8.
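The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes it for a hypothetical six-token tree and then checks that the four buckets reported above add up to the 1146 NOUN tokens and round to the stated percentages. The tree and the helper name child_degrees are assumptions for illustration.

```python
from collections import Counter

# Hypothetical tree as (id, upos, head) triples; head 0 is the root.
nodes = [
    (1, "VERB", 0),
    (2, "DET", 3),
    (3, "NOUN", 1),
    (4, "ADJ", 3),
    (5, "NOUN", 3),
    (6, "PUNCT", 1),
]

def child_degrees(nodes, upos):
    """Return the number of children of every node tagged `upos`."""
    n_children = Counter(head for _, _, head in nodes)
    return [n_children[node_id] for node_id, pos, _ in nodes if pos == upos]

degrees = child_degrees(nodes, "NOUN")
print(degrees)  # [3, 0]

# Consistency check on the figures reported above.
reported = {"0": 146, "1": 349, "2": 356, "3+": 295}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(total)   # 1146
print(shares)  # {'0': 13, '1': 30, '2': 31, '3+': 26}
```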
Children of NOUN nodes are attached using 33 different relations: case (436; 20% instances), det (296; 14% instances), nmod (292; 13% instances), punct (157; 7% instances), nmod:poss (146; 7% instances), amod (115; 5% instances), conj (93; 4% instances), acl:relcl (88; 4% instances), cc (83; 4% instances), obl (68; 3% instances), obj (53; 2% instances), mark (46; 2% instances), obl:prep (38; 2% instances), advmod (35; 2% instances), nummod (34; 2% instances), cop (33; 2% instances), appos (22; 1% instances), nsubj (21; 1% instances), parataxis (20; 1% instances), acl (16; 1% instances), case:voc (15; 1% instances), xcomp (11; 1% instances), csubj:cleft (10; 0% instances), xcomp:pred (8; 0% instances), ccomp (7; 0% instances), obl:tmod (7; 0% instances), advcl (5; 0% instances), csubj:cop (4; 0% instances), fixed (4; 0% instances), vocative (3; 0% instances), dislocated (1; 0% instances), flat:name (1; 0% instances), mark:prt (1; 0% instances)
Children of NOUN nodes belong to 14 different parts of speech: ADP (510; 24% instances), NOUN (452; 21% instances), DET (430; 20% instances), PUNCT (157; 7% instances), ADJ (121; 6% instances), VERB (111; 5% instances), PROPN (89; 4% instances), CCONJ (83; 4% instances), PART (52; 2% instances), NUM (43; 2% instances), PRON (42; 2% instances), ADV (35; 2% instances), AUX (33; 2% instances), SCONJ (11; 1% instances)