Treebank Statistics: UD_Irish-IDT: POS Tags: X
There are 256 X lemmas (3%), 260 X types (2%) and 358 X tokens (0%).
Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.
The 10 most frequent X lemmas: the, Irish, of, Life, for, a, _, port, preparing, read
The 10 most frequent X types: the, Irish, of, Life, for, a, Port, Preparing, Read, Right
The 10 most frequent ambiguous lemmas: the (X 26, DET 1), Irish (X 10, PROPN 4), of (X 10, NOUN 3), Life (X 7, PROPN 3), a (PART 4074, DET 736, PRON 23, ADV 7, X 5, NUM 2, NOUN 1), _ (X 4, PUNCT 1), port (NOUN 8, PROPN 2, X 1), on (X 2, ADP 1), Spring (X 2, PROPN 1), Times (X 2, PROPN 1)
The 10 most frequent ambiguous types: the (X 8, ADJ 1), Irish (X 10, PROPN 4), of (X 10, NOUN 3), Life (X 7, PROPN 3), a (PART 4050, DET 745, PRON 23, ADV 7, ADP 2, X 2, NOUN 1), Port (X 4, PROPN 2), National (X 3, PROPN 1), on (X 2, ADP 1), Spring (X 2, PROPN 1), Times (X 2, PROPN 1)
- the
  - X 8: Ar na leabhair eile atá scríofa agus maisithe ag an údar , Douglas Alvord , tá : On the Water .
  - ADJ 1: Ag éisteacht le meidhir na bhfuiseog idir aer agus uisce thaibhsigh do dhánta chugam thar thalamh eadrána na síoraíochta , líne ar líne , stadach , scáfar mar shaighdiúirí ó bhéal an áir agus bhain siad an gus asam lena gcuntas ar an uafás : as duibheagán dubh na dtrinsí , as dóchas daortha na n-óg , as ár agus anbhás , d’ éirigh siad chugam as corrabhuais coinsiasa - mise nach raibh ariamh sa bhearna bhaoil , nach dtug ruathar marfach thar an mhullach isteach sa chreach , nár fhulaing i dtreascairt dhian na fola ; nach bhfaca saighdiúirí óga mar bheadh sopóga ann , caite i gcuibhrinn mhéithe an áir , boladh bréan an bháis ag éirí ina phláigh ó bhláth feoite a n-óige ; nach raibh ar maos i nglár is i gclábar bhlár an chatha , nár chaill mo mheabhair i bpléasc , nár mhothaigh an piléar mar bheach thapaidh the ag diúl mhil fhiáin m’ óige .
- Irish
- of
  - X 10: Agus bhí , i gcábán bád canála a raibh an Rose of Mooncoin uirthi .
  - NOUN 3: Faoi dheireadh , don Sun Alliance Chase ag Cheltenham , ag seo na geallta ante-post - Mr. Mulligan ( 5 phointe , bua , ag 7/1 ) ; Hill of Tullow ( 3 phointe , bua ag 10/1 ) ; agus Nahthen Lad ( 2 phointe , bua , ag 14/1 ) .
- Life
- a
  - PART 4050: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
  - DET 745: Meatachán scanraithe agus a lámha cáidheach le roidealach an bhóthair .
  - PRON 23: Níl de dhíth ach aon fhocal amháin , sin a bhfuil .
  - ADV 7: Tá an teicníocht inste a roghnaíonn sé thar a bheith éifeachtach .
  - ADP 2: ’ Séard atá De Róiste a rá anois ná gur cuireadh ina choinne go raibh baint aige le Saor Éire agus gur thug sin leithscéal dóibh é a bhriseadh .
  - X 2: ’ Cad a bhí ann ach go raibh sé ‘ cut off with a shilling ‘ , agus tugadh an scilling dó !
  - NOUN 1: Sa Deibhí bíonn seacht siolla sa líne freisin , ach bíonn siolla sa bhreis i bhfocal deireanach b ar a , agus in d ar c .
- Port
- National
  - X 3: B’ iad seo Frank P. Walsh , iar-Chathaoirleach an National Labour Board ; an t-iar-Ghobharnóir Edward F. Dunne agus Micheal J. Ryan .
  - PROPN 1: Luath nó mall , tabharfaidh RTÉ agus an ‘ National ‘ Union of Journalists oiread airde ar an difríocht idir Dún Éideann agus Londain agus a thugann siad do Bhaile Átha Cliath agus Béal Feirste , ‘ in the two parts of this island ‘ .
- on
- Spring
- Times
  - X 2: Is file agus is scríbhneoir chruthaitheach é Pól Ó Muirí agus is Eagarthóir Gaeilge an Irish Times é .
  - PROPN 1: Tá scáth na linne ar an aiste aige , ceann nach dtaitneodh lenár gcáinteoir úd ón Irish Times - ‘ Tairgim an Aiste seo dosna Fearaibh dfhuiling an t-ocras ar son na hÉireann Abrán 1920 ‘ - ná ní thaithneodh an chéad dán sa leabhar leis ach an oiread , is dócha .
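The ambiguity lists above pair each lemma or type with the tags it was seen under. A minimal sketch of how such a tally could be produced is shown here; the function name `ambiguous_items` and the toy input are hypothetical and are not taken from the tool that generated this page.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, target="X"):
    """Map each item seen with `target` and at least one other tag to its tag counts."""
    tags_by_item = defaultdict(Counter)
    for item, upos in pairs:
        tags_by_item[item][upos] += 1
    return {
        item: tags
        for item, tags in tags_by_item.items()
        if target in tags and len(tags) > 1
    }

# Toy (type, UPOS) pairs for illustration only, not read from the treebank.
toy = [
    ("Times", "X"), ("Times", "X"), ("Times", "PROPN"),
    ("Life", "X"),          # only one tag in this toy sample, so not ambiguous here
    ("agus", "CCONJ"),      # never tagged X, so ignored
]
result = ambiguous_items(toy)
print(result)
```

Passing (lemma, UPOS) pairs instead of (type, UPOS) pairs would give the lemma-based list in the same way.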
Morphology
The form / lemma ratio of X is 1.015625 (the average of all parts of speech is 1.648496).
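As a quick arithmetic check, the reported ratio is reproduced by dividing the number of X types by the number of X lemmas given at the top of this page. That this is exactly how the ratio is defined is an assumption; the page itself does not state the formula.

```python
# Counts taken from this page: 260 X types, 256 X lemmas.
x_types = 260
x_lemmas = 256

# Assumed definition: distinct forms (types) divided by distinct lemmas.
ratio = x_types / x_lemmas
print(ratio)  # 1.015625
```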
The 1st highest number of forms (4) was observed with the lemma “_”: ,000, Chluichí, chorr-Óglach, t.
The 2nd highest number of forms (1) was observed with the lemma “#explorehistory”: #explorehistory.
The 3rd highest number of forms (1) was observed with the lemma “.i.”: .i..
X occurs with 3 features: Foreign (349; 97% instances), Abbr (5; 1% instances), Form (2; 1% instances)
X occurs with 4 feature-value pairs: Abbr=Yes, Foreign=Yes, Form=Ecl, Form=HPref
X occurs with 5 feature combinations.
The most frequent feature combination is Foreign=Yes (347 tokens).
Examples: the, Irish, of, Life, for, Port, Preparing, Read, Right, to
Relations
X nodes are attached to their parents using 16 different relations: flat:foreign (210; 59% instances), nmod (46; 13% instances), appos (30; 8% instances), obl (16; 4% instances), nsubj (13; 4% instances), root (11; 3% instances), conj (9; 3% instances), obj (6; 2% instances), parataxis (6; 2% instances), goeswith (4; 1% instances), flat (2; 1% instances), advcl (1; 0% instances), dislocated (1; 0% instances), flat:name (1; 0% instances), list (1; 0% instances), xcomp:pred (1; 0% instances)
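The percentages in this list appear to be each relation's count divided by the 358 X tokens, rounded to the nearest whole percent, which is why relations with a single instance show as 0%. The sketch below recomputes them from the counts above; the rounding rule is an assumption inferred from the figures.

```python
# Counts of incoming relations for X nodes, copied from the list above.
relation_counts = {
    "flat:foreign": 210, "nmod": 46, "appos": 30, "obl": 16, "nsubj": 13,
    "root": 11, "conj": 9, "obj": 6, "parataxis": 6, "goeswith": 4,
    "flat": 2, "advcl": 1, "dislocated": 1, "flat:name": 1, "list": 1,
    "xcomp:pred": 1,
}

total = sum(relation_counts.values())  # should equal the number of X tokens

# Assumed rule: share of all X tokens, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```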
Parents of X nodes belong to 10 different parts of speech: X (203; 57% instances), NOUN (79; 22% instances), VERB (31; 9% instances), [root, no part of speech] (11; 3% instances), ADJ (9; 3% instances), PROPN (8; 2% instances), ADP (7; 2% instances), NUM (6; 2% instances), PRON (3; 1% instances), DET (1; 0% instances)
223 (62%) X nodes are leaves.
27 (8%) X nodes have one child.
42 (12%) X nodes have two children.
66 (18%) X nodes have three or more children.
The highest child degree of an X node is 10.
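The four child-degree groups above cover all 358 X tokens (223 + 27 + 42 + 66). A minimal sketch of how such a grouping could be computed for one sentence follows; the tuple layout `(id, head, upos)` and the function name `child_degree_buckets` are illustrative assumptions loosely modelled on CoNLL-U columns, not the generator's actual code.

```python
from collections import Counter

def child_degree_buckets(tokens, target="X"):
    """Group nodes tagged `target` by their number of children.

    tokens: list of (id, head, upos) tuples for a single sentence,
    where head == 0 marks the root.
    """
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, upos in tokens:
        if upos != target:
            continue
        degree = n_children[tok_id]
        if degree == 0:
            buckets["leaf"] += 1
        elif degree == 1:
            buckets["one"] += 1
        elif degree == 2:
            buckets["two"] += 1
        else:
            buckets["three_or_more"] += 1
    return buckets

# Toy sentence for illustration only.
toy_sentence = [
    (1, 0, "X"),      # root, children: 2, 3, 4
    (2, 1, "X"),      # leaf
    (3, 1, "X"),      # one child: 5
    (4, 1, "PUNCT"),  # not counted, wrong tag
    (5, 3, "X"),      # leaf
]
buckets = child_degree_buckets(toy_sentence)
print(dict(buckets))

# Consistency check on the figures reported on this page.
reported_total = 223 + 27 + 42 + 66
print(reported_total)  # 358
```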
Children of X nodes are attached using 22 different relations: flat:foreign (197; 49% instances), punct (81; 20% instances), case (34; 8% instances), nmod (29; 7% instances), det (21; 5% instances), conj (7; 2% instances), cc (4; 1% instances), flat (4; 1% instances), mark:prt (4; 1% instances), advcl (3; 1% instances), amod (3; 1% instances), xcomp (3; 1% instances), appos (2; 0% instances), nsubj (2; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), advmod (1; 0% instances), cop (1; 0% instances), flat:name (1; 0% instances), obj (1; 0% instances), vocative (1; 0% instances)
Children of X nodes belong to 15 different parts of speech: X (203; 50% instances), PUNCT (81; 20% instances), ADP (34; 8% instances), DET (21; 5% instances), NOUN (20; 5% instances), PROPN (17; 4% instances), NUM (8; 2% instances), CCONJ (5; 1% instances), VERB (4; 1% instances), ADJ (3; 1% instances), PRON (2; 0% instances), SCONJ (2; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)