Treebank Statistics: UD_Irish-IDT: POS Tags: DET
There are 21 DET
lemmas (0%), 40 DET
types (0%) and 10287 DET
tokens (9%).
Out of 17 observed tags, the rank of DET
is: 13 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent DET
lemmas: an, a, seo, sin, eile, aon, gach, do, mo, uile
The 10 most frequent DET
types: an, na, a, seo, sin, eile, aon, gach, do, mo
The 10 most frequent ambiguous lemmas: an (DET 7265, PART 57, X 1), a (PART 4074, DET 736, PRON 23, ADV 7, X 5, NUM 2, NOUN 1), seo (DET 564, PRON 145), sin (DET 425, PRON 414, INTJ 1), aon (DET 286, NUM 20, NOUN 4), gach (DET 232, NOUN 1), do (ADP 1447, PART 316, DET 175), uile (DET 59, NOUN 5, ADJ 1), ár (DET 42, NOUN 5), cibé (DET 18, PRON 2)
The 10 most frequent ambiguous types: an (DET 4548, PART 28, AUX 5, ADP 1, X 1), na (DET 2540, ADP 1), a (PART 4050, DET 745, PRON 23, ADV 7, ADP 2, X 2, NOUN 1), seo (DET 547, PRON 125, AUX 1), sin (DET 403, PRON 346, AUX 1), aon (DET 259, NUM 12, NOUN 3), gach (DET 199, NOUN 1), do (ADP 514, DET 111, PART 40), d’ (ADP 172, PART 138, DET 55), uile (DET 42, NOUN 5)
- an
- DET 4548: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
- PART 28: Cén áit an bhfágfaidh mé iad seo , a Tom .
- AUX 5: Nó cúig an ea ?
- ADP 1: Má imríonn siad faoi mar is féidir leo , is dóigh liom go bhfillfidh siad ar Staid Semple , Lá ‘ le Pádraig , ach is ina lámha féin a bheidh sé an ag imirt nó i measc an tslua a bheidh siad .
- X 1: AR 10 Feabhra 1968 , cúig lá tar éis do William Conor bás a fháil , d’ fhoilsigh an Belfast Telegraph cartún le Rowel Friers ( 1920- ) dar teideal ‘ The End of an Era , ‘ in ómós don phrionsa fir a bhí ar lár .
- na
- a
- PART 4050: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
- DET 745: Meatachán scanraithe agus a lámha cáidheach le roidealach an bhóthair .
- PRON 23: Níl de dhíth ach aon fhocal amháin , sin a bhfuil .
- ADV 7: Tá an teicníocht inste a roghnaíonn sé thar a bheith éifeachtach .
- ADP 2: ’ Séard atá De Róiste a rá anois ná gur cuireadh ina choinne go raibh baint aige le Saor Éire agus gur thug sin leithscéal dóibh é a bhriseadh .
- X 2: ’ Cad a bhí ann ach go raibh sé ‘ cut off with a shilling ‘ , agus tugadh an scilling dó !
- NOUN 1: Sa Deibhí bíonn seacht siolla sa líne freisin , ach bíonn siolla sa bhreis i bhfocal deireanach b ar a , agus in d ar c .
- seo
- DET 547: An dtuigtear fós sa tír seo cé chomh mór de athrú is a bhí ansin ?
- PRON 125: Duradh gur seo ceann dena fadhbanna is mó atá sa cheantar .
- AUX 1: Cothú sa Duine Tá cúig chéim i gceist le cothú sa duine : Daoibhse atá ar bheagán Shakespeare , nó daoibhse a d’ fhág cúrsaí staidéir níos mó ná cúpla lá ó shin , seo é , go gonta , scéal traigéideach marfach Phrionsa na Danmhairge .
- sin
- DET 403: Thug na páistí ruaig amháin ar an siopa sin .
- PRON 346: Cé nár dhúirt tú é bhí ‘ fhios agam gur thuig tú sin i do chroí .
- AUX 1: ’ Tógann sé deich mbliana ar an spéis léitheoireachta theacht in inmhe agus caithfear díriú ar pháistí atá ag sroichint na léitheoireachta ar bhun neamhspleách , mar sin an áit a gcailltear iad faoi láthair .
- aon
- DET 259: Ní raibh aon ghá le cuireadh .
- NUM 12: Is ionann sin agus a rá nach féidir na breoslaí seo a úsáid ach aon uair amháin .
- NOUN 3: Is mar gheall ar infheistíocht ón Údarás agus ó Choimisiún Forbartha an Iarthair a tharla an bisiú seo mar aon le cinneadh na dtáirgeoirí bogadh go dtí táirgeadh orgánach breisluacha .
- gach
- do
- d’
- uile
Morphology
The form / lemma ratio of DET
is 1.904762 (the average of all parts of speech is 1.648496).
The 1st highest number of forms (7) was observed with the lemma “an”: ‘n, ‘na, a, a’, an, na, un.
The 2nd highest number of forms (4) was observed with the lemma “do”: d’, dh’, do, d’.
The 3rd highest number of forms (3) was observed with the lemma “a”: a, n-a, á.
DET
occurs with 11 features: PronType (8953; 87% instances), Number (8338; 81% instances), Definite (7523; 73% instances), Gender (2418; 24% instances), Case (2342; 23% instances), Person (1073; 10% instances), Poss (1073; 10% instances), Form (61; 1% instances), Dialect (42; 0% instances), Foreign (3; 0% instances), Typo (3; 0% instances)
DET
occurs with 21 feature-value pairs: Case=Gen
, Definite=Def
, Dialect=Munster
, Dialect=Ulster
, Foreign=Yes
, Form=Ecl
, Form=HPref
, Form=Len
, Gender=Fem
, Gender=Fem,Masc
, Gender=Masc
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Poss=Yes
, PronType=Art
, PronType=Dem
, PronType=Ind
, Typo=Yes
DET
occurs with 28 feature combinations.
The most frequent feature combination is Definite=Def|Number=Sing|PronType=Art
(3836 tokens).
Examples: an, ‘n, a, a’, na
Relations
DET
nodes are attached to their parents using 14 different relations: det (9185; 89% instances), nmod:poss (1070; 10% instances), fixed (10; 0% instances), conj (7; 0% instances), flat:name (4; 0% instances), obj (2; 0% instances), obl (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), nsubj (1; 0% instances), obl:tmod (1; 0% instances)
Parents of DET
nodes belong to 12 different parts of speech: NOUN (9166; 89% instances), PROPN (990; 10% instances), NUM (36; 0% instances), PRON (24; 0% instances), X (21; 0% instances), ADJ (13; 0% instances), DET (11; 0% instances), VERB (10; 0% instances), ADP (9; 0% instances), SCONJ (4; 0% instances), ADV (2; 0% instances), PART (1; 0% instances)
10183 (99%) DET
nodes are leaves.
92 (1%) DET
nodes have one child.
8 (0%) DET
nodes have two children.
4 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 4.
Children of DET
nodes are attached using 18 different relations: punct (28; 23% instances), fixed (24; 20% instances), case (17; 14% instances), det (10; 8% instances), nmod (8; 7% instances), cc (7; 6% instances), obl:prep (6; 5% instances), acl:relcl (3; 2% instances), conj (3; 2% instances), cop (3; 2% instances), flat:foreign (3; 2% instances), appos (2; 2% instances), mark:prt (2; 2% instances), advmod (1; 1% instances), amod (1; 1% instances), ccomp (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)
Children of DET
nodes belong to 15 different parts of speech: ADP (35; 29% instances), PUNCT (28; 23% instances), ADJ (14; 12% instances), NOUN (12; 10% instances), DET (11; 9% instances), CCONJ (7; 6% instances), AUX (3; 2% instances), PRON (2; 2% instances), PROPN (2; 2% instances), VERB (2; 2% instances), ADV (1; 1% instances), NUM (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances), X (1; 1% instances)