Treebank Statistics: UD_Irish-TwittIrish: POS Tags: PUNCT
There are 79 PUNCT
lemmas (1%), 82 PUNCT
types (1%) and 6159 PUNCT
tokens (13%).
Out of 17 observed tags, the rank of PUNCT
is: 11 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent PUNCT
lemmas: ., :, ,, !, …, -, ?, ‘, …, “
The 10 most frequent PUNCT
types: ., :, ,, !, …, -, ?, ‘, …, “
The 10 most frequent ambiguous lemmas: . (PUNCT 1312, PROPN 2), , (PUNCT 837, CCONJ 1), … (PUNCT 358, SYM 2, X 1), - (PUNCT 338, NUM 2, ADV 1, SYM 1), ) (PUNCT 119, NOUN 1), / (PUNCT 52, CCONJ 2, SYM 1), | (PUNCT 21, SYM 1), ’ (PUNCT 18, ADP 1), = (PUNCT 16, SYM 2, NUM 1), #BricBlasta (NOUN 1, PUNCT 1)
The 10 most frequent ambiguous types: . (PUNCT 1312, PROPN 2), , (PUNCT 837, CCONJ 1), … (PUNCT 359, SYM 2, X 1), - (PUNCT 338, NUM 2, ADV 1, SYM 1), ) (PUNCT 119, NOUN 1), / (PUNCT 52, CCONJ 2, SYM 1), | (PUNCT 21, SYM 1), ’ (PUNCT 18, ADP 1), = (PUNCT 16, SYM 2, NUM 1), #BricBlasta (NOUN 1, PUNCT 1)
- .
- ,
- …
- PUNCT 359: RT @user263 : @user353 … má tá tú idir dhá thine Bhealtaine ! Nó a rock and a hard place .
- SYM 2: Is as an tSeapáin mé イス・アス・アン・チャパーン・メ 私は日本出身です。 Is as … mé で「私は~出身です。」
- X 1: RT @user1663 : Ar mhaith leat an Ghaeilge a mhúineadh i gCeanada ? 🇨🇦 Is féidir le #ICUF tacú leat !! ℹ️ : https://t.co/2G1JX972o8 #Gaeilge # …
- -
- PUNCT 338: Iontach ar fad - maith sibh ! #WRWC2017 #BRINGIT https://t.co/7PTmbvocKs
- NUM 2: RT @user1631 : FT Lios Póil 1 - 11 Naomh Seanáin 0 - 10 . Hup !
- ADV 1: @user155 Níl , ach ba mhaith liom trathnóna comhrá as Gaeilge a eagrú i Stócólm , tá roinn gaeilgeoirí anseo - éasca !
- SYM 1: Poet Mícheál Ó Ruairc joins me on @user1483 ‘s #CaintChiarraí frm 8 - 9 pm : éist beo ag http://t.co/bdZoLhnWfK #Gaeilge http://t.co/z6LlVclfEK
- )
- /
- PUNCT 52: @user1008 Cé a bhí ag canadh / seinm An Cúlfhionn ?
- CCONJ 2: Más ann don eachtra E , b’ ionann sin agus a rá gurbh ann do dhóthain fianaise gur tharla / go dtarlóidh E . #principleofsufficientreason
- SYM 1: RT @user1747 : @user835 |  ̄ ̄ ̄ ̄ ̄ ̄| |scaoil amach | | an bobailín | | ______ | ( __/ ) || (•ㅅ•) || / づ
- |
- ’
- =
- PUNCT 16: @user590 Chaill tú mo mheas = You lost my respect http://t.co/MKO8e38B9S
- SYM 2: @user1606 @user241 @user263 branda a bhfuil tóir air = aspirational brand . http://t.co/yQOZyExWTu
- NUM 1: @user588 Ag Ó Duinnín tá ‘ saol-ré ‘ = biography . Mar sin caidé fá ‘ Scannán saol-ré ‘ agus mar ghiorrúchán : ‘ saolscann ‘
- #BricBlasta
Morphology
The form / lemma ratio of PUNCT
is 1.037975 (the average of all parts of speech is 1.212231).
The 1st highest number of forms (6) was observed with the lemma “…”: …, …., ……, ……., …….., ….
The 2nd highest number of forms (4) was observed with the lemma “!”: !, !!, !!!, !!!!!!!.
The 3rd highest number of forms (4) was observed with the lemma “!!”: !!, !!!, !!!!, !!!!!!!.
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 2 different relations: punct (6157; 100% instances), root (2; 0% instances)
Parents of PUNCT
nodes belong to 17 different parts of speech: NOUN (2297; 37% instances), VERB (1899; 31% instances), PROPN (713; 12% instances), ADJ (430; 7% instances), NUM (232; 4% instances), PRON (162; 3% instances), SYM (153; 2% instances), INTJ (77; 1% instances), X (53; 1% instances), ADV (49; 1% instances), ADP (29; 0% instances), CCONJ (19; 0% instances), DET (16; 0% instances), PART (14; 0% instances), PUNCT (9; 0% instances), AUX (5; 0% instances), (2; 0% instances)
6152 (100%) PUNCT
nodes are leaves.
5 (0%) PUNCT
nodes have one child.
0 (0%) PUNCT
nodes have two children.
2 (0%) PUNCT
nodes have three or more children.
The highest child degree of a PUNCT
node is 7.
Children of PUNCT
nodes are attached using 8 different relations: punct (9; 50% instances), vocative:mention (3; 17% instances), obl:prep (1; 6% instances), parataxis (1; 6% instances), parataxis:rt (1; 6% instances), parataxis:sentence (1; 6% instances), xcomp (1; 6% instances), xcomp:pred (1; 6% instances)
Children of PUNCT
nodes belong to 7 different parts of speech: PUNCT (9; 50% instances), PROPN (3; 17% instances), VERB (2; 11% instances), ADJ (1; 6% instances), ADP (1; 6% instances), NOUN (1; 6% instances), SYM (1; 6% instances)