Statistics of PRON in UD_Skolt


This page pertains to UD version 2.


Treebank Statistics: UD_Skolt_Sami-Giellagas: POS Tags: `PRON`

There are 20 PRON lemmas (4%), 53 PRON types (7%) and 311 PRON tokens (11%). Out of 16 observed tags, the rank of PRON is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.
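The counts and ranks above can be sketched as follows. This is a minimal illustration, not the script that generated this page: it assumes that "types" are distinct word forms compared case-sensitively, and that a rank is a tag's position when all tags are sorted by the relevant count in descending order. The function names `pos_counts` and `rank` are hypothetical, and the rows for tags other than PRON are placeholders.

```python
from collections import defaultdict

def pos_counts(rows):
    """rows: iterable of (form, lemma, upos) triples.
    Returns {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)
        types[upos].add(form)   # assumption: case-sensitive word forms
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}

def rank(counts, tag, index):
    """1-based rank of `tag` by counts[tag][index], highest count first."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1

# Toy rows for illustration only. The PRON forms and lemmas appear on this
# page; the VERB and NOUN rows are hypothetical placeholders.
toy = [
    ("son", "son", "PRON"),
    ("suu", "son", "PRON"),
    ("tõn", "tõt", "PRON"),
    ("x1", "x", "VERB"),
    ("x2", "x", "VERB"),
    ("y1", "y", "NOUN"),
]
counts = pos_counts(toy)
print(counts["PRON"])            # (lemmas, types, tokens) for PRON
print(rank(counts, "PRON", 2))   # rank of PRON by token count
```

In a real run the triples would come from the FORM, LEMMA and UPOS columns of the treebank's CoNLL-U files.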

The 10 most frequent PRON lemmas: son, tõt, mon, ton, mii, dõõt, mâiʹd, puk, tut, jiõčč

The 10 most frequent PRON types: son, tõt, tõn, ton, mon, mâiʹd, suu, mii, muu, miʹjjid

The 10 most frequent ambiguous lemmas: tõt (PRON 78, DET 4), mii (PRON 13, ADV 1), puk (PRON 8, ADV 1), nåkkam (PRON 5, ADJ 2), tät (PRON 5, DET 1), nuʹbb (ADJ 4, NOUN 2, PRON 2)

The 10 most frequent ambiguous types: tõt (PRON 29, DET 1), tõn (PRON 28, DET 2), puk (PRON 7, ADV 1), nåkkam (PRON 4, ADJ 2), jeeʹres (DET 1, PRON 1), nuuʹbb (ADJ 2, PRON 1), nuʹbb (ADJ 2, PRON 1), tõʹst (ADV 7, PRON 1)

tõt
- PRON 29: tõt sluužbäiʹǧǧ leäi kuâhttlovitt eeʹjj .
- DET 1: Dõõt šõõddi vââšš , vââšš šõõddi di tõt bieʹss jieʹli …
tõn
- PRON 28: Joo mon tõn räjja teâđam , jäänab jiõm tieʹđ .
- DET 2: Suʹst leäi õlggâm leeʹd tõn peeiʹv čååǥǥâʹttmen vuõptees , leša mii leežž šõddâm ǥu ij tâʹl ni vuäittam .
puk
- PRON 7: Di sij koozz vueʹlǧǧe , de puk vuäbbses poʹhtte põʹrtte .
- ADV 1: No inn šõõddi , tuk âʹtte vuäđđje puk … stäʹlmmstääll … dõõk leʹjje , vuäđđje .
nåkkam
- PRON 4: – ” No ij kâʹl leäkku šurr , mâʹte ton , tuu šoora ooumaž lij nåkkam . ”
- ADJ 2: Mii leežž nåkkam peästtõõǥǥid koozz-a čõõnõõđi , čõõnõõđi .
jeeʹres
- DET 1: De son jeeʹres tõzz-e tok kuäʹđ … jiijjâs kuäʹđ jeeʹres årra tok raaji pääiʹǩi ja de tok pääkkai mõõnnâd .
- PRON 1: De son jeeʹres tõzz-e tok kuäʹđ … jiijjâs kuäʹđ jeeʹres årra tok raaji pääiʹǩi ja de tok pääkkai mõõnnâd .
nuuʹbb
- ADJ 2: Son lij suukkâm jooǥǥ nuuʹbb peälla .
- PRON 1: Čõõnõõđi âʹtte jiõčč di tõn nuuʹbb viʹllje ceälkk : ton ååʹn õõk lueʹštškueʹtted suu tok .
nuʹbb
- ADJ 2: Näʹde nuʹbb villj näʹde čõõnõõđškuõʹđi .
- PRON 1: Di tut nuʹbb âʹtte lueʹšti , lueʹšti , lueʹšti di peästtõõǥǥ puʹtte .
tõʹst
- ADV 7: Čääʹccskääll pâi åårr tõʹst jiõŋ âʹlnn .
- PRON 1: Näʹde son âʹtte vuõʹlji , näʹde tät jeäʹnnes teâđast nuʹt-i paaʹʒʒi pääʹltâʹstted , di leuʹdd âʹtte leäi mâŋŋa tõʹst :
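A list of ambiguous types like the one above could be derived by grouping tokens by word form and keeping the forms seen with more than one tag. The sketch below assumes this simple definition; `ambiguous_types` is a hypothetical helper, and the toy input only mimics the counts reported for "puk" (the "son" rows are invented filler).

```python
from collections import Counter, defaultdict

def ambiguous_types(rows):
    """rows: iterable of (form, upos) pairs.
    Returns {form: {upos: count}} for forms observed with 2+ tags."""
    by_form = defaultdict(Counter)
    for form, upos in rows:
        by_form[form][upos] += 1
    return {f: dict(c) for f, c in by_form.items() if len(c) > 1}

# Toy input: 7 PRON and 1 ADV occurrences of "puk", as reported above,
# plus an unambiguous form as filler (its count is made up).
rows = [("puk", "PRON")] * 7 + [("puk", "ADV")] + [("son", "PRON")] * 3
result = ambiguous_types(rows)
print(result)
```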

Morphology

The form / lemma ratio of PRON is 2.650000 (the average of all parts of speech is 1.476015).
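The PRON ratio is consistent with dividing the number of PRON types by the number of PRON lemmas reported at the top of this page (53 and 20). A quick check, assuming that is how the ratio is defined:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas

ratio = form_lemma_ratio(53, 20)   # PRON types and lemmas from this page
print(f"{ratio:.6f}")              # prints 2.650000
```

The average over all parts of speech cannot be recomputed from this page alone, since it needs the type and lemma counts of the whole treebank.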

The 1st highest number of forms (7) was observed with the lemma “mon”: mij, miʹjjid, mon, muu, muännaid, muʹnne, muʹst.

The 2nd highest number of forms (7) was observed with the lemma “son”: seeʹst, sij, son, suu, suäna, suännast, suʹst.

The 3rd highest number of forms (7) was observed with the lemma “tõt”: tõid, tõin, tõk, tõn, tõt, tõt-i, tõʹst.

PRON occurs with 6 features: Case (299; 96% instances), Number (298; 96% instances), PronType (278; 89% instances), Person (154; 50% instances), Clitic (8; 3% instances), Reflex (5; 2% instances)

PRON occurs with 21 feature-value pairs: Case=Acc, Case=Com, Case=Gen, Case=Ill, Case=Loc, Case=Nom, Clitic=AddI, Clitic=Os, Clitic=QstA, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

PRON occurs with 50 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3|PronType=Prs (66 tokens). Examples: son
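A feature combination such as the one above is how the FEATS column of a CoNLL-U file is written: `Name=Value` pairs joined by `|`, with `_` standing for "no features". A minimal sketch of splitting it into a dictionary (`parse_feats` is a hypothetical helper, and multi-valued features such as `A=x,y` are left as plain strings):

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

combo = parse_feats("Case=Nom|Number=Sing|Person=3|PronType=Prs")
print(combo["Case"], combo["PronType"])   # prints: Nom Prs
```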

Relations

PRON nodes are attached to their parents using 22 different relations: nsubj (142; 46% instances), det (50; 16% instances), obj (44; 14% instances), obl (18; 6% instances), nsubj:cop (13; 4% instances), conj (9; 3% instances), reparandum (7; 2% instances), expl (4; 1% instances), nmod (3; 1% instances), nmod:poss (3; 1% instances), obl:lmod (3; 1% instances), root (3; 1% instances), dep (2; 1% instances), orphan (2; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (214; 69% instances), NOUN (73; 23% instances), PRON (7; 2% instances), AUX (5; 2% instances), ADJ (4; 1% instances), ADV (4; 1% instances), no POS, i.e. the artificial root (3; 1% instances), PROPN (1; 0% instances)

268 (86%) PRON nodes are leaves.

22 (7%) PRON nodes have one child.

6 (2%) PRON nodes have two children.

15 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.
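The child-degree figures above can be cross-checked: the four groups sum to the 311 PRON tokens, and the percentages follow from rounding each share to the nearest integer (the rounding rule is an assumption). The sketch also shows one way the buckets could be computed from head indices; `child_degree_buckets` and the toy sentence are hypothetical.

```python
from collections import Counter

def child_degree_buckets(sentences, tag="PRON"):
    """sentences: list of sentences, each a list of (token_id, head_id, upos)
    with 1-based ids and head 0 for the root.
    Returns a Counter over buckets '0', '1', '2', '3+' for nodes of `tag`."""
    buckets = Counter()
    for sent in sentences:
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, upos in sent:
            if upos == tag:
                d = n_children[tok_id]
                buckets[str(d) if d < 3 else "3+"] += 1
    return buckets

# Toy sentence (invented): a PRON with one PUNCT child, attached to a VERB.
toy = [[(1, 2, "PRON"), (2, 0, "VERB"), (3, 1, "PUNCT")]]
toy_buckets = child_degree_buckets(toy)

# Counts reported on this page.
reported = {"0": 268, "1": 22, "2": 6, "3+": 15}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(total, shares)
```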

Children of PRON nodes are attached using 20 different relations: punct (31; 33% instances), advmod (9; 9% instances), cop (7; 7% instances), nsubj:cop (6; 6% instances), advmod:eval (5; 5% instances), cc (5; 5% instances), orphan (5; 5% instances), case (4; 4% instances), discourse (4; 4% instances), advmod:neg (3; 3% instances), conj (3; 3% instances), mark (3; 3% instances), fixed (2; 2% instances), nsubj (2; 2% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), advmod:lmod (1; 1% instances), ccomp (1; 1% instances), det (1; 1% instances), reparandum (1; 1% instances)

Children of PRON nodes belong to 10 different parts of speech: PUNCT (31; 33% instances), ADV (21; 22% instances), AUX (7; 7% instances), PRON (7; 7% instances), VERB (7; 7% instances), NOUN (6; 6% instances), CCONJ (5; 5% instances), PART (5; 5% instances), ADP (4; 4% instances), SCONJ (2; 2% instances)
