
This page pertains to UD version 2.

Treebank Statistics: UD_Beja-Autogramm: POS Tags: PRON

There is 1 PRON lemma (6%), 93 PRON types (5%) and 820 PRON tokens (7%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: =heːb, =i, =oː, =eː, ani, =hoːk, kna, =oːk, =oːn, aneːb

The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)

The 10 most frequent ambiguous types: =i (ADP 132, PRON 68, AUX 31, SCONJ 18), =eː (PRON 54, ADP 25, SCONJ 20, DET 1), ani (PRON 54, VERB 12, AUX 3), =hoːk (PRON 41, PART 1), kna (PRON 32, NOUN 2), hoːj (ADP 73, PRON 17, NOUN 1), ti= (DET 115, PRON 15, SCONJ 3), =ji (ADP 13, PRON 11, SCONJ 6, AUX 2), =eːk (SCONJ 34, PRON 9), =jeː (SCONJ 14, PRON 9, ADP 6)

Morphology

The form / lemma ratio of PRON is 93.000000 (the average of all parts of speech is 126.812500).
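The ratio above is consistent with dividing the 93 PRON types by the single PRON lemma. A minimal sketch of that calculation over CoNLL-U text follows. It assumes that the ratio is distinct word forms over distinct lemmas and that forms are compared case-sensitively; the function name `form_lemma_ratio` and the toy rows are invented for illustration and are not taken from the treebank.

```python
def form_lemma_ratio(conllu_text, upos):
    """Distinct forms divided by distinct lemmas for one UPOS tag (assumed definition)."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # a CoNLL-U token line has exactly 10 tab-separated columns
        tok_id, form, lemma, tag = cols[0], cols[1], cols[2], cols[3]
        if "-" in tok_id or "." in tok_id:
            continue  # skip multiword token ranges and empty nodes
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    return len(forms) / len(lemmas) if lemmas else 0.0


# Hypothetical toy rows: only ID, FORM, LEMMA and UPOS are filled in.
toy_rows = [
    ["1", "ani", "_", "PRON", "_", "_", "_", "_", "_", "_"],
    ["2", "kna", "_", "PRON", "_", "_", "_", "_", "_", "_"],
    ["3", "ani", "_", "PRON", "_", "_", "_", "_", "_", "_"],
    ["4", "=i", "_", "ADP", "_", "_", "_", "_", "_", "_"],
]
toy_text = "\n".join("\t".join(row) for row in toy_rows)
toy_ratio = form_lemma_ratio(toy_text, "PRON")
print(toy_ratio)
```

In the toy input the two distinct PRON forms share the lemma "_", so every form is counted against that one lemma, which mirrors the situation reported for this treebank.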

The 1st highest number of forms (93) was observed with the lemma “_”: -b, -t, =aː, =aːk, =aːn, =eː, =eːheː, =eːhi, =eːk, =eːsoː, =heːb, =hi, =hina, =hoːk, =hoːka, =hoːkna, =hoːn, =i, =iheː, =ihi, =iji, =ijoː, =ijoːk, =iːsi, =iːsiː, =iːsoː, =iːsoːk, =iːsoːn, =jaː, =jeː, =jeːsoːn, =ji, =joː, =joːk, =joːn, =ju, =juː, =juːk, =oː, =oːk, =oːki, =oːkna, =oːn, =saj, =siːsi, =uː, =uːk, =uːn, abij, ambataː, aneːb, ani, aniː, baraː, baraːk, bareːsoːkna, barhi, barijoːk, barjoː, barjoːk, baroː, baroːk, baruː, baruːk, beːn, hinin, hoː, hoːj, hoːk, hoːka, hoːn, i=, imbareː, imbateː, ji=, kna, nafs, naː, naːn, ombarijoːk, ombaroː, ombatoː, oːn, t=, ti=, umbaroː, umbaruː, umbaruːk, w=, wi=, wʔi=, ʔani, ʔaːw.

PRON occurs with 10 features: Number (713; 87% instances), Person (686; 84% instances), Case (607; 74% instances), Poss (368; 45% instances), Gender (110; 13% instances), PronType (64; 8% instances), Definite (34; 4% instances), Reflex (33; 4% instances), Polite (13; 2% instances), Deixis (2; 0% instances)
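The percentages in the feature list appear to be each count's share of the 820 PRON tokens, rounded to a whole number. The sketch below reproduces them under that assumption; the counts are copied from the list above, while the helper name `share` and the rounding rule are assumptions rather than the documented behaviour of the statistics generator.

```python
# Counts copied from the feature list above; 820 is the PRON token total.
PRON_TOKENS = 820
feature_counts = {
    "Number": 713, "Person": 686, "Case": 607, "Poss": 368, "Gender": 110,
    "PronType": 64, "Definite": 34, "Reflex": 33, "Polite": 13, "Deixis": 2,
}


def share(count, total=PRON_TOKENS):
    """Whole-number percentage of tokens (assumed rounding rule)."""
    return round(100 * count / total)


feature_shares = {name: share(n) for name, n in feature_counts.items()}
for name, pct in feature_shares.items():
    print(f"{name}: {pct}%")
```

Because a token can carry several features at once, these shares are not expected to sum to 100.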

PRON occurs with 22 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Deixis=Prox, Deixis=Remt, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Poss=Yes, PronType=Dem, PronType=Int, PronType=Rel, Reflex=Yes

PRON occurs with 79 feature combinations. The most frequent feature combination is Number=Sing|Person=1 (81 tokens). Examples: =heːb, =oː, hoː, =ji, =joː, aneːb

Relations

PRON nodes are attached to their parents using 17 different relations: nmod:poss (305; 37% instances), obj (232; 28% instances), nsubj (92; 11% instances), dep (43; 5% instances), iobj (31; 4% instances), dep:comp (30; 4% instances), discourse (26; 3% instances), nmod (15; 2% instances), dislocated:subj (14; 2% instances), obl:mod (12; 1% instances), dislocated:obj (6; 1% instances), reparandum (6; 1% instances), nsubj:outer (3; 0% instances), dep:conj (2; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), root (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (406; 50% instances), NOUN (295; 36% instances), ADP (60; 7% instances), ADJ (23; 3% instances), PRON (17; 2% instances), NUM (7; 1% instances), PROPN (3; 0% instances), X (3; 0% instances), PART (2; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), no part of speech (1; 0% instances; presumably the artificial root node, which carries no tag)

682 (83%) PRON nodes are leaves.

79 (10%) PRON nodes have one child.

51 (6%) PRON nodes have two children.

8 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 5.
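The four child-degree counts above can be cross-checked against the PRON token total. The sketch below sums them, recomputes the rounded shares, and derives lower and upper bounds on the total number of children from the stated maximum degree of 5. The variable names are illustrative, and whole-number rounding is assumed.

```python
# Node counts by number of children, copied from the statements above.
degree_counts = {"0": 682, "1": 79, "2": 51, "3+": 8}
MAX_DEGREE = 5  # highest child degree reported for a PRON node

total_nodes = sum(degree_counts.values())
degree_shares = {k: round(100 * v / total_nodes) for k, v in degree_counts.items()}

# Nodes in the "3+" bucket have between 3 and MAX_DEGREE children each.
fixed_children = degree_counts["1"] * 1 + degree_counts["2"] * 2
min_children = fixed_children + degree_counts["3+"] * 3
max_children = fixed_children + degree_counts["3+"] * MAX_DEGREE

print(total_nodes, degree_shares, min_children, max_children)
```

The total matches the 820 PRON tokens, and the child relation counts listed in the next paragraph sum to 210, which falls inside the computed range.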

Children of PRON nodes are attached using 15 different relations: det (76; 36% instances), punct (75; 36% instances), discourse (21; 10% instances), nmod:poss (11; 5% instances), cc (7; 3% instances), dep (5; 2% instances), acl:relcl (3; 1% instances), dep:conj (3; 1% instances), nmod (3; 1% instances), advmod (1; 0% instances), cop (1; 0% instances), dep:comp (1; 0% instances), dislocated:subj (1; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: DET (83; 40% instances), PUNCT (75; 36% instances), PRON (17; 8% instances), PART (10; 5% instances), CCONJ (7; 3% instances), NOUN (7; 3% instances), ADP (2; 1% instances), INTJ (2; 1% instances), SCONJ (2; 1% instances), ADV (1; 0% instances), AUX (1; 0% instances), PROPN (1; 0% instances), VERB (1; 0% instances), X (1; 0% instances)