Treebank Statistics: UD_Gwichin-TueCL: POS Tags: PROPN
There are 28 PROPN lemmas (9%), 28 PROPN types (6%) and 33 PROPN tokens (3%).
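Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that lemmas and types are counted as distinct, case-sensitive strings in the LEMMA and FORM columns; the function name `count_tag` and the sample sentence are hypothetical and are not taken from the treebank.

```python
def count_tag(conllu_text, tag):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == tag:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Made-up example sentence, for illustration only.
rows = [
    ["1", "John", "John", "PROPN", "_", "_", "3", "nsubj", "_", "_"],
    ["2", "Burke", "Burke", "PROPN", "_", "_", "1", "flat", "_", "_"],
    ["3", "saw", "see", "VERB", "_", "_", "0", "root", "_", "_"],
    ["4", "John", "John", "PROPN", "_", "_", "3", "obj", "_", "_"],
]
sample = "# text = made-up example\n" + "\n".join("\t".join(r) for r in rows) + "\n"

print(count_tag(sample, "PROPN"))
```

The percentages in parentheses would then follow from dividing each count by the corresponding total over all tags.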
Out of 15 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.
The 10 most frequent PROPN lemmas: gwichyaa, John, K’ǫǫ, Vashrą̀įį, Alice, April, Bishop, Burke, Dodson, Dr.
The 10 most frequent PROPN types: Gwichyaa, John, K’ǫǫ, Vashrą̀įį, Alice, April, Bishop, Burke, Dodson, Dr.
The 10 most frequent ambiguous lemmas: _ (PUNCT 319, VERB 37, ADV 7, ADP 1, NOUN 1, PROPN 1, X 1), dinjii (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types: (none listed)
Morphology
The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.693333).
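One plausible reading of the form / lemma ratio is the average number of distinct word forms per lemma within a tag. The sketch below works under that assumption; the helper `form_lemma_ratio` and the sample triples are illustrative and do not reproduce the treebank or the script that generated these statistics.

```python
from collections import defaultdict


def form_lemma_ratio(tokens, upos):
    """Average number of distinct forms per lemma for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) triples.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0  # the tag does not occur at all
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Illustrative sample only, not the actual treebank contents.
sample = [
    ("Alice", "Alice", "PROPN"),
    ("Alice", "Alice", "PROPN"),
    ("April", "April", "PROPN"),
    ("Bishop", "Bishop", "PROPN"),
    ("dinjii", "dinjii", "NOUN"),
]

print(f"{form_lemma_ratio(sample, 'PROPN'):.6f}")  # prints 1.000000
```

With 28 types and 28 lemmas, a ratio of exactly 1 means that every PROPN lemma was observed in a single form, which agrees with the per-lemma lines that follow.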
The 1st highest number of forms (1) was observed with the lemma “Alice”: Alice.
The 2nd highest number of forms (1) was observed with the lemma “April”: April.
The 3rd highest number of forms (1) was observed with the lemma “Bishop”: Bishop.
PROPN does not occur with any features.
Relations
PROPN nodes are attached to their parents using 8 different relations: flat (9; 27% of instances), obl (7; 21% of instances), nsubj (5; 15% of instances), conj (4; 12% of instances), compound (3; 9% of instances), obj (3; 9% of instances), appos (1; 3% of instances), dep (1; 3% of instances).
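The percentages are each relation's share of all 33 PROPN tokens, rounded to a whole number. A short sketch using the counts listed above (the function name `percentages` is only illustrative):

```python
# Incoming relation counts for PROPN nodes, copied from the list above.
relation_counts = {
    "flat": 9, "obl": 7, "nsubj": 5, "conj": 4,
    "compound": 3, "obj": 3, "appos": 1, "dep": 1,
}


def percentages(counts):
    """Map each key to its share of the total, as a rounded percentage."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


print(sum(relation_counts.values()))  # 33, the number of PROPN tokens
for relation, share in percentages(relation_counts).items():
    print(f"{relation}: {share}%")
```

Because each share is rounded independently, the listed percentages need not add up to exactly 100.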
Parents of PROPN nodes belong to 4 different parts of speech: VERB (14; 42% of instances), PROPN (12; 36% of instances), NOUN (5; 15% of instances), PRON (2; 6% of instances).
15 (45%) PROPN nodes are leaves.
13 (39%) PROPN nodes have one child.
3 (9%) PROPN nodes have two children.
2 (6%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 3.
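The child degree of a node is the number of tokens that name it as their head. As a rough sketch, a degree histogram for one tag can be built from the HEAD and UPOS columns of a sentence; the function `child_degree_histogram` and the four-token example are hypothetical.

```python
from collections import Counter


def child_degree_histogram(heads, upos, tag):
    """Count nodes of `tag` by their number of children.

    heads[i] is the 1-based head index of token i+1 (0 marks the root);
    upos[i] is the UPOS tag of token i+1.
    """
    n_children = Counter(h for h in heads if h != 0)
    hist = Counter()
    for index, node_tag in enumerate(upos, start=1):
        if node_tag == tag:
            hist[n_children[index]] += 1  # a Counter yields 0 for leaves
    return hist


# Made-up sentence: PROPN(1) <- PROPN(2), VERB(3) is root, PROPN(4) -> VERB(3).
heads = [3, 1, 0, 3]
upos = ["PROPN", "PROPN", "VERB", "PROPN"]

hist = child_degree_histogram(heads, upos, "PROPN")
print(hist[0], hist[1])  # two leaves, one node with a single child
```

Summing such histograms over all sentences would give the leaf, one-child and two-children counts reported above, and the largest key would be the highest child degree.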
Children of PROPN nodes are attached using 7 different relations: flat (9; 36% of instances), case (6; 24% of instances), cc (3; 12% of instances), punct (3; 12% of instances), conj (2; 8% of instances), compound (1; 4% of instances), nummod (1; 4% of instances).
Children of PROPN nodes belong to 6 different parts of speech: PROPN (12; 48% of instances), ADP (6; 24% of instances), PUNCT (3; 12% of instances), CCONJ (2; 8% of instances), ADV (1; 4% of instances), NUM (1; 4% of instances).