Treebank Statistics: UD_Tswana-Popapolelo: POS Tags: NOUN
There are 1 NOUN
lemmas (9%), 24 NOUN
types (21%) and 26 NOUN
tokens (12%).
Out of 11 observed tags, the rank of NOUN
is: 5 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent NOUN
lemmas: _
The 10 most frequent NOUN
types: koloi, lekwalo, Moagisani, Mosetsana, Rre, baesekele, bohibidu, boronse, gauta, kakanyo
The 10 most frequent ambiguous lemmas: _ (PRON 43, VERB 34, PART 32, NOUN 26, PUNCT 23, PROPN 15, ADV 12, AUX 12, CCONJ 8, SCONJ 5, ADJ 4)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN
is 24.000000 (the average of all parts of speech is 10.181818).
The 1st highest number of forms (24) was observed with the lemma “_”: Moagisani, Mosetsana, Rre, baesekele, bohibidu, boronse, gauta, kakanyo, koloi, lebaka, lebelo, legora, lekwalo, letlhabaphefo, letsatsi, monna, moriri, morwarraagwe, mošate, naga, phaposing, pula, selefera, tsala.
NOUN
occurs with 1 features: NounClass (26; 100% instances)
NOUN
occurs with 6 feature-value pairs: NounClass=Bantu1
, NounClass=Bantu14
, NounClass=Bantu3
, NounClass=Bantu5
, NounClass=Bantu7
, NounClass=Bantu9
NOUN
occurs with 6 feature combinations.
The most frequent feature combination is NounClass=Bantu9
(10 tokens).
Examples: koloi, baesekele, boronse, gauta, kakanyo, naga, phaposing, pula, tsala
Relations
NOUN
nodes are attached to their parents using 11 different relations: obj (8; 31% instances), nsubj (6; 23% instances), orphan (3; 12% instances), obl (2; 8% instances), advcl (1; 4% instances), appos (1; 4% instances), conj (1; 4% instances), iobj (1; 4% instances), obl:lmod (1; 4% instances), obl:tmod (1; 4% instances), root (1; 4% instances)
Parents of NOUN
nodes belong to 6 different parts of speech: VERB (17; 65% instances), PROPN (3; 12% instances), AUX (2; 8% instances), NOUN (2; 8% instances), ADJ (1; 4% instances), (1; 4% instances)
11 (42%) NOUN
nodes are leaves.
9 (35%) NOUN
nodes have one child.
5 (19%) NOUN
nodes have two children.
1 (4%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 5.
Children of NOUN
nodes are attached using 11 different relations: nmod (9; 38% instances), case (5; 21% instances), punct (2; 8% instances), advmod (1; 4% instances), amod (1; 4% instances), cc (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), discourse (1; 4% instances), nsubj (1; 4% instances), orphan (1; 4% instances)
Children of NOUN
nodes belong to 9 different parts of speech: PART (6; 25% instances), PRON (6; 25% instances), PROPN (3; 13% instances), ADJ (2; 8% instances), NOUN (2; 8% instances), PUNCT (2; 8% instances), ADV (1; 4% instances), AUX (1; 4% instances), CCONJ (1; 4% instances)