Treebank Statistics: UD_English-Pronouns: POS Tags: NOUN
There are 4 NOUN
lemmas (6%), 6 NOUN
types (8%) and 240 NOUN
tokens (14%).
Out of 13 observed tags, the rank of NOUN
is: 5 in number of lemmas, 4 in number of types and 4 in number of tokens.
The 10 most frequent NOUN
lemmas: dealer, car, paint, bump
The 10 most frequent NOUN
types: dealer, car, dealers, cars, paint, bumps
The 10 most frequent ambiguous lemmas: paint (NOUN 10, VERB 5)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN
is 1.500000 (the average of all parts of speech is 1.212121).
The 1st highest number of forms (2) was observed with the lemma “car”: car, cars.
The 2nd highest number of forms (2) was observed with the lemma “dealer”: dealer, dealers.
The 3rd highest number of forms (1) was observed with the lemma “bump”: bumps.
NOUN
occurs with 1 features: Number (240; 100% instances)
NOUN
occurs with 2 feature-value pairs: Number=Plur
, Number=Sing
NOUN
occurs with 2 feature combinations.
The most frequent feature combination is Number=Sing
(175 tokens).
Examples: dealer, car, paint
Relations
NOUN
nodes are attached to their parents using 3 different relations: nsubj (190; 79% instances), obj (45; 19% instances), conj (5; 2% instances)
Parents of NOUN
nodes belong to 3 different parts of speech: VERB (195; 81% instances), PRON (40; 17% instances), NOUN (5; 2% instances)
40 (17%) NOUN
nodes are leaves.
160 (67%) NOUN
nodes have one child.
30 (13%) NOUN
nodes have two children.
10 (4%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 4.
Children of NOUN
nodes are attached using 9 different relations: det (185; 73% instances), nmod (20; 8% instances), amod (10; 4% instances), case (10; 4% instances), punct (10; 4% instances), appos (5; 2% instances), cc (5; 2% instances), conj (5; 2% instances), nsubj (5; 2% instances)
Children of NOUN
nodes belong to 7 different parts of speech: DET (185; 73% instances), PRON (30; 12% instances), ADJ (10; 4% instances), PART (10; 4% instances), PUNCT (10; 4% instances), CCONJ (5; 2% instances), NOUN (5; 2% instances)