Treebank Statistics: UD_English-ESL: POS Tags: NOUN
There are 1 NOUN
lemmas (6%), 1 NOUN
types (6%) and 15635 NOUN
tokens (16%).
Out of 17 observed tags, the rank of NOUN
is: 8 in number of lemmas, 8 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: _
The 10 most frequent NOUN
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)
The 10 most frequent ambiguous types: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)
- _
- NOUN 15635: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 12250: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 10618: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 10057: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 9580: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 8546: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 7363: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 5857: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 5704: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 3531: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CCONJ 3198: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 2516: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 1795: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 844: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 80: _ _ _ _ _ _ _ _ _ _ _ _
- X 68: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SYM 39: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of NOUN
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
NOUN
occurs with 1 features: Foreign (1; 0% instances)
NOUN
occurs with 1 feature-value pairs: Foreign=Yes
NOUN
occurs with 2 feature combinations.
The most frequent feature combination is _
(15634 tokens).
Examples: _
Relations
NOUN
nodes are attached to their parents using 34 different relations: obl (3833; 25% instances), obj (3785; 24% instances), nsubj (2071; 13% instances), nmod (1998; 13% instances), conj (1035; 7% instances), compound (994; 6% instances), root (510; 3% instances), obl:tmod (265; 2% instances), nsubj:pass (197; 1% instances), advcl (146; 1% instances), obl:npmod (145; 1% instances), ccomp (122; 1% instances), nmod:poss (119; 1% instances), appos (116; 1% instances), parataxis (64; 0% instances), xcomp (58; 0% instances), acl:relcl (50; 0% instances), fixed (31; 0% instances), nmod:npmod (17; 0% instances), goeswith (16; 0% instances), iobj (14; 0% instances), acl (10; 0% instances), list (9; 0% instances), amod (7; 0% instances), csubj (4; 0% instances), csubj:pass (3; 0% instances), discourse (3; 0% instances), nummod (3; 0% instances), vocative (3; 0% instances), dislocated (2; 0% instances), mark (2; 0% instances), case (1; 0% instances), det (1; 0% instances), flat (1; 0% instances)
Parents of NOUN
nodes belong to 16 different parts of speech: VERB (9213; 59% instances), NOUN (4274; 27% instances), ADJ (1043; 7% instances), (510; 3% instances), ADV (152; 1% instances), PROPN (131; 1% instances), PRON (113; 1% instances), NUM (99; 1% instances), ADP (33; 0% instances), DET (29; 0% instances), X (13; 0% instances), AUX (10; 0% instances), SYM (8; 0% instances), SCONJ (5; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
1916 (12%) NOUN
nodes are leaves.
4700 (30%) NOUN
nodes have one child.
5073 (32%) NOUN
nodes have two children.
3946 (25%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 18.
Children of NOUN
nodes are attached using 39 different relations: det (6962; 23% instances), case (6233; 20% instances), amod (3386; 11% instances), nmod (2329; 8% instances), nmod:poss (2072; 7% instances), punct (1593; 5% instances), conj (1102; 4% instances), compound (1049; 3% instances), cop (914; 3% instances), cc (855; 3% instances), acl:relcl (825; 3% instances), nsubj (787; 3% instances), advmod (537; 2% instances), nummod (463; 1% instances), acl (462; 1% instances), mark (225; 1% instances), det:predet (186; 1% instances), advcl (167; 1% instances), aux (150; 0% instances), appos (145; 0% instances), obl (120; 0% instances), parataxis (93; 0% instances), expl (58; 0% instances), csubj (52; 0% instances), goeswith (34; 0% instances), nmod:npmod (25; 0% instances), obj (16; 0% instances), obl:tmod (16; 0% instances), xcomp (15; 0% instances), cc:preconj (14; 0% instances), list (13; 0% instances), discourse (6; 0% instances), ccomp (5; 0% instances), reparandum (2; 0% instances), vocative (2; 0% instances), compound:prt (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), orphan (1; 0% instances)
Children of NOUN
nodes belong to 17 different parts of speech: DET (9111; 29% instances), ADP (6025; 19% instances), NOUN (4274; 14% instances), ADJ (3545; 11% instances), VERB (1624; 5% instances), PUNCT (1592; 5% instances), AUX (1068; 3% instances), CCONJ (859; 3% instances), PRON (843; 3% instances), ADV (513; 2% instances), NUM (506; 2% instances), PROPN (465; 2% instances), PART (246; 1% instances), SCONJ (181; 1% instances), X (45; 0% instances), SYM (14; 0% instances), INTJ (6; 0% instances)