This page pertains to UD version 2.

Treebank Statistics: UD_French-ParisStories: POS Tags: X

There are 20 X lemmas (1%), 20 X types (1%) and 65 X tokens (0%). Out of 15 observed tags, the rank of X is: 11 in number of lemmas, 12 in number of types and 15 in number of tokens.
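As a rough illustration of how the three counts above could be obtained, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. It is not the script that produced this page. The function name `count_upos` and the `SAMPLE` rows are made up for the example, and it assumes forms are compared case-sensitively.

```python
def count_upos(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separators and sentence-level comments
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Hypothetical four-word sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = demo",
    "1\tXXX\tXXX\tX\t_\t_\t0\troot\t_\t_",
    "2\ts~\ts~\tX\t_\t_\t1\treparandum\t_\t_",
    "3\ts~\ts~\tX\t_\t_\t1\treparandum\t_\t_",
    "4\tdans\tdans\tADP\t_\t_\t1\tcase\t_\t_",
])

n_lemmas, n_types, n_tokens = count_upos(SAMPLE, "X")
```

On the sample, the two occurrences of s~ count as two tokens but only one type and one lemma.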

The 10 most frequent X lemmas: XXX, s~, d~, m~, pl~, _, b~, comple~, c~, c~…

The 10 most frequent X types: XXX, s~, d~, m~, pl~, b~, comple~, c~, c~…, dans

The 10 most frequent ambiguous lemmas: s~ (X 6, VERB 3), d~ (X 3, ADP 2, NOUN 1, VERB 1), _ (NOUN 15, VERB 2, ADV 1, PUNCT 1, X 1), dans (ADP 202, X 1)

The 10 most frequent ambiguous types: s~ (X 6, VERB 3), d~ (X 3, ADP 2, NOUN 1, VERB 1), dans (ADP 202, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.332572).
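One plausible reading of this ratio is the average number of distinct word forms per lemma; that definition is an assumption here, not something the page states. Under it, a value of 1.0 means every X lemma was seen with exactly one form. A minimal sketch, with `form_lemma_ratio` as a hypothetical helper name:

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma over (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Form/lemma pairs named on this page; the repeated pair shows that
# duplicate tokens do not change the ratio.
x_pairs = [("XXX", "XXX"), ("port", "_"), ("b~", "b~"), ("b~", "b~")]
x_ratio = form_lemma_ratio(x_pairs)

# Made-up contrast case: lemma "x" has two forms, lemma "y" has one.
mixed_ratio = form_lemma_ratio([("a", "x"), ("b", "x"), ("c", "y")])
```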

The 1st highest number of forms (1) was observed with the lemma “XXX”: XXX.

The 2nd highest number of forms (1) was observed with the lemma “_”: port.

The 3rd highest number of forms (1) was observed with the lemma “b~”: b~.

X occurs with 1 feature: ExtPos (15; 23% instances)

X occurs with 2 feature-value pairs: ExtPos=PROPN, ExtPos=VERB

X occurs with 3 feature combinations. The most frequent feature combination is _ (50 tokens). Examples: XXX, s~, d~, m~, pl~, b~, comple~, c~, c~…, dans

Relations

X nodes are attached to their parents using 13 different relations: reparandum (13; 20% instances), dep (10; 15% instances), discourse (8; 12% instances), obj (7; 11% instances), dislocated (5; 8% instances), nmod (4; 6% instances), obl:arg (4; 6% instances), root (4; 6% instances), conj (3; 5% instances), vocative (3; 5% instances), obl:mod (2; 3% instances), ccomp (1; 2% instances), xcomp (1; 2% instances)
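The percentages in the list above can be checked against the raw counts. The sketch below copies the counts from this page and assumes each share is the count divided by the total number of X tokens, rounded to the nearest whole percent.

```python
# Counts copied from the relation list above.
RELATIONS = {
    "reparandum": 13, "dep": 10, "discourse": 8, "obj": 7,
    "dislocated": 5, "nmod": 4, "obl:arg": 4, "root": 4,
    "conj": 3, "vocative": 3, "obl:mod": 2, "ccomp": 1, "xcomp": 1,
}

total = sum(RELATIONS.values())  # should equal the number of X tokens
shares = {rel: round(100 * n / total) for rel, n in RELATIONS.items()}
```

The counts add up to the 65 X tokens reported at the top of the page, and the rounded shares match the listed percentages.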

Parents of X nodes belong to 10 different parts of speech: VERB (36; 55% instances), NOUN (9; 14% instances), ADV (4; 6% instances), PRON (4; 6% instances), [no parent: root] (4; 6% instances), ADP (2; 3% instances), AUX (2; 3% instances), X (2; 3% instances), ADJ (1; 2% instances), DET (1; 2% instances)

24 (37%) X nodes are leaves.

25 (38%) X nodes have one child.

6 (9%) X nodes have two children.

10 (15%) X nodes have three or more children.

The highest child degree of an X node is 8.
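The four child-degree figures above (24 + 25 + 6 + 10) account for all 65 X tokens. A sketch of how such a breakdown could be computed from the HEAD column of CoNLL-U text follows; `child_degree_buckets` and the `SAMPLE` sentence are hypothetical, and this is not the generator used for this page.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos):
    """Count nodes of one UPOS by number of children: 0, 1, 2, or 3 (= 3+)."""
    buckets = Counter()
    sentence = []  # (id, head, upos) for each word of the current sentence

    def flush():
        # Number of children per node = how often its ID appears as a HEAD.
        children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag == upos:
                buckets[min(children[node_id], 3)] += 1
        sentence.clear()

    # The extra "" guarantees the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            flush()
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Regular word lines only: skip ranges ("1-2") and empty nodes ("1.1").
        if len(cols) == 10 and cols[0].isdigit():
            sentence.append((cols[0], cols[6], cols[3]))
    return dict(buckets)


# Hypothetical sentence: node 1 has three children, node 2 is a leaf.
SAMPLE = "\n".join([
    "1\tXXX\tXXX\tX\t_\t_\t0\troot\t_\t_",
    "2\ts~\ts~\tX\t_\t_\t1\treparandum\t_\t_",
    "3\t,\t,\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "4\tdans\tdans\tADP\t_\t_\t1\tcase\t_\t_",
])

buckets = child_degree_buckets(SAMPLE, "X")
```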

Children of X nodes are attached using 16 different relations: punct (29; 37% instances), case (9; 12% instances), det (6; 8% instances), discourse (6; 8% instances), conj (5; 6% instances), reparandum (5; 6% instances), cc (4; 5% instances), nsubj (3; 4% instances), cop (2; 3% instances), mark (2; 3% instances), nmod (2; 3% instances), acl:relcl (1; 1% instances), aux:tense (1; 1% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), nummod (1; 1% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (29; 37% instances), ADP (8; 10% instances), DET (6; 8% instances), INTJ (6; 8% instances), PRON (5; 6% instances), CCONJ (4; 5% instances), NOUN (4; 5% instances), VERB (4; 5% instances), ADV (3; 4% instances), AUX (3; 4% instances), SCONJ (3; 4% instances), X (2; 3% instances), NUM (1; 1% instances)