Treebank Statistics: UD_French-GSD: POS Tags: PROPN
There are 15365 PROPN lemmas (43%), 15367 PROPN types (33%) and 27718 PROPN tokens (7%).
Out of 16 observed tags, the rank of PROPN is: 1 in number of lemmas, 1 in number of types and 6 in number of tokens.
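The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, assuming that "types" means distinct word forms; the sample rows, `read_tokens` and `tag_counts` are hypothetical names introduced here for illustration and are not part of the treebank tooling.

```python
# Illustrative, hand-made CoNLL-U rows (not taken from UD_French-GSD); columns are
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
ROWS = [
    "1 Jean Jean PROPN _ _ 2 nsubj _ _",
    "2 visite visiter VERB _ _ 0 root _ _",
    "3 Paris Paris PROPN _ _ 2 obj _ _",
    "4 . . PUNCT _ _ 2 punct _ _",
    "",
    "1 Paris Paris PROPN _ _ 2 nsubj _ _",
    "2 dort dormir VERB _ _ 0 root _ _",
    "3 . . PUNCT _ _ 2 punct _ _",
]
# Real CoNLL-U is tab-separated; convert the space-separated rows above.
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def read_tokens(conllu):
    """Yield the column list of every regular token line."""
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword ranges (1-2) and empty nodes (1.1)
            yield cols


def tag_counts(conllu, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for cols in read_tokens(conllu):
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(forms), tokens


print(tag_counts(SAMPLE, "PROPN"))  # (2, 2, 3)
```

Running the same function for every tag and sorting the results would give the ranks reported above.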
The 10 most frequent PROPN lemmas: France, Paris, Europe, États-Unis, de, Jean, Maroc, Espagne, New, York
The 10 most frequent PROPN types: France, Paris, Europe, États-Unis, de, Jean, Maroc, Espagne, New, York
The 10 most frequent ambiguous lemmas: Europe (PROPN 126, X 2), de (ADP 31306, PROPN 113, X 32), New (PROPN 63, X 5), York (PROPN 62, X 3), Canada (PROPN 54, X 1), San (PROPN 46, X 1), saint (NOUN 16, ADJ 10, PROPN 10), John (PROPN 45, X 1), The (X 53, PROPN 32), le (DET 43300, PROPN 12, X 2)
The 10 most frequent ambiguous types: Europe (PROPN 126, X 2), de (ADP 26315, DET 390, PROPN 113, X 32, ADV 1), New (PROPN 63, X 5), York (PROPN 62, X 3), Canada (PROPN 54, X 1), San (PROPN 46, X 1), saint (PROPN 10, NOUN 8, ADJ 4), John (PROPN 45, X 1), The (X 53, PROPN 32), le (DET 13757, PRON 281, PROPN 12, X 3)
- Europe
- de
  - ADP 26315: Aviator , un film sur la vie de Hughes .
  - DET 390: Pas de sèche cheveux ni de prise rasoir dans la salle de bains .
  - PROPN 113: Une bonne raison pour faire un détour à saint Jean de Maurienne .
  - X 32: Il a fait partie de groupes musicaux comme Los Abuelos de la Nada et Los Rodríguez .
  - ADV 1: Ayant renié sa foi pour s’ adonner à les sciences , notamment l’ alchimie et la médecine , il espère découvrir un remède qui soignera Calixte atteinte d’ une maladie qu’ aucun savant de parvient à guérir .
- New
- York
- Canada
- San
- saint
- John
- The
- le
  - DET 13757: C’ est véritablement pour le futur baptisé un changement de cap .
  - PRON 281: Il a fini par le faire .
  - PROPN 12: Les Drangiens furent soumis par les Mèdes , puis par Cyrus le Grand .
  - X 3: Dès le Ve siècle , à Chantelle , sur les bords de la Bouble , existaient un château fort et une église dédiée à saint Vincent , dont s’ empare Pépin le Bref à le VIIIe siècle .
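An ambiguous type, as used in the lists above, appears to be a word form observed with PROPN and at least one other tag. The sketch below shows one way to collect such forms with their per-tag counts. It assumes case-sensitive matching of forms and makes no claim about how the page orders its top 10; the sample rows and `ambiguous_types` are hypothetical.

```python
from collections import Counter, defaultdict

# Illustrative, hand-made CoNLL-U rows (not taken from UD_French-GSD).
ROWS = [
    "1 Jean Jean PROPN _ _ 4 nsubj _ _",
    "2 de de PROPN _ _ 1 flat:name _ _",
    "3 Maurienne Maurienne PROPN _ _ 1 flat:name _ _",
    "4 parle parler VERB _ _ 0 root _ _",
    "5 de de ADP _ _ 6 case _ _",
    "6 Paris Paris PROPN _ _ 4 obl:arg _ _",
    "7 . . PUNCT _ _ 4 punct _ _",
]
# Real CoNLL-U is tab-separated; convert the space-separated rows above.
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def ambiguous_types(conllu, upos):
    """Map each form seen with `upos` and another tag to its tag counts."""
    tags_by_form = defaultdict(Counter)
    for line in conllu.splitlines():
        cols = line.split("\t")
        # Keep regular token lines only (10 columns, plain integer ID).
        if len(cols) == 10 and cols[0].isdigit():
            tags_by_form[cols[1]][cols[3]] += 1
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }


print(ambiguous_types(SAMPLE, "PROPN"))
```

Replacing `cols[1]` (FORM) with `cols[2]` (LEMMA) as the dictionary key would give the corresponding list of ambiguous lemmas.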
Morphology
The form / lemma ratio of PROPN is 1.000130 (the average of all parts of speech is 1.308785).
The 1st highest number of forms (2) was observed with the lemma “Côte”: COTE, Côte.
The 2nd highest number of forms (2) was observed with the lemma “Gaule”: Gaule, Gaulle.
The 3rd highest number of forms (2) was observed with the lemma “Ivoire”: IVOIRE, Ivoire.
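The reported form / lemma ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of the page. The short check below illustrates that arithmetic; that the page computes the ratio exactly this way is an assumption, and `form_lemma_ratio` is a hypothetical helper.

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# Counts taken from the first paragraph of this page.
PROPN_TYPES = 15367
PROPN_LEMMAS = 15365

print(f"{form_lemma_ratio(PROPN_TYPES, PROPN_LEMMAS):.6f}")  # 1.000130
```

A ratio this close to 1 reflects that proper nouns are almost never inflected in the treebank.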
PROPN occurs with 5 features: Number (4988; 18% instances), Gender (3216; 12% instances), ExtPos (23; 0% instances), Typo (13; 0% instances), Foreign (1; 0% instances)
PROPN occurs with 8 feature-value pairs: ExtPos=ADJ, ExtPos=PROPN, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Typo=Yes
PROPN occurs with 18 feature combinations.
The most frequent feature combination is _ (22706 tokens).
Examples: France, Paris, de, Jean, Europe, York, New, Pierre, Charles, Louis
Relations
PROPN nodes are attached to their parents using 24 different relations: nmod (7717; 28% instances), flat:name (6323; 23% instances), appos (3317; 12% instances), nsubj (3067; 11% instances), conj (2437; 9% instances), obl:mod (1621; 6% instances), obl:arg (1321; 5% instances), obj (619; 2% instances), obl:agent (509; 2% instances), nsubj:pass (275; 1% instances), xcomp (189; 1% instances), root (157; 1% instances), acl:relcl (40; 0% instances), nsubj:caus (25; 0% instances), parataxis (24; 0% instances), advcl (23; 0% instances), orphan (22; 0% instances), ccomp (7; 0% instances), dislocated (7; 0% instances), obl (6; 0% instances), obj:agent (5; 0% instances), vocative (5; 0% instances), acl (1; 0% instances), advcl:cleft (1; 0% instances)
Parents of PROPN nodes belong to 13 different parts of speech: NOUN (10932; 39% instances), PROPN (9620; 35% instances), VERB (6296; 23% instances), ADJ (278; 1% instances), [no parent: root] (157; 1% instances), PRON (147; 1% instances), ADV (96; 0% instances), X (86; 0% instances), NUM (66; 0% instances), ADP (21; 0% instances), SYM (12; 0% instances), DET (4; 0% instances), INTJ (3; 0% instances)
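Both distributions above come from the HEAD and DEPREL columns of each PROPN token. The following sketch counts the incoming relation and the parent's part of speech for every node with a given tag. The sample sentence, `sentences` and `parent_stats` are hypothetical, and labelling the parent of a root node as "ROOT" is a choice made here for illustration.

```python
from collections import Counter

# Illustrative, hand-made CoNLL-U rows (not taken from UD_French-GSD).
ROWS = [
    "1 Jean Jean PROPN _ _ 2 nsubj _ _",
    "2 visite visiter VERB _ _ 0 root _ _",
    "3 Paris Paris PROPN _ _ 2 obj _ _",
    "4 et et CCONJ _ _ 5 cc _ _",
    "5 New New PROPN _ _ 3 conj _ _",
    "6 York York PROPN _ _ 5 flat:name _ _",
    "7 . . PUNCT _ _ 2 punct _ _",
]
# Real CoNLL-U is tab-separated; convert the space-separated rows above.
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def sentences(conllu):
    """Yield each sentence as a list of regular token rows."""
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        yield [r for r in rows if r[0].isdigit()]


def parent_stats(conllu, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences(conllu):
        by_id = {cols[0]: cols for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            head = cols[6]
            # HEAD 0 means the node is the root and has no parent token.
            parent_pos["ROOT" if head == "0" else by_id[head][3]] += 1
    return relations, parent_pos


relations, parent_pos = parent_stats(SAMPLE, "PROPN")
print(dict(relations))
print(dict(parent_pos))
```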
9055 (33%) PROPN nodes are leaves.
8136 (29%) PROPN nodes have one child.
6428 (23%) PROPN nodes have two children.
4099 (15%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 34.
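The four child-count figures above add up to the 27718 PROPN tokens (9055 + 8136 + 6428 + 4099). A minimal sketch of how such buckets could be derived from the HEAD column follows. The sample sentence, `sentences` and `child_degree_buckets` are hypothetical names used only for illustration.

```python
from collections import Counter

# Illustrative, hand-made CoNLL-U rows (not taken from UD_French-GSD).
ROWS = [
    "1 Jean Jean PROPN _ _ 2 nsubj _ _",
    "2 visite visiter VERB _ _ 0 root _ _",
    "3 Paris Paris PROPN _ _ 2 obj _ _",
    "4 et et CCONJ _ _ 5 cc _ _",
    "5 New New PROPN _ _ 3 conj _ _",
    "6 York York PROPN _ _ 5 flat:name _ _",
    "7 . . PUNCT _ _ 2 punct _ _",
]
# Real CoNLL-U is tab-separated; convert the space-separated rows above.
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def sentences(conllu):
    """Yield each sentence as a list of regular token rows."""
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        yield [r for r in rows if r[0].isdigit()]


def child_degree_buckets(conllu, upos):
    """Return ({'0', '1', '2', '3+'} -> node count, highest child degree)."""
    buckets = {"0": 0, "1": 0, "2": 0, "3+": 0}
    max_degree = 0
    for sent in sentences(conllu):
        # Number of children per node ID = how often that ID occurs as a HEAD.
        n_children = Counter(cols[6] for cols in sent)
        for cols in sent:
            if cols[3] != upos:
                continue
            degree = n_children[cols[0]]
            buckets["3+" if degree >= 3 else str(degree)] += 1
            max_degree = max(max_degree, degree)
    return buckets, max_degree


print(child_degree_buckets(SAMPLE, "PROPN"))
```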
Children of PROPN nodes are attached using 29 different relations: case (11529; 32% instances), flat:name (6621; 18% instances), det (4815; 13% instances), punct (4452; 12% instances), conj (2498; 7% instances), nmod (2092; 6% instances), cc (1360; 4% instances), appos (1047; 3% instances), amod (424; 1% instances), acl (402; 1% instances), acl:relcl (376; 1% instances), advmod (202; 1% instances), cop (130; 0% instances), nsubj (100; 0% instances), nummod (39; 0% instances), orphan (31; 0% instances), advcl:cleft (26; 0% instances), expl:subj (26; 0% instances), mark (24; 0% instances), parataxis (23; 0% instances), obl:mod (21; 0% instances), advcl (3; 0% instances), dep (3; 0% instances), dislocated (2; 0% instances), aux:tense (1; 0% instances), discourse (1; 0% instances), nsubj:outer (1; 0% instances), obl:arg (1; 0% instances), parataxis:insert (1; 0% instances)
Children of PROPN nodes belong to 16 different parts of speech: ADP (11499; 32% instances), PROPN (9620; 27% instances), DET (4815; 13% instances), PUNCT (4452; 12% instances), NOUN (1624; 4% instances), CCONJ (1323; 4% instances), VERB (795; 2% instances), NUM (640; 2% instances), X (466; 1% instances), ADJ (454; 1% instances), ADV (183; 1% instances), PRON (134; 0% instances), AUX (132; 0% instances), SYM (59; 0% instances), SCONJ (54; 0% instances), INTJ (1; 0% instances)