Statistics of NUM in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Chinese-GSD: POS Tags: `NUM`

There are 1255 NUM lemmas (6%), 1255 NUM types (6%) and 6660 NUM tokens (5%). Out of 16 observed tags, the rank of NUM is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent NUM lemmas: 一、兩、三、 1、第一、 3、 12、 5、 2、 8

The 10 most frequent NUM types: 一、兩、三、 1、第一、 3、 12、 5、 2、 8

The 10 most frequent ambiguous lemmas: 一 (NUM 1124, NOUN 1), 第一 (NUM 117, ADJ 1, PROPN 1), 多 (NUM 83, ADV 28, ADJ 16, PART 3), 雙 (NUM 35, NOUN 1), 很多 (NUM 33, ADJ 4), 單 (NUM 26, PART 2), 半 (NUM 24, PART 6), 數 (NUM 22, PART 15), 九 (NUM 16, PROPN 2), 眾多 (ADJ 8, NUM 8)

The 10 most frequent ambiguous types: 一 (NUM 1124, NOUN 1), 第一 (NUM 117, ADJ 1, PROPN 1), 多 (NUM 83, ADV 28, ADJ 16, PART 3), 雙 (NUM 35, NOUN 1), 很多 (NUM 33, ADJ 4), 單 (NUM 26, PART 2), 半 (NUM 24, PART 6), 數 (NUM 22, PART 15), 九 (NUM 16, PROPN 2), 眾多 (ADJ 8, NUM 8)

一
- NUM 1124: 其測試包含了美術治療法，認知行為治療和洞察療法，同時給行為分析提供了一個理論性的交流平台。
- NOUN 1: 這一修正案涉及公民權利和平等法律保護，最初提出是為了解決南北戰爭後昔日奴隸的相關問題。
第一
- NUM 117: 北京站是當時中國大陸規模最大、設備最先進的鐵路車站，也是第一個現代化大型鐵路客運站。
- ADJ 1: KKR 的資本募集主要局限於一小部分投資者，這其中就包括希爾曼（ Hillman ）家族和第一芝加哥銀行。
- PROPN 1: 而此前《第一財經日報》的報道稱中國的油價已經高於美國，在這些問題下人們開始疑問為什麼中國油價會如此之高。
多
- NUM 83: 而且學校的伙食和住宿條件也多年遭到在校生的詬病。
- ADV 28: 近年來，肯亞女子長距離田徑項目也開始嶄露頭角，而這些女運動員們也多為卡倫金人。
- ADJ 16: 後來卡通造型的桑德斯上校（由演員 Randy Quaid 配音），出現在越來越多的肯德基廣告中。
- PART 3: 研討會和講座可以享用多媒體演示、視頻會議和同時由數個不同地點的通訊的設備的支持。
雙
- NUM 35: 實際上的雙筒望遠鏡當然多少有些誤差。
- NOUN 1: 而復寫眼持有者因為這雙特殊的眼睛可以直接跳過這一步，在其後的學習中也要比一般修習者快上數倍。
很多
- NUM 33: 由於這次失事原因涉及很多敏感的爭議性，因此最後仍未有一個具體及統一的事故調查報告。
- ADJ 4: 在很多城市，羅素被扣上異端的帽子，隨之而來的批評家數量也是直線上升。
單
- NUM 26: 一般來說，同一款間格的單邊單位比非單邊的呎價約貴 20% 。
- PART 2: 各地的分部辦事處之前會準備好足夠的邀請單，以便在發放時給住戶。
半
- NUM 24: 半腰座椅，亦稱半腰位、半截座椅，鐵路車輛座位的一種。
- PART 6: 她的姥姥講俄語，並且是半俄國血統半威爾士血統。
數
- NUM 22: 研討會和講座可以享用多媒體演示、視頻會議和同時由數個不同地點的通訊的設備的支持。
- PART 15: 與靜態酒不同，較高糖份的葡萄並不是氣泡酒的上選原料，所以葡萄植株的掛果數也會比較多。
九
- NUM 16: 1920 年（大正九年） 8 月，澤田俊郎歸國到仙台結婚，後偕新婚妻子再度回到滿洲。
- PROPN 2: 2000 年代九廣鐵路計劃興建落馬洲支線經過該地，引起了對該處生態影響的關注。
眾多
- ADJ 8: 在眾多書迷的推動下，惡靈系列在 2010 年 11 月 19 日再度以 Ghost Hunt 之名重新出版。
- NUM 8: 酒店成為眾多社會名流，富商聚集的場所。

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.004819).

The 1st highest number of forms (1) was observed with the lemma “-15”: -15.

The 2nd highest number of forms (1) was observed with the lemma “-154”: -154.

The 3rd highest number of forms (1) was observed with the lemma “-300”: -300.

NUM occurs with 1 features: NumType (6659; 100% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, NumType=Ord

NUM occurs with 3 feature combinations. The most frequent feature combination is NumType=Card (6258 tokens). Examples: 一、兩、三、 1、 3、 12、 5、 2、 8、 10

Relations

NUM nodes are attached to their parents using 17 different relations: nummod (6237; 94% instances), obj (61; 1% instances), conj (58; 1% instances), obl (58; 1% instances), root (53; 1% instances), nmod (51; 1% instances), parataxis (44; 1% instances), nsubj (30; 0% instances), acl (12; 0% instances), appos (12; 0% instances), nmod:tmod (12; 0% instances), compound (10; 0% instances), advcl (6; 0% instances), amod (6; 0% instances), ccomp (5; 0% instances), xcomp (4; 0% instances), nsubj:pass (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (6201; 93% instances), VERB (171; 3% instances), PART (96; 1% instances), NUM (72; 1% instances), (53; 1% instances), PROPN (25; 0% instances), X (22; 0% instances), ADJ (16; 0% instances), DET (3; 0% instances), SYM (1; 0% instances)

4241 (64%) NUM nodes are leaves.

2239 (34%) NUM nodes have one child.

62 (1%) NUM nodes have two children.

118 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 25 different relations: clf (2033; 72% instances), punct (183; 6% instances), nmod (143; 5% instances), nsubj (97; 3% instances), cop (93; 3% instances), case (59; 2% instances), conj (57; 2% instances), cc (44; 2% instances), advmod (34; 1% instances), acl (22; 1% instances), det (15; 1% instances), parataxis (12; 0% instances), nummod (10; 0% instances), appos (8; 0% instances), nmod:tmod (7; 0% instances), obl (5; 0% instances), csubj (4; 0% instances), flat:foreign (4; 0% instances), mark (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: NOUN (2234; 79% instances), PUNCT (183; 6% instances), AUX (93; 3% instances), PART (77; 3% instances), NUM (72; 3% instances), CCONJ (44; 2% instances), ADV (34; 1% instances), VERB (24; 1% instances), ADP (17; 1% instances), DET (15; 1% instances), PROPN (13; 0% instances), PRON (12; 0% instances), X (10; 0% instances), SYM (5; 0% instances), ADJ (3; 0% instances), SCONJ (2; 0% instances)

Treebank Statistics: UD_Chinese-GSD: POS Tags: NUM

Morphology

Relations

Treebank Statistics: UD_Chinese-GSD: POS Tags: `NUM`