Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: `NOUN`

There are 2694 NOUN lemmas (36%), 2697 NOUN types (36%) and 16489 NOUN tokens (28%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: người, ông, anh, nhà, khi, bà, con, ngày, năm, cô

The 10 most frequent NOUN types: người, ông, anh, nhà, khi, bà, con, ngày, năm, cô

The 10 most frequent ambiguous lemmas: ông (NOUN 395, PRON 12, PROPN 1), anh (NOUN 271, PRON 1), bà (NOUN 227, PRON 1), con (NOUN 202, ADJ 3), ngày (NOUN 156, ADV 1), năm (NOUN 151, NUM 19), việc (NOUN 105, VERB 1), sau (NOUN 99, ADP 59, ADJ 27), chị (NOUN 91, PROPN 1), lần (NOUN 91, VERB 8)

The 10 most frequent ambiguous types: ông (NOUN 292, PRON 12, PROPN 1), anh (NOUN 226, PRON 1), bà (NOUN 174, PRON 1), con (NOUN 191, ADJ 3), ngày (NOUN 143, ADV 1), năm (NOUN 137, NUM 17), việc (NOUN 94, VERB 1), sau (NOUN 78, ADP 31, ADJ 27), lần (NOUN 76, VERB 5), cái (NOUN 79, PART 4)

ông
- NOUN 292: Thưa ông , sổ đỏ đã được giao cho người dân .
- PRON 12: ” Tại sao bà ta lại thích nhà của ông ? “ .
- PROPN 1: Một cựu chiến binh như ông , một người cha như ông có bốn đứa con ở huyện An Lão ( Hải Phòng ) .
anh
- NOUN 226: Nghe báo cáo có các anh Ba Bường , Ba Hương , Mười Thơ , Vũ Đình Liệu .
- PRON 1: Cường hỏi : “ thâm tâm anh có nghi ngờ vợ anh không ? “ .
bà
- NOUN 174: Nhưng chỉ đến lúc cháu gái đủ tuổi đi học , bà Năm mới thấm thía .
- PRON 1: Nguyễn văn nhã , anh con trai lớn của bà , đi thanh niên xung phong vài tháng thì bị trả về vì có biểu hiện bất bình thường .
con
- NOUN 191: ” bắt kiểu này ngày được bao nhiêu con ? “ - tôi hỏi .
- ADJ 3: Tôi nghe câu chuyện anh Thanh cho anh Đu mượn chú heo con .
ngày
- NOUN 143: ” bắt kiểu này ngày được bao nhiêu con ? “ - tôi hỏi .
- ADV 1: Đến ngày 10 - 8 , đội lặn tiếp tục khảo sát và phát hiện lượng nước rò rỉ ngày nhiều hơn .
năm
- NOUN 137: Bốn năm sau , con gái bà cũng chết vì AIDS , để lại hai đứa trẻ mồ côi .
- NUM 17: Ba , năm rồi 60 em …
việc
- NOUN 94: Gia đình đã khác mọi mặt , ngoại trừ việc vẫn nghèo .
- VERB 1: Cũng theo ông Thành , làm Bách khoa toàn thư là một quá trình , không giống như việc tạo ra sản phẩm đóng gói là xong nhiệm vụ .
sau
- NOUN 78: Bốn năm sau , con gái bà cũng chết vì AIDS , để lại hai đứa trẻ mồ côi .
- ADP 31: Nỗi đau sau cuộc chiến .
- ADJ 27: Như dự đoán của Thanh , mấy hôm sau Hùng gặp bà Loan thưa chuyện .
lần
- NOUN 76: Tức là lần đó tôi không gặp được anh Hai Địa .
- VERB 5: Tôi cũng ngậm ống hơi , đeo băng chì rồi lần dây mồi xuống theo .
cái
- NOUN 79: Còn chỗ này , cái mũi nhọn này , Thanh nhìn rõ không , là Mũi Đèn .
- PART 4: Nhưng khổ cái vẫn câm điếc không biết gì .

Morphology

The form / lemma ratio of NOUN is 1.001114 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (2) was observed with the lemma “quân y”: Quân Y, Quân y.

The 2nd highest number of forms (2) was observed with the lemma “tp . hcm”: TP . HCM, Tp . hcm.

The 3rd highest number of forms (2) was observed with the lemma “vks”: VKS, Vks.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 56 different relations: obj (3945; 24% instances), nsubj (2622; 16% instances), nmod (1975; 12% instances), compound (1193; 7% instances), obl:tmod (982; 6% instances), obl (939; 6% instances), clf:det (882; 5% instances), obl:comp (721; 4% instances), conj (686; 4% instances), nmod:poss (399; 2% instances), root (322; 2% instances), clf (249; 2% instances), nsubj:pass (194; 1% instances), obl:with (140; 1% instances), appos (131; 1% instances), obl:adj (126; 1% instances), appos:nmod (119; 1% instances), obl:iobj (97; 1% instances), advcl (85; 1% instances), nsubj:nn (84; 1% instances), obl:agent (72; 0% instances), obl:about (66; 0% instances), ccomp (60; 0% instances), compound:verbnoun (53; 0% instances), parataxis (53; 0% instances), case (35; 0% instances), dislocated (33; 0% instances), list (29; 0% instances), iobj (23; 0% instances), acl:subj (21; 0% instances), flat:number (16; 0% instances), acl (14; 0% instances), amod (12; 0% instances), flat:time (12; 0% instances), acl:tmod (11; 0% instances), flat (11; 0% instances), compound:vmod (10; 0% instances), csubj (10; 0% instances), fixed (6; 0% instances), nsubj:xsubj (6; 0% instances), csubj:pass (5; 0% instances), nummod (5; 0% instances), vocative (5; 0% instances), compound:pron (4; 0% instances), compound:z (4; 0% instances), dep (4; 0% instances), flat:name (4; 0% instances), obl:adv (3; 0% instances), xcomp (3; 0% instances), acl:tonp (2; 0% instances), advcl:objective (1; 0% instances), compound:amod (1; 0% instances), compound:dir (1; 0% instances), compound:svc (1; 0% instances), discourse (1; 0% instances), flat:date (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech: VERB (9098; 55% instances), NOUN (5342; 32% instances), ADJ (784; 5% instances), PROPN (554; 3% instances), (322; 2% instances), NUM (180; 1% instances), DET (75; 0% instances), ADP (55; 0% instances), PRON (40; 0% instances), ADV (16; 0% instances), X (10; 0% instances), AUX (8; 0% instances), SYM (4; 0% instances), PART (1; 0% instances)

6066 (37%) NOUN nodes are leaves.

5034 (31%) NOUN nodes have one child.

3087 (19%) NOUN nodes have two children.

2302 (14%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 15.

Children of NOUN nodes are attached using 70 different relations: case (2479; 13% instances), nmod (2407; 12% instances), compound (1821; 9% instances), punct (1726; 9% instances), nummod (1373; 7% instances), acl:subj (1263; 6% instances), amod (1060; 5% instances), det (1017; 5% instances), conj (713; 4% instances), compound:vmod (703; 4% instances), det:pmod (672; 3% instances), nmod:poss (560; 3% instances), clf:det (486; 2% instances), cop (343; 2% instances), cc (290; 1% instances), advmod (260; 1% instances), acl (233; 1% instances), acl:tmod (232; 1% instances), nsubj (202; 1% instances), obl (194; 1% instances), advmod:adj (185; 1% instances), appos:nmod (129; 1% instances), discourse (126; 1% instances), flat:date (118; 1% instances), appos (114; 1% instances), nsubj:nn (104; 1% instances), acl:tonp (97; 0% instances), compound:amod (96; 0% instances), obl:tmod (80; 0% instances), mark (77; 0% instances), nummod:det (70; 0% instances), clf (45; 0% instances), compound:pron (44; 0% instances), parataxis (43; 0% instances), obl:comp (42; 0% instances), advcl (38; 0% instances), list (37; 0% instances), advmod:neg (32; 0% instances), advcl:objective (28; 0% instances), ccomp (25; 0% instances), xcomp (22; 0% instances), acl:relcl (21; 0% instances), obl:about (19; 0% instances), obl:with (19; 0% instances), csubj (15; 0% instances), aux (13; 0% instances), obj (13; 0% instances), aux:pass (12; 0% instances), compound:z (12; 0% instances), csubj:asubj (12; 0% instances), csubj:vsubj (11; 0% instances), flat (9; 0% instances), flat:time (9; 0% instances), fixed (5; 0% instances), flat:name (5; 0% instances), nsubj:pass (5; 0% instances), dep (4; 0% instances), obl:adj (3; 0% instances), obl:iobj (3; 0% instances), vocative (3; 0% instances), compound:dir (2; 0% instances), compound:svc (2; 0% instances), compound:verbnoun (2; 0% instances), dislocated (2; 0% instances), mark:pcomp (2; 0% instances), flat:number (1; 0% instances), nsubj:xsubj (1; 0% instances), obl:adv (1; 0% instances), xcomp:dir (1; 0% instances), xcomp:vcomp (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (5342; 27% instances), VERB (2627; 13% instances), ADP (2350; 12% instances), PUNCT (1726; 9% instances), NUM (1646; 8% instances), ADJ (1598; 8% instances), PROPN (1175; 6% instances), PRON (1073; 5% instances), DET (935; 5% instances), AUX (376; 2% instances), ADV (303; 2% instances), CCONJ (244; 1% instances), SCONJ (224; 1% instances), PART (105; 1% instances), SYM (36; 0% instances), X (31; 0% instances), INTJ (3; 0% instances)

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: `NOUN`