การใส่ค่าข้อมูลที่ขาดหายโดยวิธีเพื่อนบ้านใกล้ที่สุดเคตัวเพื่อการจำแนกประเภทในชุดข้อมูลอสมดุล

จินตนา ตาคำ

Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/69257

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	ผู้ช่วยศาสตราจารย์ ดร.ชุมพล บุญคุ้มพรภัทร	-
dc.contributor.author	จินตนา ตาคำ	en_US
dc.date.accessioned	2020-08-03T07:38:25Z	-
dc.date.available	2020-08-03T07:38:25Z	-
dc.date.issued	2015-04	-
dc.identifier.uri	http://cmuir.cmu.ac.th/jspui/handle/6653943832/69257	-
dc.description.abstract	Class imbalance is a problem that aims to improve the accuracy of a minority class, while imputation is a process to replace missing values. Traditionally, class imbalance and imputation problems are considered independently. In addition, filled-in minority-class values that are substituted by traditional methods are not sufficient for imbalance datasets. In this paper, we provide a new parameter-free imputation to operate on imbalance datasets by estimating a random value between the mean of the missing value attribute and a value in this attribute of the closet record instance from the missing value record. Our proposed algorithm ignores mean of instances to avoid an over-fitting problem. Consequently, experimental results on imbalance datasets reveal that our imputation outperforms other techniques, when class imbalance measures are used.	en_US
dc.language.iso	th	en_US
dc.publisher	เชียงใหม่ : บัณฑิตวิทยาลัย มหาวิทยาลัยเชียงใหม่	en_US
dc.title	การใส่ค่าข้อมูลที่ขาดหายโดยวิธีเพื่อนบ้านใกล้ที่สุดเคตัวเพื่อการจำแนกประเภทในชุดข้อมูลอสมดุล	en_US
dc.title.alternative	k–Nearest Neighbour Imputation for Classification in Imbalance Datasets	en_US
dc.type	Thesis
thesis.degree	master	en_US
thesis.description.thaiAbstract	ชั้นข้อมูลอสมดุลเป็นปัญหาที่ต้องการปรับปรุงความถูกต้องของชั้นข้อมูลกลุ่มด้อย ในขณะที่การใส่ค่าคือกระบวนการเติมค่าที่ขาดหาย โดยทั่วไปชั้นข้อมูลอสมดุลและปัญหาการใส่ค่าคือปัญหาที่เป็นอิสระต่อกัน นอกเหนือจากนี้การใส่ค่าในชั้นข้อมูลด้อยโดยวิธีการแทนที่แบบเดิมยังไม่เพียงพอสำหรับชุดข้อมูลอสมดุล ในงานวิจัยนี้เราได้นำเสนอเทคนิคใหม่ในการใส่ค่าโดยไม่ต้องมีการกำหนดค่าพารามิเตอร์ ซึ่งวิธีการดำเนินการในชุดข้อมูลอสมดุลจะทำโดยการประมาณค่าจากการสุ่มระหว่างค่าเฉลี่ยของแอตทริบิวต์ที่เกิดการขาดหายและค่าของข้อมูลในแอตทริบิวต์ที่ใกล้ที่สุดจากค่าที่ขาดหายไป อัลกอริทึมที่นำเสนอไม่สนใจค่าเฉลี่ยของข้อมูลเพื่อหลีกเลี่ยงปัญหาการซับซ้อนกันของข้อมูล ดังนั้นผลการทดลองของชุดข้อมูลอสมดุลแสดงให้เห็นว่าเทคนิคการใส่ค่ามีประสิทธิภาพดีกว่าเทคนิคอื่นๆเมื่อถูกนำมาใช้กับชั้นข้อมูลอสมดุล	en_US
Appears in Collections:	SCIENCE: Theses

Files in This Item:

File	Description	Size	Format
Full.pdf		3.48 MB	Adobe PDF	View/Open Request a copy

Show simple item record