Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/79399
Title: การเปรียบเทียบเทคนิคการเพิ่มประสิทธิภาพของการประมวลผลแมพรีดิวซ์สำหรับข้อมูลที่มีความเบ้
Other Titles: Comparision of Efficiency Improvement Techniques for MapReduce with Skewed Data
Authors: นักปราชญ์ กันตีวงศ์
Authors: พฤษภ์ บุญมา
นักปราชญ์ กันตีวงศ์
Issue Date: 11-Nov-2566
Publisher: เชียงใหม่ : บัณฑิตวิทยาลัย มหาวิทยาลัยเชียงใหม่
Abstract: Nowadays, data is large in terms of volume, variety of format and changes rapidly. Therefore, it is necessary to rely on processing to make data accuracy and precision. To ensure real-time utilization and maximum benefit. MapReduce is a framework for processing big data, including 2 functions: Map function and Reduce function. Map function processes input data set as key/value pairs and generates intermediate key/value pairs with the same key as output. Reduce function merges all intermediate values in each set. In this process, the reducer must wait for all maps to finish before the reducer starts. Due to non-uniform distribution of big data. MapReduce process will be delayed because data that split to all nodes is not same size in map stages. As a result, each node complete processing at different times. Nodes that finish earlier must wait for the final mapper node to complete before they can proceed with reducing. The results show that all of the algorithms studied in this paper can improve the execution time of MapReduce with skewed data. However, there are some limitations to improvement, especially when data is not heavily skewed; the overhead of the algorithms might overcome their benefits.
URI: http://cmuir.cmu.ac.th/jspui/handle/6653943832/79399
Appears in Collections:ENG: Theses

Files in This Item:
File Description SizeFormat 
620631056 nakprad kanteewong.pdf9.51 MBAdobe PDFView/Open    Request a copy


Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.