Please use this identifier to cite or link to this item:
Title: The Effective Redistribution for Imbalance Dataset : Relocating Safe-Level SMOTE with Minority Outcast Handling
Authors: Wacharasak Siriseriwan
Krung Sinapiromsaran
Authors: Wacharasak Siriseriwan
Krung Sinapiromsaran
Keywords: class imbalance problem;oversampling;SMOTE;Safe-level SMOTE;minority outcast handling
Issue Date: 2016
Publisher: Science Faculty of Chiang Mai University
Citation: Chiang Mai Journal of Science 43, 1 (Jan 2016), 234 - 246
Abstract: The redistribution of the target class by oversampling synthetic minority instances is one of the effective directions for class imbalance problem. Safe-level SMOTE generates synthetic minority instances around original instances while avoiding nearby majority ones. However, despite of this intention, it is still possible that some synthetic instances can be placed too close to nearby majority instances which possibly confuse some classifiers. Moreover, Safe-Level SMOTE technically avoids using minority outcast instances for generating synthetic instances. This generated dataset may lose some precious information of minority class. Our paper aims to remedy these two drawbacks of Safe-Level SMOTE by combining two processes. The first one is checking and moving these synthetic instances away from possibly surrounding majority instances. The second is handling minority outcast with 1-nearest neighbor model. The empirical results on UCI and PROMISE datasets show the improvements of F-measure, which is the performance measure used in the class imbalance problem, for various classifiers such as decision tree, naïve Bayes classifier, multilayer perceptron, support vector machine and K-nearest neighbor. The improvements are tested by Wilcoxon sign test to show its significance.
ISSN: 0125-2526
Appears in Collections:CMUL: Journal Articles

Files in This Item:
There are no files associated with this item.

Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.