A KNN Undersampling Approach for Data Balancing

Beckmann, Marcelo and Ebecken, Nelson F. F. and Pires de Lima, Beatriz S. L. (2015) A KNN Undersampling Approach for Data Balancing. Journal of Intelligent Learning Systems and Applications, 07 (04). pp. 104-116. ISSN 2150-8402

JILSA_2015111114204642.pdf - Published Version

Abstract

In supervised learning, an imbalanced number of instances among the classes in a dataset can cause algorithms to classify an instance from the minority class as one from the majority class. To address this problem, the KNN algorithm provides a basis for other balancing methods. These balancing methods are revisited in this work, and a new and simple KNN undersampling approach is proposed. The experiments demonstrated that the KNN undersampling method outperformed other sampling methods. The proposed method also outperformed the results of other studies, which indicates that the simplicity of KNN can serve as a basis for efficient algorithms in machine learning and knowledge discovery.
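
The abstract does not spell out the procedure, so the following is only a minimal sketch of what a KNN-based undersampling step could look like. It assumes a rule in which a majority-class instance is dropped when at least t of its k nearest neighbors belong to another class. The function name `knn_undersample`, the parameters `k` and `t`, and the toy data are all hypothetical and are not taken from the article.

```python
from math import dist  # Euclidean distance, Python 3.8+


def knn_undersample(X, y, majority_label, k=3, t=1):
    """Illustrative rule (an assumption, not the article's exact method):
    drop a majority-class point when at least t of its k nearest
    neighbors carry a different label. Other points are always kept."""
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority_label:
            keep.append(i)
            continue
        # Indices of the k nearest other points by Euclidean distance.
        neighbors = sorted(
            (j for j in range(len(X)) if j != i),
            key=lambda j: dist(xi, X[j]),
        )[:k]
        # Count neighbors whose label differs from the majority label.
        other = sum(1 for j in neighbors if y[j] != majority_label)
        if other < t:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


# Toy data: class 0 is the majority; the last class-0 point sits
# next to the two class-1 points.
X = [(0, 0), (0, 1), (1, 0), (1, 1), (5, 5), (5, 6), (4.6, 5.0)]
y = [0, 0, 0, 0, 1, 1, 0]
X_bal, y_bal = knn_undersample(X, y, majority_label=0, k=3, t=1)
```

In this toy run only the majority point at (4.6, 5.0) is removed, because two of its three nearest neighbors are minority instances. The brute-force neighbor search is quadratic in the number of instances and is meant for illustration only.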

Item Type: Article
Subjects: OA Library Press > Medical Science
Depositing User: Unnamed user with email support@oalibrarypress.com
Date Deposited: 24 Jan 2023 06:56
Last Modified: 01 Jul 2024 09:11
URI: http://archive.submissionwrite.com/id/eprint/172
