Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Managing Borderline and Noisy Examples in Imbalanced Classification by Combining SMOTE with Ensemble Filtering

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | [ P ] employee

Year of publication

2014

Chapter type

paper

Publication language

english

Keywords
EN
  • classification
  • imbalanced data
  • SMOTE
  • class noise
  • noise filters
Abstract

EN Imbalance data constitutes a great difficulty for most algorithms learning classifiers. However, as recent works claim, class imbalance is not a problem in itself and performance degradation is also associated with other factors related to the distribution of the data as the presence of noisy and borderline examples in the areas surrounding class boundaries. This contribution proposes to extend SMOTE with a noise filter called Iterative-Partitioning Filter (IPF), which can overcome these problems. The properties of this proposal are discussed in a controlled experimental study against SMOTE and its most well-known generalizations. The results show that the new proposal performs better than exiting SMOTE generalizations for all these different scenarios.

Pages (from - to)

61 - 68

DOI

10.1007/978-3-319-10840-7_8

URL

https://link.springer.com/chapter/10.1007/978-3-319-10840-7_8

Book

Intelligent Data Engineering and Automated Learning - IDEAL 2014 : 15th International Conference, Salamanca, Spain, September 10-12, 2014 : proceedings

Presented on

15th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2014, 10-12.09.2014, Salamanca, Spain

Publication indexed in

WoS (15)

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.