Chapter

Title

Discovering Minority Sub-clusters and Local Difficulty Factors from Imbalanced Data

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | [ P ] employee | [ S ] student

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2017

Chapter type

chapter in monograph / paper

Publication language

English

Keywords

  • class imbalance
  • minority class categorization
  • data difficulty factors
  • class overlapping
  • minority sub-clusters

Abstract

Learning classifiers from imbalanced data is particularly challenging when class imbalance is accompanied by local data difficulty factors, such as outliers, rare cases, class overlapping, or minority class decomposition. Although these issues have been highlighted in previous research, there have been no proposals of algorithms that simultaneously detect all the aforementioned difficulties in a dataset. In this paper, we put forward two extensions to popular clustering algorithms, ImKmeans and ImScan, and one novel algorithm, ImGrid, that attempt to detect minority sub-clusters, outliers, rare cases, and class overlapping. Experiments with artificial datasets show that ImGrid, which uses a Bayesian test to join similar neighboring regions, is able to re-discover simulated clusters and types of minority examples on par with competing methods, while being the least sensitive to parameter tuning.

Date of online publication

16.09.2017

Pages (from - to)

324 - 339

DOI

10.1007/978-3-319-67786-6_23

URL

https://link.springer.com/chapter/10.1007/978-3-319-67786-6_23

Book

Discovery Science : 20th International Conference, DS 2017, Kyoto, Japan, October 15–17, 2017 : Proceedings

Presented on

20th International Conference on Discovery Science DS 2017, 15-17.10.2017, Kyoto, Japan

Ministry points / chapter

20

Ministry points / conference (CORE)

20

Publication indexed in

WoS (15)
