Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

An Algorithm for Selective Preprocessing of Multi-class Imbalanced Data

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2018

Chapter type

chapter in monograph / paper

Publication language

english

Abstract

EN In this paper we propose a new algorithm called SPIDER3 for selective preprocessing of multi-class imbalanced data sets. While it borrows selected ideas (i.e., combination of relabeling and local resampling) from its predecessor – SPIDER2, it introduces several important extensions. Unlike SPIDER2, it is able to handle directly multi-class problems. Moreover, it considers the relevance of specific decision classes to control the order of their processing. Finally, it uses information about relations between specific classes (modeled with misclassification costs) to better control the extent of changes introduced locally to preprocessed data. We performed a computational experiment on artificial 3-class data sets to evaluate and compare SPIDER3 to SPIDER2 with temporarily aggregated classes and the results confirmed advantages of the new algorithm.

Pages (from - to)

238 - 247

DOI

10.1007/978-3-319-59162-9_25

URL

https://link.springer.com/chapter/10.1007/978-3-319-59162-9_25

Book

Proceedings of the 10th International Conference on Computer Recognition Systems CORES 2017

Presented on

10th International Conference on Computer Recognition Systems CORES 2017, 22-24.05.2017, Polanica Zdrój, Poland

Ministry points / chapter

20

Publication indexed in

WoS (15)

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.