What makes multi-class imbalanced problems difficult? An experimental study
[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee
2022
scientific article
English
- Imbalanced data
- Classification
- Learning from multiple classes
- Data difficulty factors
EN Multi-class imbalanced classification is more difficult and less frequently studied than its binary counterpart. Moreover, research on what makes multi-class imbalanced data difficult remains limited. Therefore, we experimentally study the impact of various difficulty factors of multi-class imbalanced data on the performance of three popular classifiers. The results demonstrated a strong influence of class overlapping, with the extent of its impact depending on the types of the overlapping classes. In particular, overlapping between minority and majority classes was more difficult than the other types of overlapping. The type of class size configuration turned out to be another important factor, highlighting the special role of configurations with classes of intermediate sizes. The obtained results could support studying the nature of multi-class imbalanced data as well as the development of new methods for improving classifiers.
02.04.2022
116962-1 - 116962-13
Article Number: 116962
140
8.5