Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

The Usefulness of Roughly Balanced Bagging for Complex and High-Dimensional Imbalanced Data

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | [ P ] employee

Year of publication

2016

Chapter type

paper

Publication language

english

Keywords
EN
  • class imbalance
  • roughly balanced bagging
  • types of minority examples
  • high-dimensional data
  • random subspace method
Abstract

EN Under-sampling generalizations of bagging ensembles improve classification of imbalanced data better than other ensembles. Roughly Balanced Bagging is the most accurate among them. In this paper, we experimentally study its properties that may influence its good performance. Results of experiments show that it can be constructed with a small number of component classifiers. However, they are less diversified than components of the standard bagging. Moreover, its good performance comes from its ability to recognize unsafe types of minority examples better than other ensembles. We also present how to improve its performance by integrating bootstrap sampling with the random selection of attributes.

Pages (from - to)

93 - 107

DOI

10.1007/978-3-319-39315-5_7

URL

https://link.springer.com/chapter/10.1007/978-3-319-39315-5_7

Book

New Frontiers in Mining Complex Patterns : 4th International Workshop, NFMCP 2015, Held in Conjunction with ECML-PKDD 2015, Porto, Portugal, September 7, 2015 : Revised Selected Papers

Presented on

4th International Workshop on New Frontiers in Mining Complex Patterns, NFMCP 2015, 7.09.2015, Porto, Portugal

Publication indexed in

WoS (15)

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.