Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Prequential AUC for Classifier Evaluation and Drift Detection in Evolving Data Streams

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | [ P ] employee

Year of publication

2015

Chapter type

paper

Publication language

english

Keywords
EN
  • AUC
  • data stream
  • class imbalance
  • concept drift
Abstract

EN Detecting and adapting to concept drifts make learning data stream classifiers a difficult task. It becomes even more complex when the distribution of classes in the stream is imbalanced. Currently, proper assessment of classifiers for such data is still a challenge, as existing evaluation measures either do not take into account class imbalance or are unable to indicate class ratio changes in time. In this paper, we advocate the use of the area under the ROC curve (AUC) in imbalanced data stream settings and propose an efficient incremental algorithm that uses a sorted tree structure with a sliding window to compute AUC using constant time and memory. Additionally, we experimentally verify that this algorithm is capable of correctly evaluating classifiers on imbalanced streams and can be used as a basis for detecting changes in class definitions and imbalance ratio.

Pages (from - to)

87 - 101

DOI

10.1007/978-3-319-17876-9_6

URL

https://link.springer.com/chapter/10.1007/978-3-319-17876-9_6

Book

New Frontiers in Mining Complex Patterns : 3th International Workshop, NFMCP 2014, Held in Conjunction with ECML-PKDD 2014, Nancy, France, September 19, 2014 : Revised Selected Papers

Presented on

Third International Workshop, NFMCP 2014, Held in Conjunction with ECML-PKDD 2014, 19.09.2014, Nancy, France

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.