Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Article

Download file Download BibTeX

Title

Explaining data changes with prototypes: A measure-driven approach

Authors

[ 1 ] Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ 2 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ SzD ] doctoral school student | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2026

Published in

Information Fusion

Journal year: 2026 | Journal volume: vol. 126, part A

Article type

scientific article

Publication language

english

Keywords
EN
  • Explainable Artificial Intelligence
  • Prototype explanation
  • Data shift
  • Evolving data stream
Abstract

EN Prototype explanations of machine learning models have been considered solely for static data, while their use for concept drifting data still remains underexplored. In this work, this challenge is addressed using the algorithm that explains the predictions of the Random Forest tree ensemble classifier with a limited number of prototypes. This also involves the proposal of new measures to evaluate prototypes in static and evolving settings, enabling comparison of prototype sets before and after the data change and the construction of new drift detectors. The presented proposals are evaluated through many experiments. In the first experiments with synthetic datasets, the new measures – mean minimal distance, mean centroid displacement, and prototype reassignment impact – proved effective when evaluated using a set of diverse synthetic data generators and real-world data streams. Then, for incremental learning, the RACE-P algorithm is introduced, leveraging prototypes for interpretable drift detection. Experiments demonstrate competitive performance against established detection methods such as ADWIN and Page–Hinkley. Additionally, the use of prototypes to analyse and explain detected drifts is discussed, underscoring their potential to enhance understanding of data evolution.

Date of online publication

09.08.2025

Pages (from - to)

103602-1 - 103602-13

DOI

10.1016/j.inffus.2025.103602

URL

https://www.sciencedirect.com/science/article/pii/S1566253525006748

Comments

Article Number: 103602

License type

CC BY (attribution alone)

Open Access Mode

czasopismo hybrydowe

Open Access Text Version

final published version

Date of Open Access to the publication

in press

Full text of article

Download file

Access level to full text

public

Ministry points / journal

200

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.