Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Article

Download BibTeX

Title

Granular approximations: A novel statistical learning approach for handling data inconsistency with respect to a fuzzy relation

Authors

[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2023

Published in

Information Sciences

Journal year: 2023 | Journal volume: vol. 629

Article type

scientific article

Publication language

english

Keywords
EN
  • Inconsistencies in data
  • Fuzzy logic
  • Statistical learning
  • Rough sets
Abstract

EN Inconsistency in classification and regression problems occurs when instances that relate in a certain way on the condition attributes, do not follow the same relation on the decision attribute. It typically appears as a result of perturbation in data caused by incomplete knowledge (missing attributes) or by random effects that occur during data generation (instability in the assessment of decision attribute values). Inconsistencies with respect to a crisp preorder relation (expressing either dominance or indiscernibility between instances) can be handled with set-theoretic approaches like rough sets and by using statistical/machine learning approaches that involve optimization methods. In particular, the Kotłowski-Słowiński (KS) approach relabels the objects from a dataset such that inconsistencies are removed, and such that the new class labels are as close as possible to the original ones in terms of a given loss function. In this paper, we generalize the KS approach to handle inconsistency determined by a fuzzy preorder relation rather than a crisp one. The method produces a consistent fuzzy relabeling of the instances and may be used as a preprocessing tool with algorithms for binary classification and regression. As the obtained fuzzy sets can be represented as unions of meaningful simple fuzzy sets or granules, we call them granular approximations. We provide statistical foundations for our method, develop appropriate optimization procedures, provide didactic examples, and prove several important properties.

Date of online publication

01.02.2023

Pages (from - to)

249 - 275

DOI

10.1016/j.ins.2023.01.119

URL

https://www.sciencedirect.com/science/article/pii/S0020025523001329?via%3Dihub

Ministry points / journal

200

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.