Chapter

Title

Dominance-based rough set approach to knowledge discovery (I) : general perspective

Authors

[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] employee

Year of publication

2004

Chapter type

chapter in monograph

Publication language

English

Abstract

This chapter is devoted to knowledge discovery from data, taking into account prior knowledge about preference semantics in patterns to be discovered. The data concern a set of objects (situations, states, examples) described by a set of attributes (properties, features, characteristics). The attributes are, in general, divided into condition and decision attributes, corresponding to input and output descriptions of an object. The set of objects is partitioned by decision attributes into decision classes. A pattern discovered from the data has a symbolic form of decision rule or decision tree. In many practical problems, some condition attributes are defined on preference-ordered scales, and the decision classes are also preference-ordered. The known methods of knowledge discovery unfortunately ignore this preference information, risking drawing wrong patterns. To deal with preference-ordered data, we propose to use a new approach called Dominance-based Rough Set Approach (DRSA). Given a set of objects described by at least one condition attribute with preference-ordered scale and partitioned into preference-ordered classes, the new rough set approach is able to approximate this partition by means of dominance relations. The rough approximation of this partition is a starting point for induction of “if..., then...” decision rules. The syntax of these rules is adapted to represent preference orders. The DRSA analyzes only facts present in data, and possible inconsistencies are identified. It preserves the concept of granular computing; however, the granules are dominance cones in evaluation space, and not bounded sets. It is also concordant with the paradigm of computing with words, as it exploits the ordinal, and not necessarily the cardinal, character of data. The basic DRSA and its major extensions are presented in two consecutive parts in this book.
In the present part, we give a general perspective of DRSA, explaining its use in the context of multicriteria classification, choice, and ranking. Moreover, we present a variant of DRSA that handles missing values in data sets.
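
The approximation of preference-ordered classes by dominance cones, as described in the abstract, can be illustrated with a minimal Python sketch. It is not taken from the chapter: the toy data, the function names, and the assumption that every criterion is gain-type (higher is better) are illustrative only. The definitions follow the commonly stated DRSA formulation, in which an object belongs to the lower approximation of an upward union of classes if every object dominating it is also in that union, and to the upper approximation if it dominates at least one member of the union.

```python
from typing import Dict, Set, Tuple

# Hypothetical toy data: object -> (evaluations on gain-type criteria, class).
# Object "b" dominates "c" but sits in a worse class, which is an inconsistency.
DATA: Dict[str, Tuple[Tuple[int, ...], int]] = {
    "a": ((3, 3), 3),
    "b": ((2, 2), 1),
    "c": ((1, 2), 2),
    "d": ((1, 1), 1),
}


def dominates(x: str, y: str) -> bool:
    """True if x is at least as good as y on every criterion."""
    return all(xv >= yv for xv, yv in zip(DATA[x][0], DATA[y][0]))


def dominating_set(x: str) -> Set[str]:
    """Positive dominance cone: all objects that dominate x."""
    return {y for y in DATA if dominates(y, x)}


def dominated_set(x: str) -> Set[str]:
    """Negative dominance cone: all objects that x dominates."""
    return {y for y in DATA if dominates(x, y)}


def upward_union(t: int) -> Set[str]:
    """Objects assigned to class t or better."""
    return {x for x, (_, cls) in DATA.items() if cls >= t}


def lower_approximation(t: int) -> Set[str]:
    """Objects that certainly belong to 'class t or better'."""
    union = upward_union(t)
    return {x for x in DATA if dominating_set(x) <= union}


def upper_approximation(t: int) -> Set[str]:
    """Objects that possibly belong to 'class t or better'."""
    union = upward_union(t)
    return {x for x in DATA if dominated_set(x) & union}


lower = lower_approximation(2)
upper = upper_approximation(2)
boundary = upper - lower  # objects whose assignment is inconsistent

print(sorted(lower), sorted(upper), sorted(boundary))
```

In this toy example the boundary contains "b" and "c", the pair that violates the dominance principle. Only "a" can be assigned to "class 2 or better" with certainty, so only "a" would support a certain "if..., then..." rule for that union.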

Pages (from - to)

513 - 552

DOI

10.1007/978-3-662-07952-2_20

URL

https://link.springer.com/chapter/10.1007/978-3-662-07952-2_20

Book

Intelligent Technologies for Information Analysis
