Incremental data mining using concurrent online refresh of materialized data mining views

Mikołaj Morzy; Tadeusz Morzy; Marek Wojciechowski; Maciej Zakrzewicz

doi:10.1007/11546849_29

Chapter

Title

Incremental data mining using concurrent online refresh of materialized data mining views

Authors

Mikołaj Morzy [1][P]
Tadeusz Morzy [1][P]
Marek Wojciechowski [1][P]
Maciej Zakrzewicz [1][P]

[1] Institute of Computing Science (II), Faculty of Computing Science and Management, Poznań University of Technology | [P] employee

Publication year

2005

Chapter type

conference paper

Publication language

English

Abstract

Data mining is an iterative process. Users issue a series of similar data mining queries, in each consecutive run slightly modifying either the definition of the mined dataset or the parameters of the mining algorithm. This model of processing is most suitable for incremental mining algorithms, which reuse the results of previous queries when answering a given query. Incremental mining algorithms require the results of previous queries to be available. One way to preserve those results is to use materialized data mining views, which store the mined patterns and refresh them as the underlying data change. Data mining and knowledge discovery often take place in a data warehouse environment, where many relatively small materialized data mining views can be defined over the data warehouse. Refreshing each materialized view separately can be expensive if the refresh process has to rediscover patterns in the original database. In this paper we present a novel approach to the materialized data mining view refresh process. We show that the concurrent online refresh of a set of materialized data mining views is more efficient than the sequential refresh of individual views. We present a framework for integrating the data warehouse refresh process with the maintenance of materialized data mining views. Finally, we demonstrate the feasibility of our approach by conducting several experiments on synthetic data sets.
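
The contrast the abstract draws between sequential and concurrent refresh can be illustrated with a small Python sketch. It is not the authors' algorithm: it assumes each view is simply a selection predicate plus a support threshold, recomputes item and item-pair counts from scratch, and measures cost only as the number of full scans of the source table. All names (MiningView, ScanCountingTable, refresh_sequentially, refresh_concurrently) are hypothetical and chosen for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field
from itertools import combinations
from typing import Callable, Iterable

# Hypothetical transaction layout: (transaction id, set of items).
Transaction = tuple[int, frozenset[str]]


@dataclass
class MiningView:
    """Hypothetical materialized data mining view: a selection
    predicate over transactions plus a minimum support count."""
    name: str
    selects: Callable[[Transaction], bool]
    min_support: int
    counts: Counter = field(default_factory=Counter)

    def absorb(self, tx: Transaction) -> None:
        # Count every 1-itemset and 2-itemset contained in the transaction.
        _, items = tx
        for size in (1, 2):
            for itemset in combinations(sorted(items), size):
                self.counts[itemset] += 1

    def patterns(self) -> dict:
        # Itemsets whose support reaches the view's threshold.
        return {s: c for s, c in self.counts.items() if c >= self.min_support}


class ScanCountingTable:
    """Source table stand-in that records how many full scans were made."""

    def __init__(self, rows: Iterable[Transaction]):
        self.rows = list(rows)
        self.scans = 0

    def scan(self):
        self.scans += 1
        yield from self.rows


def refresh_sequentially(table: ScanCountingTable, views: list) -> None:
    # One full scan of the source table per view.
    for view in views:
        view.counts.clear()
        for tx in table.scan():
            if view.selects(tx):
                view.absorb(tx)


def refresh_concurrently(table: ScanCountingTable, views: list) -> None:
    # A single shared scan; each transaction is routed to every
    # view whose predicate selects it.
    for view in views:
        view.counts.clear()
    for tx in table.scan():
        for view in views:
            if view.selects(tx):
                view.absorb(tx)
```

In this simplified setting both functions leave every view with the same patterns, but the sequential variant scans the table once per view while the concurrent variant scans it once in total, which is the kind of saving the abstract refers to.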

Pages (from-to)

295 - 304

DOI

10.1007/11546849_29

URL

https://link.springer.com/chapter/10.1007/11546849_29

Book

Data Warehousing and Knowledge Discovery : 7th International Conference, DaWaK 2005, Copenhagen, Denmark, August 22-26, 2005. Proceedings

Presented at

International Conference on Data Warehousing and Knowledge Discovery, 08.2005, Copenhagen, Denmark
