Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

On Case-Based Reasoning for ETL Process Repairs: Making Cases Fine-Grained

Authors

[ 1 ] Politechnika Poznańska | [ 2 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ D ] phd student | [ P ] employee

Scientific discipline (Law 2.0)

[2.3] Information and communication technology

Year of publication

2020

Chapter type

chapter in monograph / paper

Publication language

english

Keywords
EN
  • data source evolution
  • ETL process repair
  • case-based reasoning
Abstract

EN Data sources (DSs) being integrated in a data warehouse frequently change their structures. As a consequence, in many cases, an already deployed ETL process stops its execution, generating errors. Since the number of deployed ETL processes may reach dozens of thousands and structural changes in DSs are frequent, being able to (semi-)automatically repair an ETL process after DS changes, would decrease ETL maintenance costs. In our approach, we developed the E-ETL framework, for ETL process repairs. In E-ETL, an ETL process is semi-automatically or automatically (depending on a case) repaired, so that it works with the changed DS. E-ETL supports two different repair methods: (1) user defined rules, (2) and Case-Based Reasoning (CBR). Having experimented with CBR, we learned that large cases do not frequently fit a given DS change, even though they include elements that could be applied to repair a given ETL process, and vice-versa - more complex DS changes cannot be handled by small cases. To solve this problem, in this paper, we contribute algorithms for decomposing detected structural changes in DSs. The purpose of the decomposition is to divide a set of detected structural DSs changes into smaller sets, to increase the probability of finding a suitable case by the CBR method.

Date of online publication

12.08.2020

Pages (from - to)

235 - 249

DOI

10.1007/978-3-030-57672-1_18

URL

https://link.springer.com/chapter/10.1007/978-3-030-57672-1_18

Book

Databases and Information Systems : 14th International Baltic Conference, DB&IS 2020, Tallinn, Estonia, June 16–19, 2020 : Proceedings

Presented on

14th International Baltic Conference on Databases and Information Systems DB&IS 2020, 16-19.06.2020, Tallin, Estonia

Ministry points / chapter

20

Ministry points / conference (CORE)

70

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.