Reconciling inconsistent data in probabilistic XML data integration

Tadeusz Pankowski

doi:10.1007/978-3-540-70504-8_8

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Reconciling inconsistent data in probabilistic XML data integration

Zgłoś uwagę

Rozdział

Pobierz BibTeX

Tytuł

Reconciling inconsistent data in probabilistic XML data integration

Autorzy

Tadeusz Pankowski (WE) ^{[ 1 ][ P ]}

^{[ 1 ]} Instytut Automatyki i Inżynierii Informatycznej, Wydział Elektryczny, Politechnika Poznańska | ^{[ P ]} pracownik

Rok publikacji

2008

Typ rozdziału

referat

Język publikacji

angielski

Streszczenie

EN The problem of dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typically, in order to remove inconsistencies, i.e. conflicts between data, data cleaning (or repairing) procedures are applied. In this paper, we present a probabilistic XML data integration setting. A probability is assigned to each data source and its probability models the reliability level of the data source. In this way, an answer (a tuple of values of XML trees) has a probability assigned to it. The problem is how to compute such probability, especially when the same answer is produced by many sources. We consider three semantics for computing such probabilistic answers: by-peer, by-sequence, and by-subtree semantics. The probabilistic answers can be used for resolving a class of inconsistencies violating XML functional dependencies defined over the target schema. Having a probability distribution over a set of conflicting answers, we can choose the one for which the probability of being correct is the highest.

Strony (od-do)

75 - 86

DOI

10.1007/978-3-540-70504-8_8

URL

https://link.springer.com/chapter/10.1007/978-3-540-70504-8_8

Książka

Sharing data, information and knowledge : 25th British National Conference on Databases, BNCOD 25, Cardiff, UK, July 2008. Proceedings

Zaprezentowany na

25th British National Conference on Databases, BNCOD 25, 7-10.06.2008, Cardif, United Kingdom

System tworzony przez Politechnikę Poznańską oraz Poznańskie Centrum Superkomputerowo-Sieciowe

Zaloguj się przez eKonto, aby dodać do SIN