Still Open Problems in Data Warehouse and Data Lake Research: extended abstract
[ 1 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ P ] staff member
2021
chapter in a scientific monograph / conference paper
English
- data integration
- data warehouse
- data lake
- big data
- extract transform load
- data processing workflow
- data processing pipeline
- data quality
- ETL optimization
- data source evolution
- metadata
EN In recent years we have observed a widespread emergence of new data sources, especially social media of all types and IoT devices, which produce huge volumes of data whose content ranges from fully structured to totally unstructured. All these types of data are commonly referred to as big data. They are typically described by their three most important characteristics, called the 3Vs [1]: an extremely large volume, a variety of data models and structures (data representations), and a high velocity at which the data are generated. We argue that of these three Vs, the most challenging is variety [2]. Such data need to be integrated and transformed into a common representation suitable for analysis, in a similar manner as traditional (mainly table-like) data.
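The integration challenge described above — mapping heterogeneous inputs onto one common representation — can be sketched minimally. In this illustrative example (all field names and the target schema are hypothetical, not taken from the abstract), a semi-structured JSON record and a structured CSV row with different schemas are normalized into the same record shape:

```python
import csv
import io
import json

def normalize(record: dict) -> dict:
    """Map a raw record with a source-specific schema onto a common shape.

    The field names ("id"/"user_id", "text"/"message") are illustrative
    stand-ins for schema variety across sources.
    """
    return {
        "id": str(record.get("id") or record.get("user_id", "")),
        "text": record.get("text") or record.get("message", ""),
    }

# Semi-structured source: one JSON line from a hypothetical social-media feed.
json_row = json.loads('{"user_id": 7, "message": "hello"}')

# Structured source: a table-like CSV row with a different schema.
csv_rows = list(csv.DictReader(io.StringIO("id,text\n42,world\n")))

# Both sources end up in the common representation, ready for joint analysis.
unified = [normalize(json_row)] + [normalize(r) for r in csv_rows]
```

A real pipeline would of course face far messier schema drift and unstructured content; the point is only that variety forces such a mapping layer before analysis.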