Subpopulation discovery in epidemiological data with subspace clustering

Uli Niemann; Myra Spiliopoulou; Henry Völzke; Jens-Peter Kühn

doi:10.2478/fcds-2014-0015

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Subpopulation discovery in epidemiological data with subspace clustering

Zgłoś uwagę

Artykuł

Pobierz plik Pobierz BibTeX

Tytuł

Subpopulation discovery in epidemiological data with subspace clustering

Autorzy

Rok publikacji

2014

Opublikowano w

Foundations of Computing and Decision Sciences

Rocznik: 2014 | Tom: vol. 39 | Numer: no. 4

Typ artykułu

artykuł naukowy

Język publikacji

angielski

Streszczenie

EN A prerequisite of personalized medicine is the identification of groups of people who share specific risk factors towards an outcome. We investigate the potential of subspace clustering for finding such groups in epidemiological data. We propose a workflow that encompasses clusterability assessment before cluster discovery and quality assessment after learning the clusters. Epidemiological usually do not have a ground truth for the verification of clusters found in subspaces. Hence, we introduce quality assessment through juxtaposition of the learned models to “models-of-randomness”, i.e. models that do not reflect a true cluster structure. On the basis of this workflow, we select subspace clustering methods, compare and discuss their performance. We use a dataset with hepatic steatosis as outcome, but our findings apply on arbitrary epidemiological cohort data that have tenths of variables and exhibit class skew.

Strony (od-do)

271 - 300

DOI

10.2478/fcds-2014-0015

URL

https://www.sciendo.com/article/10.2478/fcds-2014-0015

Typ licencji

CC BY-NC-ND (uznanie autorstwa - użycie niekomercyjne - bez utworów zależnych)

Pełny tekst artykułu

Pobierz plik

Poziom dostępu do pełnego tekstu

publiczny