Two-Step Heterogeneous Finite Mixture Model Clustering for Mining Healthcare Databases

Ahmed Najjar; Christian Gagne; Daniel Reinharz

doi:10.1109/ICDM.2015.70

Back

Conference proceeding

Two-Step Heterogeneous Finite Mixture Model Clustering for Mining Healthcare Databases

Ahmed Najjar, Christian Gagne and Daniel Reinharz

2015 IEEE International Conference on Data Mining, Vol.2016-, pp.931-936

01/11/2015

DOI: https://doi.org/10.1109/ICDM.2015.70

Abstract

Administrative health care databases

Clustering

Clustering algorithms

Databases

Finite mixture model

Hidden Markov models

Medical services

Mixed attributes

Mixture models

Multivalued categorical variables

Numerical models

Partitioning algorithms

Dealing with real-life databases often implies handling sets of heterogeneous variables. We are proposing in this paper a methodology for exploring and analyzing such databases, with an application in the specific domain of healthcare data analytics. We are thus proposing a two-step heterogeneous finite mixture model, with a first step involving a joint mixture of Gaussian and multinomial distribution to handle numerical (i.e., real and integer numbers) and categorical variables (i.e., discrete values), and a second step featuring a mixture of hidden Markov models to handle sequences of categorical values (e.g., series of events). This approach is evaluated on a real-world application, the clustering of administrative healthcare databases from Québec, with results illustrating the good performances of the proposed method.

Metrics

1 Record Views

Details

Title: Two-Step Heterogeneous Finite Mixture Model Clustering for Mining Healthcare Databases
Creators - without role: Ahmed Najjar - Université Laval
Christian Gagne - Université Laval
Daniel Reinharz - Université Laval
Publication Details: 2015 IEEE International Conference on Data Mining, Vol.2016-, pp.931-936
Publisher: IEEE
Identifiers: 9945541708331
Academic Unit: King Abdullah University of Science & Technology
Language: English
Resource Type: Conference proceeding