Conferences in Research and Practice in Information Technology
  

Online Version - Last Updated - 20 Jan 2012

 

 
Home
 

 
Procedures and Resources for Authors

 
Information and Resources for Volume Editors
 

 
Orders and Subscriptions
 

 
Published Articles

 
Upcoming Volumes
 

 
Contact Us
 

 
Useful External Links
 

 
CRPIT Site Search
 
    

Validating Synthetic Health Datasets for Longitudinal Clustering

Pour, S.G., Maeder, A. and Jorm, L.

    Clustering methods partition datasets into subgroups with some homogeneous properties, with information about the number and particular characteristics of each subgroup unknown a priori. The problem of predicting the number of clusters and quality of each cluster might be overcome by using cluster validation methods. This paper presents such an approach incorporating quantitative methods for comparison between original and synthetic versions of longitudinal health datasets. The use of the methods is demonstrated by using two different clustering algorithms, K-means and Latent Class Analysis, to perform clustering on synthetic data derived from the 45 and Up Study baseline data, from NSW in Australia.
Cite as: Pour, S.G., Maeder, A. and Jorm, L. (2013). Validating Synthetic Health Datasets for Longitudinal Clustering. In Proc. Health Informatics and Knowledge Management 2013 (HIKM 2013) Adelaide, Australia. CRPIT, 142. Gray, K. and Koronios, A. Eds., ACS. 15-20
pdf (from crpit.com) pdf (local if available) BibTeX EndNote GS