Probabilistic clustering of interval data
Probabilistic clustering of interval data
Date
2015
Authors
Paula Brito
Pedro Duarte Silva,APD
Dias,JG
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In this paper we address the problem of clustering interval data, adopting a model-based approach. To this purpose, parametric models for interval-valued variables are used which consider configurations for the variance-covariance matrix that take the nature of the interval data directly into account. Results, both on synthetic and empirical data, clearly show the well-founding of the proposed approach. The method succeeds in finding parsimonious heterocedastic models which is a critical feature in many applications. Furthermore, the analysis of the different data sets made clear the need to explicitly consider the intrinsic variability present in interval data.