Title :
Analysis of stopping criteria for the EM algorithm in the context of patient grouping according to length of stay
Author :
Abbi, R. ; El-Darzi, E. ; Vasilakis, C. ; Millard, P.
Author_Institution :
Dept. of Comput. Sci., Univ. of Westminster, London
Abstract :
The expectation maximisation (EM) algorithm is an iterative maximum likelihood procedure often used for estimating the parameters of a mixture model. Theoretically, increases in the likelihood function are guaranteed as the algorithm iteratively improves upon previously derived parameter estimates. The algorithm is considered to converge when all parameter estimates become stable and no further improvements can be made to the likelihood value. However, to reduce computational time, it is often common practice for the algorithm to be stopped before complete convergence using heuristic approaches. In this paper, we consider various stopping criteria and evaluate their effect on fitting Gaussian mixture models (GMMs) to patient length of stay (LOS) data. Although the GMM can be successfully fitted to positively skewed data such as LOS, the fitting procedure often requires many iterations of the EM algorithm. To our knowledge, no previous study has evaluated the effect of different stopping criteria on fitting GMMs to skewed distributions. Hence, the aim of this paper is to evaluate the effect of various stopping criteria in order to select and justify their use within a patient spell classification methodology. Results illustrate that criteria based on the difference in the likelihood value and on the GMM parameters may not always be a good indicator for stopping the algorithm. In fact we show that the values of the difference in the variance parameters should be used instead, as these parameters are the last to stabilise. In addition, we also specify threshold values for the other stopping criteria.
Keywords :
Gaussian processes; expectation-maximisation algorithm; health care; parameter estimation; patient care; pattern classification; EM algorithm; Gaussian mixture models; computational time; expectation maximisation algorithm; iterative maximum likelihood procedure; likelihood function; parameter estimation; patient grouping; patient length of stay data; patient spell classification methodology; stopping criteria analysis; Algorithm design and analysis; Computer science; Context modeling; Convergence; Intelligent systems; Iterative algorithms; Maximum likelihood detection; Maximum likelihood estimation; Medical services; Parameter estimation; GMM fitting; LOS data; patient classification; stopping criteria;
Conference_Titel :
Intelligent Systems, 2008. IS '08. 4th International IEEE Conference
Conference_Location :
Varna
Print_ISBN :
978-1-4244-1739-1
DOI :
10.1109/IS.2008.4670413