DocumentCode
2178582
Title
Uncertainty management in model-based imputation for missing data
Author
Azarkhail, M. ; Woytowitz, P.
Author_Institution
Lam Res. Corp., San Jose, CA, USA
fYear
2013
fDate
28-31 Jan. 2013
Firstpage
1
Lastpage
7
Abstract
In semiconductor industry like many other applications, the failure data is rarely available in complete form and is often flawed by missing records. When the missing process is random, the missing data can be safely ignored without major conceptual impact on the statistics of the experiment. The potential flaw with ignoring the missing data, however, is that the remaining complete observations may not carry enough statistical power, due to small sample size of the remaining population of complete failures. In some cases, the modeler may be able to describe the missing records as a function of other independent information available. Imputation of missing records from such empirical model is a typical way by which the lateral information about missing records can be leveraged. These models often carry considerable uncertainty that needs to be effectively incorporated into the data analysis process, in order to avoid false overconfidence in estimated reliability measures. In this article the uncertainty management during the model-based imputation process for missing data is discussed. The case study consists of Weibull analysis for a reliability critical component when a simple linear model is available for the missing records. Ignoring the missing records will result in relatively large uncertainty over the calculated reliability measures. The single imputation from correlation model will mark the other end of the spectrum due to an artificial boost in the statistical significance of the results as expected. The Multiple imputations and Bayesian likelihood averaging methods seem to be the most viable options when it comes to the uncertainty management in this problem. There seems to be some differences, however, that will be explained in detail.
Keywords
Weibull distribution; data analysis; maximum likelihood estimation; production engineering computing; reliability; semiconductor industry; Bayesian likelihood averaging method; Weibull analysis; data analysis process; empirical model; failure data; missing data; missing record; model-based data imputation; multiple imputation; reliability critical component; reliability measure; semiconductor industry; statistical power; uncertainty management; Bayes methods; Correlation; Data models; Entropy; Equations; Mathematical model; Uncertainty; Bayesian Likelihood Averaging; Life Data Analysis; Missing Data; Single and Multiple Imputation;
fLanguage
English
Publisher
ieee
Conference_Titel
Reliability and Maintainability Symposium (RAMS), 2013 Proceedings - Annual
Conference_Location
Orlando, FL
ISSN
0149-144X
Print_ISBN
978-1-4673-4709-9
Type
conf
DOI
10.1109/RAMS.2013.6517697
Filename
6517697
Link To Document