Title :
Machine learning-based anomaly detection for post-silicon bug diagnosis
Author :
DeOrio, Andrew ; Li, Qingkun ; Burgess, Matthew ; Bertacco, Valeria
Author_Institution :
Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, 48109, USA
Abstract :
The exponentially growing complexity of modern processors intensifies verification challenges. Traditional pre-silicon verification covers less and less of the design space, resulting in increasing post-silicon validation effort. A critical challenge is the manual debugging of intermittent failures on prototype chips, where multiple executions of a same test do not yield a consistent outcome. We leverage the power of machine learning to support automatic diagnosis of these difficult, inconsistent bugs. During post-silicon validation, lightweight hardware logs a compact measurement of observed signal activity over multiple executions of a same test: some may pass, somemay fail. Our novel algorithm applies anomaly detection techniques similar to those used to detect credit card fraud to identify the approximate cycle of a bug´s occurrence and a set of candidate root-cause signals. Compared against other state-of-the-art solutions in this space, our new approach can locate the time of a bug´s occurrence with nearly 4x better accuracy when applied to the complex OpenSPARC T2 design.
Keywords :
Clustering algorithms; Computer bugs; Detection algorithms; Hardware; Machine learning algorithms; Time measurement; Training data;
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013
Conference_Location :
Grenoble, France
Print_ISBN :
978-1-4673-5071-6
DOI :
10.7873/DATE.2013.112