DocumentCode
454586
Title
A New Data Selection Approach for Semi-Supervised Acoustic Modeling
Author
Zhang, Rong ; Rudnicky, Alexander I.
Author_Institution
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
Current approaches to semi-supervised incremental learning prefer to select unlabeled examples predicted with high confidence for model re-training. However, this strategy can degrade the classification performance rather than improve it. We present an analysis for the reasons of this phenomenon, showing that only relying on high confidence for data selection can lead to an erroneous estimate to the true distribution when the confidence annotator is highly correlated with the classifier in the information they use. We propose a new data selection approach to address this problem and apply it to a variety of applications, including machine learning and speech recognition. Encouraging improvements in recognition accuracy are observed in our experiments
Keywords
learning (artificial intelligence); speech recognition; classification performance; confidence annotator; data selection approach; machine learning; model re-training; semi-supervised acoustic modeling; semi-supervised incremental learning; speech recognition; Computer science; Data analysis; Degradation; Information analysis; Lattices; Machine learning; Predictive models; Semisupervised learning; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660047
Filename
1660047
Link To Document