Title :
Unsupervised submodular subset selection for speech data
Author :
Kai Wei ; Yuzong Liu ; Kirchhoff, Katrin ; Bilmes, Jeff
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
Abstract :
We conduct a comparative study on selecting subsets of acoustic data for training phone recognizers. The data selection problem is approached as a constrained submodular optimization problem. Previous applications of this approach required transcriptions or acoustic models trained in a supervised way. In this paper we develop and evaluate a novel and entirely unsupervised approach, and apply it to TIMIT data. Results show that our method consistently outperforms a number of baseline methods while being computationally very efficient and requiring no labeling.
Keywords :
optimisation; speech recognition; unsupervised learning; TIMIT data; acoustic data; acoustic models; constrained submodular optimization problem; data selection problem; speech data; training phone recognizers; unsupervised approach; unsupervised submodular subset selection; Acoustics; Hidden Markov models; Speech; Speech processing; Speech recognition; Training; Training data; automatic speech recognition; machine learning; speech processing;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854374