DocumentCode :
179046
Title :
Unsupervised submodular subset selection for speech data
Author :
Kai Wei ; Yuzong Liu ; Kirchhoff, Katrin ; Bilmes, Jeff
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
4107
Lastpage :
4111
Abstract :
We conduct a comparative study on selecting subsets of acoustic data for training phone recognizers. The data selection problem is approached as a constrained submodular optimization problem. Previous applications of this approach required transcriptions or acoustic models trained in a supervised way. In this paper we develop and evaluate a novel and entirely unsupervised approach, and apply it to TIMIT data. Results show that our method consistently outperforms a number of baseline methods while being computationally very efficient and requiring no labeling.
Keywords :
optimisation; speech recognition; unsupervised learning; TIMIT data; acoustic data; acoustic models; constrained submodular optimization problem; data selection problem; speech data; training phone recognizers; unsupervised approach; unsupervised submodular subset selection; Acoustics; Hidden Markov models; Speech; Speech processing; Speech recognition; Training; Training data; automatic speech recognition; machine learning; speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854374
Filename :
6854374
Link To Document :
بازگشت