DocumentCode
730725
Title
A language space representation for speech recognition
Author
Ragni, A. ; Gales, M.J.F. ; Knill, K.M.
Author_Institution
Dept. of Eng., Univ. of Cambridge, Cambridge, UK
fYear
2015
fDate
19-24 April 2015
Firstpage
4634
Lastpage
4638
Abstract
The number of languages for which speech recognition systems have become available is growing each year. This paper proposes to view languages as points in some rich space, termed language space, where bases are eigen-languages and a particular selection of the projection determines points. Such an approach could not only reduce development costs for each new language but also provide automatic means for language analysis. For the initial proof of the concept, this paper adopts cluster adaptive training (CAT) known for inducing similar spaces for speaker adaptation needs. The CAT approach used in this paper builds on the previous work for language adaptation in speech synthesis and extends it to Gaussian mixture modelling more appropriate for speech recognition. Experiments conducted on IARPA Babel program languages show that such language space representations can outperform language independent models and discover closely related languages in an automatic way.
Keywords
Gaussian processes; mixture models; natural language processing; pattern clustering; signal representation; speech recognition; speech synthesis; CAT; Gaussian mixture modelling; IARPA Babel program language; cluster adaptive training; development cost reduction; language analysis; language independent model; language space representation; projection determine point selection; speech recognition; speech synthesis; Adaptation models; Buildings; Indexes; Peer-to-peer computing; Speech; Training; Transforms; babel; cluster adaptive training; language space;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178849
Filename
7178849
Link To Document