DocumentCode
2789039
Title
Evidence for the strength of the relationship between Automatic Speech Recognition and Phoneme Alignment performance
Author
Baghai-Ravary, Ladan
Author_Institution
Phonetics Lab., Univ. of Oxford, Wellington, UK
fYear
2010
fDate
14-19 March 2010
Firstpage
5262
Lastpage
5265
Abstract
It might be naïvely assumed that the performance of an Automatic Speech Recognition (ASR) system, and that of an Automatic Speech-to-Phoneme Alignment (ASPA) system using the same acoustic-phonetic models, would be closely related. However many researchers believe this relationship to be, at best weak - but this belief has not previously been tested in an objective and quantitative manner. This paper quantifies the strength of the relationship using analysis of data without reference to manually defined alignment labels. By avoiding comparison with a set of reference labels, both the ASR and the ASPA systems can be considered equivalent, removing any bias due to any difference of “opinion” between the human labeller and the automatic system.
Keywords
data analysis; hidden Markov models; speech processing; speech recognition; acoustic-phonetic models; automatic speech recognition; data analysis; phoneme alignment performance; Acoustic testing; Automatic speech recognition; Data analysis; Hidden Markov models; Humans; Laboratories; Speech recognition; Speech synthesis; System performance; System testing; HMMs; acoustic-phonetic models; optimal performance; phoneme alignment; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5494977
Filename
5494977
Link To Document