DocumentCode
431624
Title
Relating the acoustic space of vowels to the perceptual space in cochlear implant simulations
Author
Liu, Chuping ; Fu, Qian-Jie
Author_Institution
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Volume
3
fYear
2005
fDate
18-23 March 2005
Abstract
In automatic speech recognition, speech features are measured and used as reference templates for machine learning and recognition. The differences between these features can be used to calculate relative acoustic distances between phonemes. However, when speech signals are spectrally degraded, as in electric hearing with cochlear implants, it is unclear whether these acoustic distances can predict speech recognition performance. The present study measured acoustic distances between spectrally degraded vowel tokens and investigated the relation between acoustic vowel space and perceptual vowel space. After processing vowel tokens using a cochlear implant simulation, the Mel-frequency cepstrum was extracted from each token; features were then time aligned and the weighted Euclidean distance was calculated between all tokens. Results demonstrate a significant correlation between vowel perception data and averaged acoustic distance between vowel tokens, for a variety of experimental conditions. These results suggest that acoustic distance between phonemes may well predict the recognition performance of spectrally degraded speech.
Keywords
acoustic signal processing; digital simulation; hearing aids; speech processing; speech recognition; Mel-frequency cepstrum; acoustic distance; acoustic distances; acoustic vowel space; automatic speech recognition; cochlear implant; machine learning; perceptual space; perceptual vowel space; spectrally degraded speech signals; spectrally degraded vowel tokens; vowel perception data; weighted Euclidean distance; Acoustic measurements; Auditory system; Automatic speech recognition; Cepstrum; Cochlear implants; Data mining; Degradation; Extraterrestrial measurements; Machine learning; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8874-7
Type
conf
DOI
10.1109/ICASSP.2005.1415639
Filename
1415639
Link To Document