Title :
On the phonetic information in ultrasonic microphone signals
Author :
Livescu, Karen ; Zhu, Bo ; Glass, James
Author_Institution :
Toyota Technol. Inst. at Chicago, Chicago, IL
Abstract :
We study the phonetic information in the signal from an ultrasonic ldquomicrophonerdquo, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-shifted signal. This can be used in addition to audio to improve automatic speech recognition. This work is an effort to better understand the ultrasonic signal, and potentially to determine a set of natural sub-word units. We present classification and clustering experiments on CVC and VCV sequences in speaker-dependent and multi-speaker settings. Using a set of ultrasonic spectral features and diagonal Gaussian models, it is possible to distinguish all consonants and most vowels. When clustering the confusion data, the consonant clusters mostly correspond to places and manners of articulation; the vowel data roughly clusters into high, low, and rounded vowels.
Keywords :
Gaussian processes; signal processing; speaker recognition; speech processing; Doppler-shifted signal; automatic speech recognition; diagonal Gaussian models; multispeaker settings; phonetic information; speaker-dependent settings; ultrasonic microphone signals; Artificial intelligence; Band pass filters; Computer science; Frequency; Glass; Hardware; Loudspeakers; Microphones; Spectrogram; Speech recognition; Speech recognition; multimodal; ultrasonic;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960660