DocumentCode
3530356
Title
On the phonetic information in ultrasonic microphone signals
Author
Livescu, Karen ; Zhu, Bo ; Glass, James
Author_Institution
Toyota Technol. Inst. at Chicago, Chicago, IL
fYear
2009
fDate
19-24 April 2009
Firstpage
4621
Lastpage
4624
Abstract
We study the phonetic information in the signal from an ultrasonic ldquomicrophonerdquo, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-shifted signal. This can be used in addition to audio to improve automatic speech recognition. This work is an effort to better understand the ultrasonic signal, and potentially to determine a set of natural sub-word units. We present classification and clustering experiments on CVC and VCV sequences in speaker-dependent and multi-speaker settings. Using a set of ultrasonic spectral features and diagonal Gaussian models, it is possible to distinguish all consonants and most vowels. When clustering the confusion data, the consonant clusters mostly correspond to places and manners of articulation; the vowel data roughly clusters into high, low, and rounded vowels.
Keywords
Gaussian processes; signal processing; speaker recognition; speech processing; Doppler-shifted signal; automatic speech recognition; diagonal Gaussian models; multispeaker settings; phonetic information; speaker-dependent settings; ultrasonic microphone signals; Artificial intelligence; Band pass filters; Computer science; Frequency; Glass; Hardware; Loudspeakers; Microphones; Spectrogram; Speech recognition; Speech recognition; multimodal; ultrasonic;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960660
Filename
4960660
Link To Document