Title :
Ultrasonic sensing for robust speech recognition
Author :
Srinivasan, Sundararajan ; Raj, Bhiksha ; Ezzat, Tony
Author_Institution :
Dept. of Electr. & Comput. Eng., Mississippi State Univ., Starkville, MS, USA
Abstract :
In this paper, we present our work using ultrasonic sensing of speech for digit recognition. First, a set of spectral ultrasonic features are developed and tuned in order to achieve optimal performance for the digit recognition task. Using these features, we demonstrate an overall accuracy of 33.00% on a digit recognition task using HMMs with recordings from 6 speakers. The results indicate that ultrasonic sensing of speech is viable, but that further work is needed to achieve word accuracies that match those of audio. Finally, experimental results are presented which demonstrate that fusing information from ultrasound and audio sources show marginal improvements over audio-only performances.
Keywords :
hidden Markov models; speech recognition; ultrasonic measurement; HMM; audio sources; digit recognition; speech recognition; ultrasonic sensing; ultrasound sources; Audio recording; Laboratories; Loudspeakers; Natural languages; Reproducibility of results; Robustness; Speech analysis; Speech recognition; Transmitters; Ultrasonic imaging; digit recognition; fusion; ultrasound;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495039