Title :
Further investigations on EMG-to-speech conversion
Author :
Janke, Matthias ; Wand, Michael ; Nakamura, Keigo ; Schultz, Tanja
Author_Institution :
Cognitive Syst. Lab., Karlsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
Abstract :
Our study deals with a Silent Speech Interface based on mapping surface electromyographic (EMG) signals to speech waveforms. Electromyographic signals recorded from the facial muscles capture the activity of the human articulatory apparatus and therefore allow to retrace speech, even when no audible signal is produced. The mapping of EMG signals to speech is done via a Gaussian mixture model (GMM)-based conversion technique. In this paper, we follow the lead of EMG-based speech-to-text systems and apply two major recent technological advances to our system, namely, we consider session-independent systems, which are robust against electrode repositioning, and we show that mapping the EMG signal to whispered speech creates a better speech signal than a mapping to normally spoken speech. We objectively evaluate the performance of our systems using a spectral distortion measure.
Keywords :
electromyography; medical signal processing; speech synthesis; user interfaces; EMG signal mapping; EMG-based speech-to-text systems; EMG-to-speech conversion; electrode repositioning; facial muscles; human articulatory apparatus; session-independent systems; silent speech interface; spectral distortion measure; speech signal; speech waveforms; surface electromyographic signal mapping; Cepstral analysis; Electrodes; Electromyography; Speech; Speech recognition; Training; Vectors; Electromyography; Silent Speech; Speech Synthesis; Voice Conversion;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287892