Direct conversion from facial myoelectric signals to speech using Deep Neural Networks

Author

Lorenz Diener;Matthias Janke;Tanja Schultz

Author_Institution

Cognitive Systems Lab, Karlsruhe Institute of Technology (KIT), Germany

fYear

2015

fDate

7/1/2015 12:00:00 AM

Firstpage

Lastpage

Abstract

This paper presents our first results using Deep Neural Networks for surface electromyographic (EMG) speech synthesis. The proposed approach enables a direct mapping from EMG signals, captured from articulatory muscle movements, to the acoustic speech signal. Features are extracted from multiple EMG channels and fed into a feed-forward neural network that maps them to the target acoustic speech output. We show that this approach can generate speech output from the input EMG signal and compare the results to a prior mapping technique based on Gaussian mixture models. The comparison is conducted via objective Mel-Cepstral distortion scores and subjective listening test evaluations. It shows that the proposed Deep Neural Network approach gives substantial improvements on both evaluation criteria.
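
For readers unfamiliar with the objective measure named in the abstract, the following is a minimal sketch of one commonly used definition of Mel-Cepstral distortion (MCD) in dB, averaged over frames. It is an illustration only: the abstract does not state which exact variant the authors used, and the function name `mel_cepstral_distortion`, the frame layout (lists of coefficients per frame), and the choice to skip the 0th (energy) coefficient are assumptions made here.

```python
import math


def mel_cepstral_distortion(reference, estimate):
    """Mean MCD in dB between two time-aligned sequences of mel-cepstral frames.

    Assumptions (not taken from the paper): each frame is a list of
    coefficients, frames are already time-aligned, and the 0th (energy)
    coefficient is excluded from the distance.
    """
    if not reference or len(reference) != len(estimate):
        raise ValueError("sequences must be non-empty and of equal length")

    scale = 10.0 / math.log(10.0)  # converts the distance to decibels
    total = 0.0
    for ref_frame, est_frame in zip(reference, estimate):
        if len(ref_frame) != len(est_frame):
            raise ValueError("frames must have the same number of coefficients")
        # Squared Euclidean distance over coefficients 1..D (energy skipped).
        squared = sum((r - e) ** 2 for r, e in zip(ref_frame[1:], est_frame[1:]))
        total += scale * math.sqrt(2.0 * squared)
    return total / len(reference)
```

Lower values indicate that the synthesized mel-cepstra are closer to the reference; identical sequences give 0 dB under this definition.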

Keywords

Electromyography

Publisher

IEEE

Conference_Title

Neural Networks (IJCNN), 2015 International Joint Conference on

Electronic_ISBN

2161-4407

Type

conf

DOI

10.1109/IJCNN.2015.7280404

Filename

7280404

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3661096