مرکز منطقه ای اطلاع رساني علوم و فناوري - Enhancement of EMG-based Thai number words classification using frame-based time domain features with stacking filter

DocumentCode :

117921

Title :

Enhancement of EMG-based Thai number words classification using frame-based time domain features with stacking filter

Author :

Srisuwan, Niyawadee ; Wand, Michael ; Janke, Matthias ; Phukpattaranont, Pornchai ; Schultz, Tanja ; Limsakul, Chusak

Author_Institution :

Dept. of Electr. Eng., Prince of Songkla Univ., Songkhla, Thailand

fYear :

2014

fDate :

9-12 Dec. 2014

Firstpage :

Lastpage :

Abstract :

In order to overcome a problem existing in a classical automatic speech recognition (e.g. ambient noise and loss of privacy), Electromyography (EMG) from speech production muscles was used in place of a human speech signal. We aim to investigate the EMG speech recognition based on Thai language. The earlier work, we used five channels of the EMG from the facial and neck muscles to classify 11 Thai number words based on Neural Network Classification. 15 features in time domain and frequency domain were employed for feature extraction. We obtained an average accuracy rate of 89.45% for audible speech and 78.55% for silent speech. However, it needs to be enhanced to get the best result. This paper proposes to improve an accuracy rate of EMG-based Thai number words classification. The ten subjects uttered 11 words in both an audible and a silent speech while five channels of the EMG signal were captured. Frame-based time domain features with a stacking filter was performed for feature extraction stage. After that, LDA was used to lessen a dimension of the feature vector. Hidden Markov Model (HMM) was employed in classification stage. The results show that using above techniques of feature extraction, feature dimensionality reduction and classification can improve an average accuracy rate by 3% absolute for audible speech when were compared to earlier work. We achieved an average classification rate of 92.45% and 75.73% for audible and silent speech respectively.

Keywords :

electromyography; feature extraction; medical signal processing; neural nets; speech; speech recognition; time-domain analysis; EMG speech recognition; EMG-based Thai number words classification enhancement; Thai language; audible speech; classical automatic speech recognition; electromyography; facial muscle; feature dimensionality classification; feature dimensionality reduction; feature extraction; feature vector; frame-based time domain feature; frequency domain; hidden Markov model; human speech signal; neck muscle; neural network classification; silent speech; speech production muscle; stacking filter; Accuracy; Decision support systems; Electromyography; Feature extraction; Speech; Speech recognition; Stacking;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Asia-Pacific Signal and Information Processing Association, 2014 Annual Summit and Conference (APSIPA)

Conference_Location :

Siem Reap

Type :

conf

DOI :

10.1109/APSIPA.2014.7041549

Filename :

7041549

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=117921