DocumentCode :
26951
Title :
A Multi-Views Multi-Learners Approach Towards Dysarthric Speech Recognition Using Multi-Nets Artificial Neural Networks
Author :
Shahamiri, Seyed Reza ; Binti Salim, Siti Salwah
Author_Institution :
Dept. of Software Eng., Univ. of Malaya, Kuala Lumpur, Malaysia
Volume :
22
Issue :
5
fYear :
2014
fDate :
Sept. 2014
Firstpage :
1053
Lastpage :
1063
Abstract :
Automatic speech recognition (ASR) can be very helpful for speakers who suffer from dysarthria, a neurological disability that damages the control of motor speech articulators. Although a few attempts have been made to apply ASR technologies to sufferers of dysarthria, previous studies show that such ASR systems have not attained an adequate level of performance. In this study, a dysarthric multi-networks speech recognizer (DM-NSR) model is provided using a realization of multi-views multi-learners approach called multi-nets artificial neural networks, which tolerates variability of dysarthric speech. In particular, the DM-NSR model employs several ANNs (as learners) to approximate the likelihood of ASR vocabulary words and to deal with the complexity of dysarthric speech. The proposed DM-NSR approach was presented as both speaker-dependent and speaker-independent paradigms. In order to highlight the performance of the proposed model over legacy models, multi-views single-learner models of the DM-NSRs were also provided and their efficiencies were compared in detail. Moreover, a comparison among the prominent dysarthric ASR methods and the proposed one is provided. The results show that the DM-NSR recorded improved recognition rate by up to 24.67% and the error rate was reduced by up to 8.63% over the reference model.
Keywords :
medical signal processing; neural nets; neurophysiology; speech; speech processing; speech recognition; automatic speech recognition; dysarthria; dysarthric multinetworks speech recognizer model; motor speech articulators; multinets artificial neural networks; neurological disability; Accuracy; Artificial neural networks; Databases; Hidden Markov models; Speech; Speech recognition; Vocabulary; Dysarthria; dysarthric speech recognition; multi-nets artificial neural networks; multi-views multi-learners (MVML);
fLanguage :
English
Journal_Title :
Neural Systems and Rehabilitation Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1534-4320
Type :
jour
DOI :
10.1109/TNSRE.2014.2309336
Filename :
6762967
Link To Document :
بازگشت