Regional Information Center for Science and Technology (RICEST) - A Multi-Views Multi-Learners Approach Towards Dysarthric Speech Recognition Using Multi-Nets Artificial Neural Networks

DocumentCode :

26951

Title :

A Multi-Views Multi-Learners Approach Towards Dysarthric Speech Recognition Using Multi-Nets Artificial Neural Networks

Author :

Shahamiri, Seyed Reza ; Binti Salim, Siti Salwah

Author_Institution :

Dept. of Software Eng., Univ. of Malaya, Kuala Lumpur, Malaysia

Volume :

Issue :

fYear :

2014

fDate :

Sept. 2014

Firstpage :

1053

Lastpage :

1063

Abstract :

Automatic speech recognition (ASR) can be very helpful for speakers with dysarthria, a neurological disability that impairs control of the motor speech articulators. Although a few attempts have been made to apply ASR technologies to speakers with dysarthria, previous studies show that such ASR systems have not attained an adequate level of performance. In this study, a dysarthric multi-networks speech recognizer (DM-NSR) model is presented, using a realization of the multi-views multi-learners approach called multi-nets artificial neural networks, which tolerates the variability of dysarthric speech. In particular, the DM-NSR model employs several ANNs (as learners) to approximate the likelihood of ASR vocabulary words and to deal with the complexity of dysarthric speech. The proposed DM-NSR approach is presented under both speaker-dependent and speaker-independent paradigms. To highlight the performance of the proposed model over legacy models, multi-views single-learner models of the DM-NSRs were also provided and their efficiencies were compared in detail. Moreover, a comparison between the prominent dysarthric ASR methods and the proposed one is provided. The results show that the DM-NSR improved the recognition rate by up to 24.67% and reduced the error rate by up to 8.63% relative to the reference model.

Keywords :

medical signal processing; neural nets; neurophysiology; speech; speech processing; speech recognition; automatic speech recognition; dysarthria; dysarthric multinetworks speech recognizer model; motor speech articulators; multinets artificial neural networks; neurological disability; Accuracy; Artificial neural networks; Databases; Hidden Markov models; Speech; Speech recognition; Vocabulary; Dysarthria; dysarthric speech recognition; multi-nets artificial neural networks; multi-views multi-learners (MVML)

fLanguage :

English

Journal_Title :

Neural Systems and Rehabilitation Engineering, IEEE Transactions on

Publisher :

IEEE

ISSN :

1534-4320

Type :

jour

DOI :

10.1109/TNSRE.2014.2309336

Filename :

6762967

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=26951