Title :
Limited resource speech recognition for Nigerian English
Author :
Sulyman Amuda;Hynek Bo?il;Abhijeet Sangwan;John H. L. Hansen
Author_Institution :
Department of Electrical Engineering, University of Ilorin, Nigeria
Abstract :
In this study, we introduce the UISpeech corpus which consists of Nigerian-Accented English audio-visual data. The corpus captures the linguistic diversity of Nigeria with data collected from native-speakers of Yoruba, Hausa, Igbo, Tiv, Funali and others. The UIS-peech corpus comprises isolated word recordings and read speech utterances. The new corpus is intended to provide a unique opportunity to apply and expand speech processing techniques to a limited resource language. Acoustic-phonetic differences between American English (AE) and Nigerian English (NE) are studied in terms of pronunciation variations, vowel locations in the formant space, and distances between AE-trained acoustic models and models adapted to NE. A strong impact of the AE-NE acoustic mismatch on automatic speech recognition (ASR) is observed. A combination of model adaptation and extension of AE lexicon for newly established NE pronunciation variants is shown to substantially improve performance of the AE-trained ASR system in the new NE task. This study represents the first step towards incorporating speech technology in Nigerian English.
Keywords :
"Speech recognition","Natural languages","Automatic speech recognition","Loudspeakers","Speech processing","Adaptation model","Speech analysis","Rhythm","Robustness","Audio recording"
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
2379-190X
DOI :
10.1109/ICASSP.2010.5495036