DocumentCode :
3318983
Title :
Comparative experiments to evaluate the use of syllables for large-vocabulary automatic speech recognition
Author :
Tolba, Hesham ; Azmi, Mohamed
Author_Institution :
Electr. Eng. Dept., Taibah Univ., Al Madinah, Saudi Arabia
fYear :
2009
fDate :
8-11 Aug. 2009
Firstpage :
250
Lastpage :
253
Abstract :
This paper motivates the use of syllables to enhance the performance of automatic speech recognition (ASR) systems when dealing with large-vocabulary speech. Arabic and English are considered in our paper to test the proposed approach. The Arabic database consists of sentences selected from different Arabic broadcast news, whereas for English speech, TIMIT database had been used to test our approach. Comparative experiments have indicated that the use of syllables as acoustic units for the recognition of both languages leads to an improvement in the recognition performance of HMM-based ASR systems. The Hidden Markov Model Toolkit (HTK) was used throughout our experiments. A series of experiments on speaker-independent continuous-speech recognition have been carried out using both databases. Using such an approach, experiments show that for Arabic database, the recognition rate using syllables outperforms the recognition rate obtained using monophones and triphones by 15.75% and 2.64%, respectively. On the other hand, for TIMIT database, the recognition rate using syllables outperforms the recognition rate using monophones and triphones by 40.08% and 19.74%, respectively.
Keywords :
database management systems; hidden Markov models; speech recognition; Arabic broadcast news; Arabic database; English speech; HMM-based ASR systems; TIMIT database; hidden Markov model toolkit; large-vocabulary automatic speech recognition; monophones; speaker-independent continuous-speech recognition; syllables; triphones; Acoustic testing; Automatic speech recognition; Broadcasting; Databases; Dictionaries; Hidden Markov models; Natural languages; Speech analysis; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4519-6
Electronic_ISBN :
978-1-4244-4520-2
Type :
conf
DOI :
10.1109/ICCSIT.2009.5234953
Filename :
5234953
Link To Document :
بازگشت