DocumentCode :
2560904
Title :
Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition
Author :
Yio, Yong-Chang ; Liang, Min-Siong ; Chiang, Yuang-chin ; Lyu, Ren-Yuan
Author_Institution :
Inst. of Stat., Nat. Tsing Hua Univ., Hsin Chu, Taiwan
fYear :
2005
fDate :
28-30 May 2005
Firstpage :
194
Lastpage :
197
Abstract :
In this paper, we compare the performance of a speech recognition system trained with two speech corpora. We select two set of words such that they covered all the cross-syllable bi-phones and tri-phones, and are called phonetically biphone-rich and triphone-rich respectively. It is required about 10 times more words than that of cross-syllable biphones to cover all the cross-syllable triphones. To facilitate fair comparison, the biphone-rich corpus is thus consisted often sets of words that each covers all the cross-syllable biphones. With those words as data sheets, a male Taiwanese speaker recorded all the words as microphone speech. The resulting speech corpora, about 100 minutes for each set, are used to train for the acoustic models. Although both perform quite well in tasks with recognition networks of linear net and free syllable net, the triphone-rich corpus does not show much advantages over the biphone-rich corpus.
Keywords :
speech recognition; automatic speech recognition; cross-syllable biphones corpus; cross-syllable triphones corpus; speech corpora; speech recognition system; Automatic speech recognition; Chromium; Computer science; Loudspeakers; Microphones; Natural languages; Speech recognition; Speech synthesis; Statistics; Tongue;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cellular Neural Networks and Their Applications, 2005 9th International Workshop on
Print_ISBN :
0-7803-9185-3
Type :
conf
DOI :
10.1109/CNNA.2005.1543194
Filename :
1543194
Link To Document :
بازگشت