DocumentCode
2560904
Title
Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition
Author
Yio, Yong-Chang ; Liang, Min-Siong ; Chiang, Yuang-chin ; Lyu, Ren-Yuan
Author_Institution
Inst. of Stat., Nat. Tsing Hua Univ., Hsin Chu, Taiwan
fYear
2005
fDate
28-30 May 2005
Firstpage
194
Lastpage
197
Abstract
In this paper, we compare the performance of a speech recognition system trained with two speech corpora. We select two set of words such that they covered all the cross-syllable bi-phones and tri-phones, and are called phonetically biphone-rich and triphone-rich respectively. It is required about 10 times more words than that of cross-syllable biphones to cover all the cross-syllable triphones. To facilitate fair comparison, the biphone-rich corpus is thus consisted often sets of words that each covers all the cross-syllable biphones. With those words as data sheets, a male Taiwanese speaker recorded all the words as microphone speech. The resulting speech corpora, about 100 minutes for each set, are used to train for the acoustic models. Although both perform quite well in tasks with recognition networks of linear net and free syllable net, the triphone-rich corpus does not show much advantages over the biphone-rich corpus.
Keywords
speech recognition; automatic speech recognition; cross-syllable biphones corpus; cross-syllable triphones corpus; speech corpora; speech recognition system; Automatic speech recognition; Chromium; Computer science; Loudspeakers; Microphones; Natural languages; Speech recognition; Speech synthesis; Statistics; Tongue;
fLanguage
English
Publisher
ieee
Conference_Titel
Cellular Neural Networks and Their Applications, 2005 9th International Workshop on
Print_ISBN
0-7803-9185-3
Type
conf
DOI
10.1109/CNNA.2005.1543194
Filename
1543194
Link To Document