• DocumentCode
    2560904
  • Title

    Biphone-rich versus triphone-rich: a comparison of speech corpora in automatic speech recognition

  • Author

    Yio, Yong-Chang ; Liang, Min-Siong ; Chiang, Yuang-chin ; Lyu, Ren-Yuan

  • Author_Institution
    Inst. of Stat., Nat. Tsing Hua Univ., Hsin Chu, Taiwan
  • fYear
    2005
  • fDate
    28-30 May 2005
  • Firstpage
    194
  • Lastpage
    197
  • Abstract
    In this paper, we compare the performance of a speech recognition system trained on two speech corpora. We select two sets of words, one covering all the cross-syllable biphones and the other all the cross-syllable triphones; they are called phonetically biphone-rich and triphone-rich, respectively. Covering all the cross-syllable triphones requires about 10 times more words than covering all the cross-syllable biphones. To facilitate a fair comparison, the biphone-rich corpus therefore consists of ten sets of words, each of which covers all the cross-syllable biphones. Using these words as data sheets, a male Taiwanese speaker recorded all the words as microphone speech. The resulting speech corpora, about 100 minutes for each set, are used to train the acoustic models. Although both perform quite well in tasks with linear-net and free-syllable-net recognition networks, the triphone-rich corpus does not show much advantage over the biphone-rich corpus.
  • Keywords
    speech recognition; automatic speech recognition; cross-syllable biphones corpus; cross-syllable triphones corpus; speech corpora; speech recognition system; Automatic speech recognition; Chromium; Computer science; Loudspeakers; Microphones; Natural languages; Speech recognition; Speech synthesis; Statistics; Tongue;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cellular Neural Networks and Their Applications, 2005 9th International Workshop on
  • Print_ISBN
    0-7803-9185-3
  • Type

    conf

  • DOI
    10.1109/CNNA.2005.1543194
  • Filename
    1543194