• DocumentCode
    2177192
  • Title

    Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures

  • Author

    Yeh, Ching-Feng ; Sun, Liang-Che ; Huan, Chao-Yu ; Lee, Lin-shan

  • Author_Institution
    Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5020
  • Lastpage
    5023
  • Abstract
    This paper presents a bilingual acoustic modeling approach for transcribing Mandarin-English code-mixed lectures with highly unbalanced language distribution. Special terminologies for the content were produced in the guest language of English (about 15%) and embedded in the utterances produced in the host language of Mandarin (about 85%). The code-mixing nature of the target corpus and the very small percentage of the English data made the task difficult. State mapping and merging approaches plus three stages of model adaptation handles the above problem. Significant improvements in recognition accuracy were obtained in the experiment with a real bilingual code-mixed lecture corpus recorded at National Taiwan University. The code-mixing situation considered is actually very natural in the spoken language of the daily lives of many people in the globalized world today.
  • Keywords
    speech recognition; Mandarin-English code-mixed lectures; bilingual acoustic modeling; recognition accuracy; speech processing; three-stage adaptation; transcribing unbalanced code-mixed lectures; Adaptation models; Silicon; Switches; acoustic modeling; adaptation; bilingual; code-mixing; lecture; state mapping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947484
  • Filename
    5947484