• DocumentCode
    3527324
  • Title

    Language model transformation applied to lightly supervised training of acoustic model for congress meetings

  • Author

    Kawahara, Tatsuya ; Mimura, Masato ; Akita, Yuya

  • Author_Institution
    Acad. Center for Comput. & Media Studies, Kyoto Univ., Kyoto
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    3853
  • Lastpage
    3856
  • Abstract
    For effective training of acoustic and language models for spontaneous speech such as meetings, it is significant to exploit the texts available in a large scale, which may not be faithful transcripts of the utterances. We have proposed a language model transformation scheme to cope with the differences between verbatim transcripts of spontaneous utterances and human-made transcripts such as those in proceedings. In this paper, we investigate its application to lightly supervised training of the acoustic model. By transforming the corresponding text in the proceedings, we can generate a very constrained model to predict the actual utterances. The experimental evaluation with the transcription system for the Japanese Congress meetings demonstrated that the proposed scheme can generate accurate labels for acoustic model training and thus realizes the comparable ASR (Automatic Speech Recognition) performance to the case using manual transcripts.
  • Keywords
    natural language processing; speech recognition; speech recognition equipment; acoustic model; automatic speech recognition; congress meetings; human-made transcripts; language model transformation; lightly supervised training; spontaneous speech; spontaneous utterance; verbatim transcript; Acoustic applications; Automatic speech recognition; Broadcasting; Humans; Large-scale systems; Natural languages; Predictive models; Speech recognition; Surface-mount technology; Training data; acoustic model; language model; lightly supervised training; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960468
  • Filename
    4960468