• DocumentCode
    2176555
  • Title

    Incorporating alignments into Conditional Random Fields for grapheme to phoneme conversion

  • Author

    Lehnen, Patrick ; Hahn, Stefan ; Guta, Andreas ; Ney, Hermann

  • Author_Institution
    Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4916
  • Lastpage
    4919
  • Abstract
    Conditional Random Fields (CRFs) are a state-of-the-art approach to natural language processing tasks like grapheme-to phoneme (g2p) conversion which is used to produce pronunciations or pronunciation variants for almost all ASR pronunciation lexica. One drawback of CRFs is that for training, an alignment is needed between graphemes and phonemes, usually even 1-to-l. The quality of the g2p result heavily depends on this alignment. Since these alignments are usually not annotated within the corpora, external models have to be used to produce such an alignment in a preprocessing step. In this work, we propose two approaches to integrate the alignment generation directly and efficiently into the CRF training process. Whereas the first approach relies on linear segmentation as starting point, the second approach considers all possible alignments given certain constraints. Both methods have been evaluated on two English g2p tasks, namely NETtalk and Celex, on which state-of-the-art results have been reported in the literature. The proposed approaches lead to results comparable to the state-of-the art.
  • Keywords
    speech recognition; ASR pronunciation lexica; CRF training process; Celex; G2P conversion; NETtalk; conditional random fields; grapheme-to- phoneme conversion; linear segmentation; natural language processing tasks; Automata; Biological system modeling; Hidden Markov models; Joints; Manuals; Mathematical model; Training; Alignments; CRF; EM Algorithm; G2P;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947458
  • Filename
    5947458