• DocumentCode
    3625854
  • Title

    Language Modeling For Computer-Aided Transcription

  • Author

    Cagdas Kayra Akman;Murat Saraclar

  • Author_Institution
    Elektrik-Elektronik M?hendisli?i B?l?m?, Bo?azi?i ?niversitesi, Bebek, ?stanbul, T?rkiye. kayra.akman@boun.edu.tr
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Speech recognition and language processing systems require large amounts of transcribed speech corpora. Manual transcription is expensive and slow. Computers may do the same task faster but with more errors. Computer aided transcription is a compromise between these two methods. The output lattices of an ASR engine are manipulated to be used as language models in combination with a letter-based N-gram language model. The combined model is used as the language model of the open source Dasher application. The resulting application allows easy transcription of speech data thanks to the combination of both models at letter level. It is shown that the combined model performs better than both a letter-based N-gram model and models combined at sentence level.
  • Keywords
    "Application software","Intersymbol interference","Speech recognition","Natural languages","Speech processing","Computer errors","Lattices","Automatic speech recognition","Engines"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications, 2007. SIU 2007. IEEE 15th
  • ISSN
    2165-0608
  • Print_ISBN
    1-4244-0719-2
  • Type

    conf

  • DOI
    10.1109/SIU.2007.4298566
  • Filename
    4298566