DocumentCode
3625854
Title
Language Modeling For Computer-Aided Transcription
Author
Cagdas Kayra Akman;Murat Saraclar
Author_Institution
Elektrik-Elektronik Mühendisliği Bölümü, Boğaziçi Üniversitesi, Bebek, İstanbul, Türkiye. kayra.akman@boun.edu.tr
fYear
2007
fDate
6/1/2007 12:00:00 AM
Firstpage
1
Lastpage
4
Abstract
Speech recognition and language processing systems require large transcribed speech corpora. Manual transcription is expensive and slow. Automatic transcription is faster but produces more errors. Computer-aided transcription is a compromise between the two. The output lattices of an automatic speech recognition (ASR) engine are converted into language models and combined with a letter-based N-gram language model. The combined model serves as the language model of the open-source Dasher application. Because the two models are combined at the letter level, the resulting application allows easy transcription of speech data. The combined model is shown to perform better than both a letter-based N-gram model alone and models combined at the sentence level.
Keywords
"Application software","Intersymbol interference","Speech recognition","Natural languages","Speech processing","Computer errors","Lattices","Automatic speech recognition","Engines"
Publisher
IEEE
Conference_Titel
Signal Processing and Communications Applications, 2007. SIU 2007. IEEE 15th
ISSN
2165-0608
Print_ISBN
1-4244-0719-2
Type
conf
DOI
10.1109/SIU.2007.4298566
Filename
4298566