DocumentCode :
2781285
Title :
An improved recursive algorithm for automatic alignment of complex long audio
Author :
Kejia, He ; Gang, Liu ; Jie, Tang ; Jun, Guo
Author_Institution :
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2009
fDate :
6-8 Nov. 2009
Firstpage :
690
Lastpage :
694
Abstract :
In this paper we present an approach for automatic alignment of long audio data with varied acoustic conditions to their corresponding transcripts in an effective manner. Accurate time-aligned transcripts provide easier access to audio materials by aiding applications such as the indexing, summarizing and retrieving of audio segments. Accurate time alignments are also necessary for labeling the training data for a speech recognizer´s acoustic model. We provide an improved recursive technique of speech recognition with a gradually self-adaptive language model and acoustic model.
Keywords :
acoustic signal processing; audio coding; recursive estimation; speech recognition; acoustic model; audio segments retrieving; automatic alignment; indexing; long audio data; recursive algorithm; self-adaptive language model; speech recognition; summarizing; time-aligned transcripts; Automatic speech recognition; Dynamic programming; Indexing; Labeling; Multimedia communication; Multimedia databases; Natural languages; Speech recognition; Text recognition; Training data; Speech alignment; acoustic re-estimation; dynamic programming; language model adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Infrastructure and Digital Content, 2009. IC-NIDC 2009. IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-4898-2
Electronic_ISBN :
978-1-4244-4900-6
Type :
conf
DOI :
10.1109/ICNIDC.2009.5360838
Filename :
5360838
Link To Document :
بازگشت