DocumentCode :
2812615
Title :
Automatic Synchronization of live speech and its Transcripts based on a frame-synchronous likelihood ratio test
Author :
Gao, Jie ; Zhao, Qingwei ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
1622
Lastpage :
1625
Abstract :
In this paper, we present our initial efforts on the task of Automatically Synchronizing live spoken Utterances with their Transcripts (textual contents) (ASUT) when the texts are known. We treat it as an online speech-text alignment problem, which is further simplified into the problem of detecting, on the fly, the end time of a spoken utterance given its textual content. A general framework called the frame-synchronous likelihood ratio test (FS-LRT) procedure is proposed for this end-time detection task and explored with hidden Markov models (HMMs). The properties of FS-LRT are studied empirically. Extensive experiments indicate that the proposed approach performs satisfactorily. In addition, FS-LRT has been successfully applied in a subtitling system for live broadcast news.
Keywords :
hidden Markov models; speech processing; text analysis; FS-LRT procedure; automatically synchronizing spoken utterances with their transcripts; end time detection task; frame-synchronous likelihood ratio test; hidden Markov model; live broadcast news; live speech; live spoken utterance; online speech-text alignment problem; textual content; Acoustic testing; Automatic speech recognition; Automatic testing; Delay; Digital multimedia broadcasting; Error correction; Hidden Markov models; Multimedia communication; Research and development; TV broadcasting
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5496295
Filename :
5496295