DocumentCode :
166104
Title :
A hybrid approach for Discourse Segment Detection in the automatic subtitle generation of computer science lecture videos
Author :
Sridhar, Rajeswari ; Aravind, S. ; Muneerulhudhakalvathi, Hamid ; Senthur, M. Sibi
Author_Institution :
Dept. of Comput. Sci. & Eng., Anna Univ., Chennai, India
fYear :
2014
fDate :
24-27 Sept. 2014
Firstpage :
284
Lastpage :
287
Abstract :
The aim of this paper is to develop an automatic subtitle generation system for computer science lecture videos. CMU Sphinx Speech API is used to accomplish speech recognition. The main challenge of this work, is to align the translated text with the video. Discourse Segment Detection (DSD) is the process of analyzing and identifying discourse boundaries in human speech. Discourse Segment Detection (DSD) is carried out that classifies word boundaries and groups words until a discourse break occurs. The approach that has been devised in this paper for DSD to identify word boundary is a hybrid approach combining acoustic and linguistic features from the speech. This helps to segment the text obtained from Speech Engine, group words that can be written to the subtitles file without violating the subtitle standards. The devised approach has shown an improved performance than the existing approach as the error has reduced from 30% to 18 %.
Keywords :
computer aided instruction; computer science education; speech recognition; speech synthesis; video signal processing; CMU Sphinx speech API; DSD; automatic subtitle generation; computer science lecture videos; hybrid discourse segment detection approach; speech engine; speech recognition; translated text; Acoustics; Automatic speech recognition; Computer science; Pragmatics; Speech; Videos; Discourse Segment Detection; Subtitles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
Type :
conf
DOI :
10.1109/ICACCI.2014.6968422
Filename :
6968422
Link To Document :
بازگشت