Title :
Bilingual audio-subtitle extraction using automatic segmentation of movie audio
Author :
Tsiartas, Andreas ; Ghosh, Prasanta ; Georgiou, Panayiotis G. ; Narayanan, Shrikanth
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Extraction of bilingual audio and text data is crucial for designing Speech to Speech (S2S) systems. In this work, we propose an automatic method to segment multilingual audio streams from movies. In addition, the audio streams are aligned with the corresponding subtitles. We found that the proposed method gives 89% perfectly segmented bilingual audio and 6% partially segmented bilingual audio. In addition, the mapping of the audio to the corresponding subtitles has accuracy 91%.
Keywords :
audio signal processing; speech processing; S2S systems; automatic segmentation; bilingual audio data; bilingual audio-subtitle extraction; movie audio; multilingual audio stream segmentation; speech to speech systems; text data; Accuracy; Acoustics; Conferences; Data mining; Motion pictures; Speech; Training; audio segmentation; bilingual movie audio; movie subtitle;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947635