مرکز منطقه ای اطلاع رساني علوم و فناوري - System for producing subtitles to internet audio-visual documents

DocumentCode :

3670776

Title :

System for producing subtitles to internet audio-visual documents

Author :

Jan Nouza;Karel Blavka;Marek Boháć;Petr Červa;Jiří Málek

Author_Institution :

SpeechLab at the Institute of Information Technology and Electronics, Faculty of Mechatronics, Informatics and Interdisciplinary Studies, Technical University of Liberec, Liberec, 461 17, Czech Republic

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

In this paper, we present a system developed in our lab to provide subtitles to audio-visual shows and documents produced by Czech internet company Stream.cz. The main goal is to make these programs understandable also for deaf and hearing impaired persons. We describe the whole process that starts with extracting the audio channel from the document, then identifies speech and converts it to text (using either automatically generated or human edited transcripts), and eventually produces the subtitles synchronized with the audio and video tracks. We present the employed methods (including the adaptation of the system to the target data), compare results on various types of documents and provide some relevant statistics collected during the first year of practical deployment.

Keywords :

"Speech","Multiple signal classification","Speech recognition","Acoustics","Noise","Synchronization","Training"

Publisher :

ieee

Conference_Titel :

Telecommunications and Signal Processing (TSP), 2015 38th International Conference on

Type :

conf

DOI :

10.1109/TSP.2015.7296415

Filename :

7296415

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3670776