DocumentCode :
3670776
Title :
System for producing subtitles to internet audio-visual documents
Author :
Jan Nouza;Karel Blavka;Marek Boháć;Petr Červa;Jiří Málek
Author_Institution :
SpeechLab at the Institute of Information Technology and Electronics, Faculty of Mechatronics, Informatics and Interdisciplinary Studies, Technical University of Liberec, Liberec, 461 17, Czech Republic
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we present a system developed in our lab to provide subtitles to audio-visual shows and documents produced by Czech internet company Stream.cz. The main goal is to make these programs understandable also for deaf and hearing impaired persons. We describe the whole process that starts with extracting the audio channel from the document, then identifies speech and converts it to text (using either automatically generated or human edited transcripts), and eventually produces the subtitles synchronized with the audio and video tracks. We present the employed methods (including the adaptation of the system to the target data), compare results on various types of documents and provide some relevant statistics collected during the first year of practical deployment.
Keywords :
"Speech","Multiple signal classification","Speech recognition","Acoustics","Noise","Synchronization","Training"
Publisher :
ieee
Conference_Titel :
Telecommunications and Signal Processing (TSP), 2015 38th International Conference on
Type :
conf
DOI :
10.1109/TSP.2015.7296415
Filename :
7296415
Link To Document :
بازگشت