• DocumentCode
    2345892
  • Title

    Generating Subtitles Automatically Using Audio Extraction and Speech Recognition

  • Author

    Mathur, Abhinav ; Saxena, Tanya ; Krishnamurthi, Rajalakshmi

  • Author_Institution
    Dept. of Comput. Sci., Jaypee Inst. of Inf. Technol., Noida, India
  • fYear
    2015
  • fDate
    13-14 Feb. 2015
  • Firstpage
    621
  • Lastpage
    626
  • Abstract
    In present scenario, video plays a vital role to help people understand and comprehend the information for example the songs, movies or the video lectures or any other multimedia data relevant to the user. Hence, here it becomes important to make videos available to the people having auditory problems and even more for the people to remove the gaps of their native language. This can be best done by the use of subtitles of the video. However, downloading subtitles of any video from the internet is a monotonous process. Consequently, to generate subtitles automatically through the software itself and without the use of internet is a valid subject of research. Hence, this research paper resolves the above issue through three distinct modules namely Audio Extraction which converts an input file of any format supported by MPEG standards to .wav format. Here 24% reduction rate has been achieved in the size of the song after the extraction. Then Speech Recognition of the extracted .wav file is implemented and finally, Subtitle Generation in which a .txt/.srt file is generated which is synchronized with the input file.
  • Keywords
    audio signal processing; feature extraction; handicapped aids; speech recognition; video signal processing; .srt file; .txt file; MPEG standards; audio extraction; auditory problems; native language; speech recognition; subtitle automatic generation; video; wav format; Acoustics; Bit rate; Filter banks; Hidden Markov models; Psychoacoustic models; Speech recognition; .srt file; .wav format; Audio Extraction; MPEG standards; Speech Recognition; Subtitle Generation; Subtitles; Video;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence & Communication Technology (CICT), 2015 IEEE International Conference on
  • Conference_Location
    Ghaziabad
  • Print_ISBN
    978-1-4799-6022-4
  • Type

    conf

  • DOI
    10.1109/CICT.2015.46
  • Filename
    7078779