• DocumentCode
    3587523
  • Title

    Automatic segmentation of broadcast news audio using self similarity matrix

  • Author

    Soni, Sapna ; Ahmed, Imran ; Kopparapu, Sunil Kumar

  • Author_Institution
    Inst. of Technol., Nirma Univ., Ahmedabad, India
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Audio broadcast news is composed of music, commercials, news from correspondents and recorded statements; apart from the newsreader who reads out the news during the news program. Automatic segmentation of news audio is essential, for automatic time alignment of audio and the available text transcription to build speech corpora. In this paper we address this particular problem of segmentation of broadcast news audio, in order to extract the audio segments corresponding to the newsreader. We observe that the existing systems and techniques proposed in literature tend to face difficulties when used on our task of automatic segmentation and extraction of the newsreader segments; and thus produce sub- optimal results. Hence, we propose a technique which identifies the actual acoustic change points using an acoustic Self Similarity Matrix. We discuss the proposed algorithm, which uses a two pass technique, and present experimental results on broadcast news audio from All India Radio.
  • Keywords
    audio signal processing; matrix algebra; radio broadcasting; All India Radio; acoustic identification; acoustic self similarity matrix; audio automatic time alignment; broadcast news audio automatic segmentation; speech corpora; text transcription; two pass technique; Convergence; Face; Feature extraction; Mel frequency cepstral coefficient; Speech; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Convergence of Technology (I2CT), 2014 International Conference for
  • Print_ISBN
    978-1-4799-3758-5
  • Type

    conf

  • DOI
    10.1109/I2CT.2014.7092245
  • Filename
    7092245