• DocumentCode
    431597
  • Title

    Broadcast news segmentation by audio type analysis

  • Author

    Nwe, Tin Lay ; Li, Haizhou

  • Author_Institution
    Inst. for Infocomm Res., Singapore
  • Volume
    2
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    It is common for us to define audio types according to human perception instead of audio spectral properties. In this paper, we analyse the spectral properties of audio types and propose the acoustic features based on spectral properties and harmonic enhancement, to classify audio. By analyzing the spectral properties of sound types, a multi-model HMM is proposed to integrate the primitive spectral properties in statistical modeling. To validate the approach, we build a classifier to segment audio streams into speech, commercials, environmental sound, physical violence and silence in multiple steps. It is shown that the proposed approach outperforms conventional methods. Experimental evaluations on 20 audio tracks of the TRECVID broadcast news database have shown the effectiveness of the proposed approach.
  • Keywords
    audio signal processing; feature extraction; hidden Markov models; signal classification; spectral analysis; acoustic features; audio classification; audio stream classification; audio type analysis; audio type spectral properties; broadcast news segmentation; energy distribution analysis; harmonic enhancement; human audio perception; multimodel HMM; statistical modeling; Broadcasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1415592
  • Filename
    1415592