• DocumentCode
    468324
  • Title

    An Unsupervised Audio Segmentation and Classification Approach

  • Author

    Pan, Wenjuan ; Yao, Yong ; Liu, Zhijing

  • Author_Institution
    Xidian Univ., Xian
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    303
  • Lastpage
    306
  • Abstract
    This paper presents an unsupervised audio segmentation and classification approach. First, the multiple change-point segmentation is adopted, and a new feature named Mel-ICA is introduced to improve it. An audio type "uncertain" is proposed to represent mixed type. Three features of each sub-segment are extracted using Fourier and wavelet transform. Then, classification is performed over each sub-segment based on feature threshold, and the majority rule is applied to determine the final type. The experimental results have shown that the false alarm rate decreased using Mel-ICA, and high accuracy of classification achieved.
  • Keywords
    Fourier transforms; audio signal processing; feature extraction; speech recognition; wavelet transforms; Fourier transform; Mel-ICA; feature extraction; feature threshold; mixed type representation; multiple change-point segmentation; uncertain audio type; unsupervised audio classification; unsupervised audio segmentation; wavelet transform; Automatic speech recognition; Classification tree analysis; Computer science; Feature extraction; Filters; Fourier transforms; Hidden Markov models; Independent component analysis; Neural networks; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.172
  • Filename
    4406249