• DocumentCode
    2790302
  • Title

    Speech/Music Discrimination Based on Spectral Peak Analysis and Multi-layer Perceptron

  • Author

    Keum, Ji-Soo ; Lee, Hyon-Soo

  • Author_Institution
    Kyung Hee University
  • Volume
    2
  • fYear
    2006
  • fDate
    9-11 Nov. 2006
  • Firstpage
    56
  • Lastpage
    61
  • Abstract
    This study presents a new Speech/Music discrimination method based on spectral peak feature and Multilayer Perceptron. The focus was on feature extraction that reflects spectral peak duration characteristics and high performance using small number of train dataset. Spectral peak features were extracted from audio spectral peak tracks and the feature was normalized by length of segment. Then, we grouping the frequency channel to reflect the spectral distribution. For train, only 25 seconds of speech (Korean) and 50 seconds of music are used. This method was evaluated on speech and music for 24,258 seconds of audio data. An average accuracy was 96.58% for speech and 91.82% for music. The results of this experiment found that proposed method was suitable for Speech/Music discrimination.
  • Keywords
    Data mining; Feature extraction; Indexing; Loudspeakers; Mel frequency cepstral coefficient; Multilayer perceptrons; Music; Spectral analysis; Speech analysis; Strontium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hybrid Information Technology, 2006. ICHIT '06. International Conference on
  • Conference_Location
    Cheju Island
  • Print_ISBN
    0-7695-2674-8
  • Type

    conf

  • DOI
    10.1109/ICHIT.2006.253589
  • Filename
    4021194