Title :
Speech/Music Discrimination Based on Spectral Peak Analysis and Multi-layer Perceptron
Author :
Keum, Ji-Soo ; Lee, Hyon-Soo
Author_Institution :
Kyung Hee University
Abstract :
This study presents a new Speech/Music discrimination method based on spectral peak feature and Multilayer Perceptron. The focus was on feature extraction that reflects spectral peak duration characteristics and high performance using small number of train dataset. Spectral peak features were extracted from audio spectral peak tracks and the feature was normalized by length of segment. Then, we grouping the frequency channel to reflect the spectral distribution. For train, only 25 seconds of speech (Korean) and 50 seconds of music are used. This method was evaluated on speech and music for 24,258 seconds of audio data. An average accuracy was 96.58% for speech and 91.82% for music. The results of this experiment found that proposed method was suitable for Speech/Music discrimination.
Keywords :
Data mining; Feature extraction; Indexing; Loudspeakers; Mel frequency cepstral coefficient; Multilayer perceptrons; Music; Spectral analysis; Speech analysis; Strontium;
Conference_Titel :
Hybrid Information Technology, 2006. ICHIT '06. International Conference on
Conference_Location :
Cheju Island
Print_ISBN :
0-7695-2674-8
DOI :
10.1109/ICHIT.2006.253589