DocumentCode
2790302
Title
Speech/Music Discrimination Based on Spectral Peak Analysis and Multi-layer Perceptron
Author
Keum, Ji-Soo ; Lee, Hyon-Soo
Author_Institution
Kyung Hee University
Volume
2
fYear
2006
fDate
9-11 Nov. 2006
Firstpage
56
Lastpage
61
Abstract
This study presents a new Speech/Music discrimination method based on spectral peak feature and Multilayer Perceptron. The focus was on feature extraction that reflects spectral peak duration characteristics and high performance using small number of train dataset. Spectral peak features were extracted from audio spectral peak tracks and the feature was normalized by length of segment. Then, we grouping the frequency channel to reflect the spectral distribution. For train, only 25 seconds of speech (Korean) and 50 seconds of music are used. This method was evaluated on speech and music for 24,258 seconds of audio data. An average accuracy was 96.58% for speech and 91.82% for music. The results of this experiment found that proposed method was suitable for Speech/Music discrimination.
Keywords
Data mining; Feature extraction; Indexing; Loudspeakers; Mel frequency cepstral coefficient; Multilayer perceptrons; Music; Spectral analysis; Speech analysis; Strontium;
fLanguage
English
Publisher
ieee
Conference_Titel
Hybrid Information Technology, 2006. ICHIT '06. International Conference on
Conference_Location
Cheju Island
Print_ISBN
0-7695-2674-8
Type
conf
DOI
10.1109/ICHIT.2006.253589
Filename
4021194
Link To Document