DocumentCode
3403577
Title
A background music detection method based on robust feature extraction
Author
Izumitani, Tomonori ; Mukai, Ryo ; Kashino, Kunio
Author_Institution
NTT Commun. Sci. Labs., Kanagawa
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
13
Lastpage
16
Abstract
We propose a music segment detection method for audio signals. Unlike many existing methods, ours specifically focuses on a background-music detection task, that is, detecting music used in background of main sounds. This task is important because music is almost always overlapped by speech or other environmental sounds in visual materials such as TV programs. Our method consists of feature extraction, dimension reduction, and statistical discrimination steps. For each step, we analyzed a set of methods to maximize the detection accuracy. With a simple post processing step, we achieved a framewise error rate as low as 8 % even when the mixed speech was louder than the target music by 10dB.
Keywords
audio signal processing; feature extraction; music; signal detection; speech processing; statistical analysis; audio signal; background music segment detection method; dimension reduction; robust feature extraction; speech-music discrimination system; statistical discrimination; Feature extraction; Frequency; Hidden Markov models; Indexing; Multimedia systems; Multiple signal classification; Music; Robustness; Speech; TV; Background music detection; Gaussian mixture model; feature selection; k-nearest neighbor method;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517534
Filename
4517534
Link To Document