A background music detection method based on robust feature extraction

Author

Izumitani, Tomonori ; Mukai, Ryo ; Kashino, Kunio

Author_Institution

NTT Commun. Sci. Labs., Kanagawa

fYear

2008

fDate

March 31 2008-April 4 2008

Firstpage

13

Lastpage

16

Abstract

We propose a music segment detection method for audio signals. Unlike many existing methods, ours specifically focuses on a background-music detection task, that is, detecting music used in background of main sounds. This task is important because music is almost always overlapped by speech or other environmental sounds in visual materials such as TV programs. Our method consists of feature extraction, dimension reduction, and statistical discrimination steps. For each step, we analyzed a set of methods to maximize the detection accuracy. With a simple post processing step, we achieved a framewise error rate as low as 8 % even when the mixed speech was louder than the target music by 10dB.

Keywords

audio signal processing; feature extraction; music; signal detection; speech processing; statistical analysis; audio signal; background music segment detection method; dimension reduction; robust feature extraction; speech-music discrimination system; statistical discrimination; Feature extraction; Frequency; Hidden Markov models; Indexing; Multimedia systems; Multiple signal classification; Music; Robustness; Speech; TV; Background music detection; Gaussian mixture model; feature selection; k-nearest neighbor method;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location

Las Vegas, NV

ISSN

1520-6149

Print_ISBN

978-1-4244-1483-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2008.4517534

Filename

4517534