Extracting noise-robust features from audio data

Author

Burges, Christopher J.C. ; Platt, John C. ; Jana, Soumya

Author_Institution

Microsoft Research, One Microsoft Way, Redmond, WA 98052, USA

Volume

fYear

2002

fDate

13-17 May 2002

Abstract

A key problem faced by audio identification, classification, and retrieval systems is the mapping of high-dimensional audio input data into informative lower-dimensional feature vectors. This paper explores an automatic dimensionality reduction algorithm called Distortion Discriminant Analysis (DDA). Each layer of DDA projects its input into directions which maximize the SNR for a given set of distortions. Multiple layers efficiently extract features over a wide temporal window. The audio input to DDA undergoes perceptually-relevant preprocessing and de-equalization, to further suppress distortions. We apply DDA to the task of identifying audio clips in an incoming audio stream, based on matching stored audio fingerprints. We show excellent test results on matching input fingerprints against 36 hours of stored audio data.

Keywords

Data mining; Ferroelectric films; Nonvolatile memory; Random access memory; Robustness;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location

Orlando, FL, USA

ISSN

1520-6149

Print_ISBN

0-7803-7402-9

Type

conf

DOI

10.1109/ICASSP.2002.5743968

Filename

5743968

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=542349