Music Identification with Weighted Finite-State Transducers

Author

Weinstein, E. ; Moreno, Pablo

Author_Institution

Courant Inst. of Mathematical Sci., New York, NY, USA

Volume

2

fYear

2007

fDate

15-20 April 2007

Abstract

Music identification is the process of matching an audio stream to a particular song. Previous work has relied on hashing, where an exact or almost-exact match between local features of the test and reference recordings is required. In this work we present a new approach to music identification based on finite-state transducers and Gaussian mixture models. We apply an unsupervised training process to learn an inventory of music phone units similar to phonemes in speech. We also learn a unique sequence of music units characterizing each song. We further propose a novel application of transducers for recognition of music phone sequences. Preliminary experiments demonstrate an identification accuracy of 99.5% on a database of over 15,000 songs running faster than real time.

Keywords

audio signal processing; transducers; Gaussian mixture models; audio stream matching; music identification; music phone units; unsupervised training process; weighted finite-state transducers; Acoustic transducers; Audio recording; Cepstral analysis; Fingerprint recognition; Hidden Markov models; Maximum likelihood decoding; Music; Speech processing; Streaming media; Testing; Music identification; acoustic modeling; finite-state transducers;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location

Honolulu, HI

ISSN

1520-6149

Print_ISBN

1-4244-0727-3

Type

conf

DOI

10.1109/ICASSP.2007.366329

Filename

4217502