• DocumentCode
    463721
  • Title

    Music Identification with Weighted Finite-State Transducers

  • Author

    Weinstein, E. ; Moreno, Pablo

  • Author_Institution
    Courant Inst. of Mathematical Sci., New York, NY, USA
  • Volume
    2
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Music identification is the process of matching an audio stream to a particular song. Previous work has relied on hashing, where an exact or almost-exact match between local features of the test and reference recordings is required. In this work we present a new approach to music identification based on finite-state transducers and Gaussian mixture models. We apply an unsupervised training process to learn an inventory of music phone units similar to phonemes in speech. We also learn a unique sequence of music units characterizing each song. We further propose a novel application of transducers for recognition of music phone sequences. Preliminary experiments demonstrate an identification accuracy of 99.5% on a database of over 15,000 songs running faster than real time.
  • Keywords
    audio signal processing; transducers; Gaussian mixture models; audio stream matching; music identification; music phone units; unsupervised training process; weighted finite-state transducers; Acoustic transducers; Audio recording; Cepstral analysis; Fingerprint recognition; Hidden Markov models; Maximum likelihood decoding; Music; Speech processing; Streaming media; Testing; Music identification; acoustic modeling; finite-state transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366329
  • Filename
    4217502