• DocumentCode
    780216
  • Title

    Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models

  • Author

    Vincent, Emmanuel ; Plumbley, Mark D.

  • Author_Institution
    Dept. of Electron. Eng., Queen Mary, Univ. of London
  • Volume
    15
  • Issue
    4
  • fYear
    2007
  • fDate
    5/1/2007 12:00:00 AM
  • Firstpage
    1273
  • Lastpage
    1282
  • Abstract
    This paper deals with the decomposition of music signals into pitched sound objects made of harmonic sinusoidal partials for very low bit-rate coding purposes. After a brief review of existing methods, we recast this problem in the Bayesian framework. We propose a family of probabilistic signal models combining learned object priors and various perceptually motivated distortion measures. We design efficient algorithms to infer object parameters and build a coder based on the interpolation of frequency and amplitude parameters. Listening tests suggest that the loudness-based distortion measure outperforms other distortion measures and that our coder results in a better sound quality than baseline transform and parametric coders at 8 and 2 kbit/s. This work constitutes a new step towards a fully object-based coding system, which would represent audio signals as collections of meaningful note-like sound objects
  • Keywords
    audio coding; harmonic analysis; music; probability; Bayesian harmonic models; harmonic sinusoidal partials; loudness-based distortion measure; low bit-rate object coding; music signals; musical audio; pitched sound objects; probabilistic signal models; Acoustic noise; Algorithm design and analysis; Bayesian methods; Distortion measurement; Frequency estimation; Interpolation; MPEG 4 Standard; Multiple signal classification; Music; Speech analysis; Bayesian inference; harmonic sinusoidal model; object coding; perceptual distortion measure;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.889792
  • Filename
    4156222