• DocumentCode
    1275779
  • Title

    Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription

  • Author

    Benetos, Emmanouil ; Dixon, Sam

  • Author_Institution
    Centre for Digital Music, Queen Mary Univ. of London, London, UK
  • Volume
    5
  • Issue
    6
  • fYear
    2011
  • Firstpage
    1111
  • Lastpage
    1123
  • Abstract
    In this paper, a method for automatic transcription of music signals based on joint multiple-F0 estimation is proposed. As a time-frequency representation, the constant-Q resonator time-frequency image is employed, while a novel noise suppression technique based on pink noise assumption is applied in a preprocessing step. In the multiple-F0 estimation stage, the optimal tuning and inharmonicity parameters are computed and a salience function is proposed in order to select pitch candidates. For each pitch candidate combination, an overlapping partial treatment procedure is used, which is based on a novel spectral envelope estimation procedure for the log-frequency domain, in order to compute the harmonic envelope of candidate pitches. In order to select the optimal pitch combination for each time frame, a score function is proposed which combines spectral and temporal characteristics of the candidate pitches and also aims to suppress harmonic errors. For postprocessing, hidden Markov models (HMMs) and conditional random fields (CRFs) trained on MIDI data are employed, in order to boost transcription accuracy. The system was trained on isolated piano sounds from the MAPS database and was tested on classic and jazz recordings from the RWC database, as well as on recordings from a Disklavier piano. A comparison with several state-of-the-art systems is provided using a variety of error metrics, where encouraging results are indicated.
  • Keywords
    audio signal processing; hidden Markov models; music; random processes; signal denoising; time-frequency analysis; Disklavier piano; MAPS database; classic recordings; conditional random fields; constant-Q resonator time-frequency image; error metrics; harmonic envelope estimation; harmonic errors suppression; hidden Markov model; isolated piano sounds; jazz recordings; joint multipitch detection; joint multiple-F0 estimation; log-frequency domain; music signal transcription; noise suppression technique; overlapping partial treatment procedure; pink noise assumption; polyphonic music transcription; salience function; spectral characteristics; spectral envelope estimation; temporal characteristics; time-frequency representation; Estimation; Harmonic analysis; Hidden Markov models; Multiple signal classification; Noise; Time frequency analysis; Tuning; Automatic music transcription; conditional random fields (CRFs); harmonic envelope estimation; resonator time–frequency image;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2011.2162394
  • Filename
    5957255