• DocumentCode
    2179654
  • Title

    Gain-robust multi-pitch tracking using sparse nonnegative matrix factorization

  • Author

    Peharz, Robert ; Wohlmayr, Michael ; Pernkopf, Franz

  • Author_Institution
    Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5416
  • Lastpage
    5419
  • Abstract
    While nonnegative matrix factorization (NMF) has successfully been applied for gain-robust multi-pitch detection, a method to track pitch values over time was not provided. We embed NMF-based pitch detection into a recently proposed pitch-tracking system, based on a factorial hidden Markov model (FHMM). The original system models speech spectra with Gaussian mixture models, which is sensitive to a gain mismatch between training and test data. We therefore combine the advantages of these two approaches and derive a gain-adaptive observation model for the FHMM. As training algorithm we use a modification of ℓ0-sparse NMF, which represents the short-time spectrum with scalable basis vectors. In experiments we show that the new approach significantly increases the gain-robustness of the original tracking system.
  • Keywords
    Gaussian processes; hidden Markov models; matrix decomposition; speech processing; FHMM; Gaussian mixture models; NMF-based pitch detection; factorial hidden Markov model; gain-robust multipitch detection; gain-robust multipitch tracking; sparse nonnegative matrix factorization; Gain; Hidden Markov models; Laplace equations; Markov processes; Sparse matrices; Speech; Training; factorial model; multi-pitch; sparse NMF;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947583
  • Filename
    5947583