• DocumentCode
    2933055
  • Title

    A phrase-level piecewise linear scaling algorithm for melody match in Query-by-Humming systems

  • Author

    Cao, Wenxiao ; Jiang, Danning ; Hou, Jue ; Qin, Yong ; Zheng, Thomas Fang ; Liu, Yi

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    942
  • Lastpage
    945
  • Abstract
    The Query-by-Humming (QBH) system allows users to retrieve songs by singing or humming. In this paper we propose a phrase-level piecewise linear scaling algorithm for melody match. Musical phrase boundaries are predicted for the query, which is then split into phrases. The boundaries of the melody fragment corresponding to each phrase may be adjusted within a limited range. The algorithm employs Dynamic Programming and Recursive Alignment to search for the minimal piecewise matching cost based on Linear Scaling at the phrase level. Our experimental results on a database of 5223 melodies show that the proposed algorithm outperforms traditional algorithms, improving the top-1 rate by 17.0%, 14.7% and 4.8% over Linear Scaling, Dynamic Time Warping and Recursive Alignment, respectively. The results show that the proposed algorithm is more efficient than the previous algorithms.
  • Keywords
    speech recognition; dynamic programming; melody match; minimal piecewise matching; musical phrase boundaries; phrase-level piecewise linear scaling; query-by-humming systems; recursive alignment; Costs; Databases; Dynamic programming; Heuristic algorithms; Laboratories; Natural languages; Piecewise linear techniques; Rhythm; Speech; Technological innovation; Phrase-level; Piecewise Linear Scaling; Query-by-Humming;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202651
  • Filename
    5202651