• DocumentCode
    427168
  • Title

    Singing voice detection using twice-iterated composite Fourier transform

  • Author

    Maddage, Namunu Chinthaka ; Wan, Kongwah ; Xu, Changsheng ; Wang, Ye

  • Author_Institution
    Inst. for Infocomm Res., Singapore
  • Volume
    2
  • fYear
    2004
  • fDate
    30 June 2004
  • Firstpage
    1347
  • Abstract
    In this paper, we propose a twice-iterated composite Fourier transform (TICFT) technique to detect singing voice boundaries in acoustical polyphonic music signals. We show that the cumulative TICFT energy in the lower coefficients can differentiate the harmonic structures of vocal and instrumental music in higher octaves. The musical signal is first segmented into frames based on quarter-notes. TICFT is then used to measure the harmonic structure of each frame. Finally, the frames are classified as vocal or instrumental by applying music domain knowledge. Experimental results show that a frame-level accuracy of over 80% can be achieved.
  • Keywords
    Fourier transforms; audio signal processing; music; acoustical polyphonic music signals; cumulative TICFT energy; instrumental frame classification; quarter-notes; singing voice boundaries; singing voice detection; twice-iterated composite Fourier transform; vocal frame classification; Acoustic measurements; Acoustic signal detection; Band pass filters; Instruments; Multiple signal classification; Music information retrieval; Power harmonic filters; Rhythm; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-7803-8603-5
  • Type

    conf

  • DOI
    10.1109/ICME.2004.1394478
  • Filename
    1394478