• DocumentCode
    1274013
  • Title

    Signal-Driven Window-Length Adaptation for Sinusoid Detection in Polyphonic Music

  • Author

    Rao, Vishweshwara ; Gaddipati, Pradeep ; Rao, Preeti

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
  • Volume
    20
  • Issue
    1
  • fYear
    2012
  • Firstpage
    342
  • Lastpage
    348
  • Abstract
    Audio processing applications that use short-time signal analysis techniques typically utilize fixed window duration single- or multi-resolution analyses. However, different real-world signal conditions such as polyphony and non-stationarity, manifested as musical accompaniment and pitch-modulations, respectively, in the context of music content analysis, require varying data window lengths for reliable processing. In this paper, we investigate the use of signal sparsity for adapting analysis window lengths. Adaptive-window analysis driven by different measures of sparsity applied to the local spectrum, such as kurtosis and Gini index, is evaluated and shown to be superior to fixed-window analysis in terms of sinusoid detection and frequency estimation for simulated and real signals. A window main-lobe matching method for sinusoid detection is also shown to be more robust to signal conditions such as polyphony and frequency modulation relative to other methods.
  • Keywords
    audio signal processing; frequency modulation; music; signal detection; signal resolution; Gini index; adaptive-window analysis; audio processing application; frequency estimation; frequency modulation; kurtosis index; multiresolution analysis; music content analysis; musical accompaniment; pitch modulation; polyphonic music; real-world signal condition; reliable processing; short-time signal analysis technique; signal sparsity; signal-driven window-length adaptation; sinusoid detection; window main-lobe matching method; Frequency estimation; Frequency modulation; Harmonic analysis; Interference; Noise; Time frequency analysis; Signal sparsity; sinusoid detection; window adaptation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2162319
  • Filename
    5955094