• DocumentCode
    1485692
  • Title

    Histogram Equalization-Based Features for Speech, Music, and Song Discrimination

  • Author

    Gallardo-Antolín, Ascensión ; Montero, Juan M.

  • Author_Institution
    Dept. of Signal Theor. & Commun., Univ. Carlos III de Madrid, Leganes, Spain
  • Volume
    17
  • Issue
    7
  • fYear
    2010
  • fDate
    7/1/2010 12:00:00 AM
  • Firstpage
    659
  • Lastpage
    662
  • Abstract
    In this letter, we present a new class of segment-based features for speech, music and song discrimination. These features, called PHEQ (Polynomial-Fit Histogram Equalization), are derived from the nonlinear relationship between the short-term feature distributions computed at segment level and a reference distribution. Results show that PHEQ characteristics outperform short-term features such as Mel Frequency Cepstrum Coefficients (MFCC) and conventional segment-based ones such as MFCC mean and variance. Furthermore, the combination of short-term and PHEQ features significantly improves the performance of the whole system.
  • Keywords
    audio signal processing; speech processing; PHEQ; feature distribution; mel frequency cepstrum coefficients; music discrimination; polynomial fit histogram equalization; song discrimination; speech discrimination; Acoustic features; HEQ-based features; audio classification; parameterization; speech/music/song discrimination;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2010.2049877
  • Filename
    5460954