• DocumentCode
    2272041
  • Title

    Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering

  • Author

    Le Roux, Jonathan ; Kameoka, Hirokazu ; Ono, Nobutaka ; de Cheveigne, Alain ; Sagayama, Shigeki

  • Author_Institution
    Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan; CNRS, Université Paris 5, and Ecole Normale Supérieure, Paris, France. leroux@hil.t.u-tokyo.ac.jp
  • fYear
    2007
  • fDate
    21-24 Oct. 2007
  • Firstpage
    279
  • Lastpage
    282
  • Abstract
    The design of effective algorithms for single-channel analysis of complex and varied acoustical scenes is a very important and challenging problem. We present here the application of the recently introduced Harmonic-Temporal Clustering (HTC) framework to single channel speech enhancement, background retrieval and speaker separation. HTC processing relies on a precise parametric description of the voiced parts of speech derived from the power spectrum. We explain the positioning of the algorithm inside the Computational Acoustic Scene Analysis (CASA) area, describe the theoretical background of the method, show through preliminary experiments its basic feasibility, and discuss potential improvements.
  • Keywords
    Acoustic applications; Algorithm design and analysis; Auditory system; Clustering algorithms; Layout; Loudspeakers; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on
  • Conference_Location
    New Paltz, NY, USA
  • Print_ISBN
    978-1-4244-1620-2
  • Electronic_ISBN
    978-1-4244-1619-6
  • Type

    conf

  • DOI
    10.1109/ASPAA.2007.4393003
  • Filename
    4393003