Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering

Author

Le Roux, Jonathan ; Kameoka, Hirokazu ; Ono, Nobutaka ; de Cheveigne, Alain ; Sagayama, Shigeki

Author_Institution

Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan; CNRS, UniversitÃ© Paris 5, and Ecole Normale SupÃ©rieure, Paris, France. leroux@hil.t.u-tokyo.ac.jp

fYear

2007

fDate

21-24 Oct. 2007

Firstpage

279

Lastpage

282

Abstract

The design of effective algorithms for single-channel analysis of complex and varied acoustical scenes is a very important and challenging problem. We present here the application of the recently introduced Harmonic-Temporal Clustering (HTC) framework to single channel speech enhancement, background retrieval and speaker separation. HTC processing relies on a precise parametric description of the voiced parts of speech derived from the power spectrum. We explain the positioning of the algorithm inside the Computational Acoustic Scene Analysis (CASA) area, describe the theoretical background of the method, show through preliminary experiments its basic feasibility, and discuss potential improvements.

Keywords

Acoustic applications; Algorithm design and analysis; Auditory system; Clustering algorithms; Layout; Loudspeakers; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on

Conference_Location

New Paltz, NY, USA

Print_ISBN

978-1-4244-1620-2

Electronic_ISBN

978-1-4244-1619-6

Type

conf

DOI

10.1109/ASPAA.2007.4393003

Filename

4393003