DocumentCode
2272041
Title
Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering
Author
Le Roux, Jonathan ; Kameoka, Hirokazu ; Ono, Nobutaka ; de Cheveigne, Alain ; Sagayama, Shigeki
Author_Institution
Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan; CNRS, Université Paris 5, and Ecole Normale Supérieure, Paris, France. leroux@hil.t.u-tokyo.ac.jp
fYear
2007
fDate
21-24 Oct. 2007
Firstpage
279
Lastpage
282
Abstract
The design of effective algorithms for single-channel analysis of complex and varied acoustical scenes is a very important and challenging problem. We present here the application of the recently introduced Harmonic-Temporal Clustering (HTC) framework to single channel speech enhancement, background retrieval and speaker separation. HTC processing relies on a precise parametric description of the voiced parts of speech derived from the power spectrum. We explain the positioning of the algorithm inside the Computational Acoustic Scene Analysis (CASA) area, describe the theoretical background of the method, show through preliminary experiments its basic feasibility, and discuss potential improvements.
Keywords
Acoustic applications; Algorithm design and analysis; Auditory system; Clustering algorithms; Layout; Loudspeakers; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications of Signal Processing to Audio and Acoustics, 2007 IEEE Workshop on
Conference_Location
New Paltz, NY, USA
Print_ISBN
978-1-4244-1620-2
Electronic_ISBN
978-1-4244-1619-6
Type
conf
DOI
10.1109/ASPAA.2007.4393003
Filename
4393003
Link To Document