DocumentCode
2403579
Title
Time-frequency processing of partials for high-quality speech synthesis
Author
Ciobanu, Amelia ; Negrescu, Cristian ; Stanomir, Dumitru ; Burileanu, Dragos
Author_Institution
Telecommun. & Inf. Technol., Univ. Politeh. of Bucharest, Bucharest, Romania
fYear
2009
fDate
18-21 June 2009
Firstpage
1
Lastpage
6
Abstract
Based on the particularities offered by the chosen signal model, we introduce a novel approach regarding the chain of actions pursued in the analysis stage of the speech signal, which succeeds the level of partial extraction. According to the harmonic plus noise model (HNM) a number of successive estimation and synthesis operations are performed. The present paper proposes a method to enhance the harmonic parameters estimation. This new algorithm proves to have good behavior offering support in selecting an appropriate subset of partials. In addition to reducing the arithmetic complexity of the harmonic synthesis (which is known to be the most resource consuming module), this optimized selection of the partials allows us to perform a specific frequency correction, which enables the possibility of a simple and coherent future pitch manipulation. The performed experiments confirmed the expected complexity reduction. Moreover, we applied the proposed algorithm for partial selection and tracking followed by a frequency aligning of the harmonic components. The reconstructed signal compared to the original speech proved to be a perceptually indistinguishable replica.
Keywords
feature extraction; parameter estimation; speech enhancement; speech synthesis; time-frequency analysis; tracking; frequency alignment; harmonic plus noise model; parameters estimation; partial extraction; speech enhancement; speech synthesis; time-frequency processing; tracking procedure; Frequency estimation; Information technology; Signal analysis; Signal processing; Signal synthesis; Speech analysis; Speech enhancement; Speech processing; Speech synthesis; Time frequency analysis; HNM model; partial tracking; speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech Technology and Human-Computer Dialogue, 2009. SpeD '09. Proceedings of the 5-th Conference on
Conference_Location
Constant
Print_ISBN
978-1-4244-4727-5
Type
conf
DOI
10.1109/SPED.2009.5156175
Filename
5156175
Link To Document