Title :
High quality voice manipulation method based on the vocal tract area function obtained from sub-band LSP of STRAIGHT spectrum
Author :
Arakawa, Ayanori ; Uchimura, Yoshinori ; Banno, Hideki ; Itakura, Fumitada ; Kawahara, Hideki
Author_Institution :
Grad. Sch. of Sci. & Technol., Meijo Univ., Nagoya, Japan
Abstract :
This paper describes a high-quality method for manipulating voice quality based on the vocal tract area function (VTAF) obtained from sub-band LSP of the STRAIGHT spectrum. Our research group previously developed a VTAF-based technique for manipulating voice quality that can generate natural formant transitions. However, the generated sound is sometimes degraded when the input signal has a high sampling frequency. We therefore develop a new method that extracts the VTAF properly from such input signals. The method first divides the input spectral envelope, represented by the STRAIGHT spectrum, into lower and higher frequency bands; second, extracts the line spectrum pair (LSP) in each frequency band after spectral flattening appropriate for that band; third, concatenates the pair of sub-band LSPs; and finally obtains the VTAF from PARCOR coefficients converted from the concatenated LSP. A subjective experiment showed that the proposed method achieves sufficiently high quality.
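The last stage of the pipeline in the abstract (concatenated LSP to PARCOR coefficients to VTAF) can be illustrated with a minimal sketch. This is not the authors' implementation: the sub-band split and the spectral flattening are omitted, and the function names, the even-order LSP assumption, the sign convention for the PARCOR coefficients, and the direction in which the area ratio is accumulated are all assumptions made for illustration.

```python
import math


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists in powers of z^-1."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def lsp_to_lpc(lsp):
    """Rebuild A(z) = 1 + a1 z^-1 + ... from LSP frequencies.

    Assumes an even number of frequencies in radians, sorted ascending in
    (0, pi), with the lowest one belonging to the symmetric polynomial P(z).
    """
    p_poly, q_poly = [1.0, 1.0], [1.0, -1.0]  # trivial roots at z = -1 and z = +1
    for i, w in enumerate(lsp):
        factor = [1.0, -2.0 * math.cos(w), 1.0]
        if i % 2 == 0:
            p_poly = poly_mul(p_poly, factor)
        else:
            q_poly = poly_mul(q_poly, factor)
    # A(z) = (P(z) + Q(z)) / 2; the highest-order term cancels and is dropped.
    return [(p + q) / 2.0 for p, q in zip(p_poly, q_poly)][:-1]


def lpc_to_parcor(a):
    """Step-down recursion; assumes a stable filter, so every |k_m| < 1."""
    a = list(a)
    order = len(a) - 1
    k = [0.0] * order
    for m in range(order, 0, -1):
        km = a[m]
        k[m - 1] = km
        denom = 1.0 - km * km
        a = [1.0] + [(a[i] - km * a[m - i]) / denom for i in range(1, m)]
    return k


def parcor_to_area(k, first_area=1.0):
    """Accumulate section areas with the assumed ratio (1 - k) / (1 + k)."""
    areas = [first_area]
    for km in k:
        areas.append(areas[-1] * (1.0 - km) / (1.0 + km))
    return areas


# Second-order example: A(z) = 1 - 0.9 z^-1 + 0.2 z^-2
lsp = [math.acos(0.85), math.acos(0.05)]
lpc = lsp_to_lpc(lsp)
parcor = lpc_to_parcor(lpc)
areas = parcor_to_area(parcor)
```

Other texts use the opposite sign for the reflection coefficients or accumulate areas in the opposite direction along the tract, which inverts the ratio; the choice made here is only one common convention.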
Keywords :
speech synthesis; PARCOR coefficients; STRAIGHT spectrum; VTAF; high quality voice manipulation method; high sampling frequency; line spectrum pair; natural formant transition; sub-band LSP; vocal tract area function; Adaptive filters; Degradation; Frequency conversion; Human voice; Linear predictive coding; Sampling methods; Shape; Signal design; Signal generators; Speech analysis; Speech synthesis; Vocal system; Vocoders;
Conference_Title :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495142