DocumentCode
2175279
Title
Accurate parameter generation using fixed-point arithmetic for embedded HMM-based speech synthesizers
Author
Nishizawa, Nobuyuki ; Kato, Tsuneo
Author_Institution
KDDI R&D Labs. Inc., Saitama, Japan
fYear
2011
fDate
22-27 May 2011
Firstpage
4696
Lastpage
4699
Abstract
Parameter trajectory generation for HMM-based speech synthesis is practically achieved using only fixed-point arithmetic with 32-bit integers. Since processors for embedded devices often provide no hardware-based floating-point number processor, a speech synthesizer using only fixed-point arithmetic is necessary for such devices. In this study, a new method to reduce rounding errors is introduced, as well as optimizing value scaling, and the generation of Fo trajectory is discussed. The experimental results indicated that RMSE in a logarithmic scale of Fo can be reduced down to approximately 0.04 semitones (1 semitone = 1/12 octaves) by the proposed method even where a 2-bit margin was arranged to avoid calculation overflow. An extension for trajectories considering the global variance (GV) using the basic program for trajectories without consideration of GV is also introduced. The extension method reduces required iteration counts to 5 for 0.05-semitone RMSE comparable to the converged results of the conventional method.
Keywords
fixed point arithmetic; floating point arithmetic; hidden Markov models; speech synthesis; GV; RMSE; embedded HMM-based speech synthesizers; fixed-point arithmetic; global variance; hardware-based floating-point number processor; parameter trajectory generation; word length 32 bit; Computational efficiency; Equations; Hidden Markov models; Mathematical model; Speech; Speech synthesis; Trajectory; HMM-based speech synthesis; embedded devices; fixed-point arithmetic; global variance;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947403
Filename
5947403
Link To Document