DocumentCode
3652271
Title
Efficient representation of short-time phase based on group delay
Author
H. Banno; Jinlin Lu;S. Nakamura;K. Shikano;H. Kawahara
Author_Institution
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume
2
fYear
1998
Firstpage
861
Abstract
An efficient representation of short-time phase characteristics of speech sounds is proposed, based on findings which suggest the perceptual importance of phase characteristics. Subjective tests indicated that the synthesized speech sounds by the proposed method are indistinguishable from the original speech sounds with a moderate data compression. The proposed representation uses lower-order coefficients of the inverse Fourier transform of the group delay of speech. It also alleviates the voiced/unvoiced decision, which is an indispensable part in conventional speech coding algorithms. These features make our method potentially very useful in many applications like speech morphing.
Keywords
"Speech synthesis","Speech coding","Linear predictive coding","Speech analysis","Delay estimation","Bit rate","Delay effects","Information science","Acoustical engineering","Systems engineering and theory"
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675401
Filename
675401
Link To Document