Title :
A warped time-frequency expansion for speech signal representation
Author :
Silsbee, Peter L. ; Zahorian, Stephen A. ; Nossair, Zaki B.
Author_Institution :
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Abstract :
A novel representation for speech signals is proposed. The time-varying frequency content of a speech segment is represented as a weighted sum of two-dimensional basis vectors; these incorporate both frequency warping and frequency-dependent time warping. The representation is quite flexible: for example, an arbitrary time or frequency warping function can easily be implemented, and any time-frequency representation can be used as the starting point. Examples are presented that demonstrate desirable characteristics of the representation: (1) explicit quantification of parameter trajectories, (2) time resolution that varies with respect to time and frequency, and (3) the ability to reconstruct a time-frequency plot that reflects the resolution characteristics of the representation.
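The expansion described in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's basis functions or warping functions, so everything specific here is an assumption made for illustration: the cosine form of the basis, the logarithmic frequency warp, the power-law time warp whose exponent depends on frequency, the least-squares fit, and all function names (`freq_warp`, `time_warp`, `warped_basis`, `expand`, `reconstruct`).

```python
import numpy as np

def freq_warp(F, a=5.0):
    # Assumed log-like frequency warp mapping [0, 1] onto [0, 1].
    return np.log1p(a * F) / np.log1p(a)

def time_warp(T, F):
    # Assumed frequency-dependent time warp mapping [0, 1] onto [0, 1].
    # The exponent p shrinks as frequency grows, so the warp differs per frequency row.
    u = 2.0 * T - 1.0
    p = 1.0 / (1.0 + F)
    return 0.5 * (1.0 + np.sign(u) * np.abs(u) ** p)

def warped_basis(n_freq, n_time, n_fc, n_tc):
    # Each row is one flattened two-dimensional basis vector:
    # cos(pi * i * g(f)) * cos(pi * j * h(t, f)).
    f = np.linspace(0.0, 1.0, n_freq)
    t = np.linspace(0.0, 1.0, n_time)
    F, T = np.meshgrid(f, t, indexing="ij")
    g = freq_warp(F)
    h = time_warp(T, F)
    rows = []
    for i in range(n_fc):
        for j in range(n_tc):
            rows.append((np.cos(np.pi * i * g) * np.cos(np.pi * j * h)).ravel())
    return np.array(rows)

def expand(tf, basis):
    # Least-squares weights so that the weighted sum of basis vectors approximates tf.
    coeffs, *_ = np.linalg.lstsq(basis.T, tf.ravel(), rcond=None)
    return coeffs

def reconstruct(coeffs, basis, shape):
    # Weighted sum of basis vectors, reshaped back to a (frequency, time) grid.
    return (coeffs @ basis).reshape(shape)

# Synthetic check: a time-frequency surface built from known weights.
n_freq, n_time, n_fc, n_tc = 32, 40, 4, 5
basis = warped_basis(n_freq, n_time, n_fc, n_tc)
rng = np.random.default_rng(0)
true_coeffs = rng.normal(size=n_fc * n_tc)
tf = (true_coeffs @ basis).reshape(n_freq, n_time)

coeffs = expand(tf, basis)
recon = reconstruct(coeffs, basis, tf.shape)
```

In this sketch `tf` stands in for any time-frequency representation of a speech segment (for instance a log-magnitude spectrogram), and the reconstruction from a small number of weights is a smoothed surface whose resolution is set by the chosen warps.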
Keywords :
speech processing; time-frequency analysis; explicit quantification; frequency warping; frequency-dependent time warping; parameter trajectories; reconstruction; speech segment; speech signal representation; time resolution; time-frequency plot; time-varying frequency content; two-dimensional basis vectors; warped time-frequency expansion; weighted sum; Cepstral analysis; Finite impulse response filter; Kernel; Neural networks; Recurrent neural networks; Signal representations; Signal resolution; Speech analysis; Speech recognition; Time frequency analysis
Conference_Title :
Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis, 1994
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-2127-8
DOI :
10.1109/TFSA.1994.467271