مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech transformations based on a sinusoidal representation

DocumentCode :

2994845

Title :

Speech transformations based on a sinusoidal representation

Author :

Quatieri, Thomas E. ; Mcaulay, R.J.

Author_Institution :

Massachusetts Institute of Technology, Lexington, Massachusetts

Volume :

fYear :

1985

fDate :

31138

Firstpage :

489

Lastpage :

492

Abstract :

This paper presents a new speech analysis/synthesis technique based on a sinusoidal representation of the speech production mechanism but which is independent of pitch and the voiced/unvoiced speech state. The resulting synthetic speech preserves the waveform shape and is essentially perceptually indistinguishable from the original. The method provides the basis for a general class of speech transformations and is successfully applied to time-scale modification, frequency scaling, and scaling of pitch. Furthermore, these modifications can be performed with a time-varying rate of change, allowing, for example, continuous adjustment of a speaker´s fundamental frequency and rate of articulation. Although the analysis/synthesis system was originally designed for single-speaker signals, it is equally capable of recovering and modifying nonspeech signals such as music, multi-speakers, marine biologic sounds, and speech in the presence of interferences such as noise and musical backgrounds.

Keywords :

Frequency; Multiple signal classification; Music; Shape; Signal analysis; Signal design; Signal synthesis; Speech analysis; Speech enhancement; Speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.

Type :

conf

DOI :

10.1109/ICASSP.1985.1168380

Filename :

1168380

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2994845