مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech transformations based on a sinusoidal representation

DocumentCode :

1110009

Title :

Speech transformations based on a sinusoidal representation

Author :

Quatieri, Thomas F. ; McAulay, Robert J.

Author_Institution :

Massachusetts Institute of Technology, Lexington, MA

Volume :

Issue :

fYear :

1986

fDate :

12/1/1986 12:00:00 AM

Firstpage :

1449

Lastpage :

1464

Abstract :

In this paper a new speech analysis/synthesis technique is presented which provides the basis for a general class of speech transformations including time-scale modification, frequency scaling, and pitch modification. These modifications can be performed with a time-varying change, permitting continuous adjustment of a speaker´s fundamental frequency and rate of articulation. The method is based on a sinusoidal representation of the speech production mechanism which has been shown to produce synthetic speech that preserves the wave-form shape and is perceptually indistinguishable from the original. Although the analysis/synthesis system was originally designed for single-speaker signals, it is also capable of recovering and modifying nonspeech signals such as music, multiple speakers, marine biologic sounds, and speakers in the presence of interferences such as noise and musical backgrounds.

Keywords :

Frequency synthesizers; Loudspeakers; Multiple signal classification; Music; Shape; Signal analysis; Signal design; Signal synthesis; Speech analysis; Speech synthesis;

fLanguage :

English

Journal_Title :

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

0096-3518

Type :

jour

DOI :

10.1109/TASSP.1986.1164985

Filename :

1164985

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1110009