Title :
A Pitch Synchronous Method for Speech Modification
Author :
Kuo, Chih-Ting ; Wang, Hsiao-Chuan
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu, China
Abstract :
The speech modification is a mechanism of changing speech characteristics and prosody for some specific applications. It is used in voice conversion, pronunciation correction, tone perception, and language learning. The most important part is the change of pitch in an utterance. Pitch extraction is an essential process for speech modification. This paper presents an efficient pitch extraction algorithm based on the normalized second standard deviation function (NSSDF) of magnitude difference. A pitch synchronous method for modifying speaking rate and pitch trajectory is proposed. The speaking rate is modified by inserting or deleting pitch periods in voiced segments. The pitch trajectory change is accomplished by modifying the pitch period of residual signal obtained from pitch synchronous linear prediction (LP) analysis and reconstructing speech signal by LP filter. A speech modification system is developed for Mandarin perception which is used to help hearing impaired students in pronunciation learning.
Keywords :
feature extraction; prediction theory; signal reconstruction; speech processing; LP filter; Mandarin perception; NSSDF; language learning; normalized second standard deviation function; pitch extraction; pitch synchronous linear prediction analysis; pronunciation correction; speech modification mechanism; speech reconstruction; tone perception; voice conversion; Auditory system; Autocorrelation; Data mining; Equations; Frequency estimation; Nonlinear filters; Signal analysis; Speech analysis; Speech processing; Trajectory;
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
DOI :
10.1109/CHINSL.2008.ECP.73