مرکز منطقه ای اطلاع رساني علوم و فناوري - An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech

DocumentCode :

2798666

Title :

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech

Author :

Vishnubhotla, Srikanth ; Fernandez, Raul ; Ramabhadran, Bhuvana

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Maryland, College Park, MD, USA

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4614

Lastpage :

4617

Abstract :

HMM-TTS synthesis is a popular approach toward flexible, low-footprint, data driven systems that produce highly intelligible speech. In spite of these strengths, speech generated by these systems exhibit some degradation in quality, attributable to an inadequacy in modeling the excitation signal that drives the parametric models of the vocal tract. This paper proposes a novel method for modeling the excitation as a low-dimensional set of coefficients, based on a non-linear map learned through an autoencoder. Through analysis-and-resynthesis experiments, and a formal listening test, we show that this model produces speech of higher perceptual quality compared to conventional pulse-excited speech signals at the p <; 0.01 significance level.

Keywords :

neural nets; speech processing; HMM based text-to-speech; autoencoder neural-network; data driven systems; excitation modeling; highly intelligible speech; low-dimensionality approach; Cepstral analysis; Hidden Markov models; Matched filters; Neural networks; Runtime; Signal analysis; Signal generators; Signal synthesis; Speech analysis; Speech synthesis; Hidden Markov Models; autoencoders; excitation modeling; neural networks; speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495546

Filename :

5495546

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2798666