DocumentCode
3631367
Title
From acoustics to Vocal Tract time functions
Author
Vikramjit Mitra;Yucel Ozbek;Hosung Nam;Xinhui Zhou;Carol Y. Espy-Wilson
Author_Institution
Department of Electrical and Computer Engineering, University of Maryland, College Park, USA
fYear
2009
Firstpage
4497
Lastpage
4500
Abstract
In this paper, we present a technique for obtaining Vocal Tract (VT) time functions from the acoustic speech signal. Knowledge-based Acoustic Parameters (APs) are extracted from the speech signal, and a pertinent subset is used to obtain the mapping between the APs and the VT time functions. Eight vocal tract constriction variables were considered in this study: five constriction degree variables, lip aperture (LA), tongue body (TBCD), tongue tip (TTCD), velum (VEL), and glottis (GLO); and three constriction location variables, lip protrusion (LP), tongue tip (TTCL), and tongue body (TBCL). The TAsk Dynamics Application model (TADA [1]) is used to create a synthetic speech dataset along with its corresponding VT time functions. We explore Support Vector Regression (SVR) followed by Kalman smoothing to map the APs to the VT time functions.
Keywords
"Acoustics","Speech recognition","Tongue","Automatic speech recognition","Speech synthesis","Stress","Natural languages","Databases","Noise measurement","Pollution measurement"
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
2379-190X
Type
conf
DOI
10.1109/ICASSP.2009.4960629
Filename
4960629