Title :
Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech
Author :
Bush, Brian O. ; Kain, Alexander
Author_Institution :
Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, USA
Abstract :
We present a data-driven formant model and a methodology for discovering its parameters, namely phoneme targets and coarticulation functions, for consonant-vowel-consonant (CVC) words from fully automatic formant data. The model uses formant targets that are speaker dependent but independent of speaking style and phonemic context. We used a global error measure to search for optimal formant targets for all phonemes, including classes of sounds where formants are not directly observable. Analysis of the coarticulation parameters found significant differences between clear and conversational speech. Estimated formant targets were largely in agreement with acoustic-phonetic expectations. An intelligibility test validated that CVC words resynthesized using modeled formant trajectories were nearly as intelligible as CVC words resynthesized using observed formant trajectories.
Keywords :
speaker recognition; speech intelligibility; speech processing; CVC words; acoustic-phonetic expectations; clear speech; coarticulation functions; coarticulation parameters; consonant-vowel-consonant words; conversational speech; data-driven formant; formant trajectories; fully-automatic formant data; intelligibility test; optimal formant targets; phoneme formant targets; phonemic context; speaker dependent; speaking style; Abstracts; Digital TV; ISO standards; Speech; coarticulation; formants
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639226