DocumentCode :
730666
Title :
Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification
Author :
Prasad, Abhay ; Periyasamy, Vijitha ; Ghosh, Prasanta Kumar
Author_Institution :
Manipal Inst. of Technol., Manipal, India
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4265
Lastpage :
4269
Abstract :
Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures [1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.
Keywords :
magnetic resonance imaging; signal representation; sound reproduction; speaker recognition; discriminating speaker representation; frame-level speaker identification; invariant estimation; real-time magnetic resonance imaging; rtMRI; speaker identification; speech articulation; speech production; spoken utterance linguistic content; videos; Accuracy; Acoustics; Estimation; Linear programming; Speech; Speech processing; Subspace constraints; invariant gestures; speaker identification; speech articulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178775
Filename :
7178775
Link To Document :
بازگشت