Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities

Author

Ramanarayanan, V. ; Ghosh, P.K. ; Lammert, Adam ; Narayanan, Shrikanth S.

Author_Institution

Signal Anal. & Interpretation Lab., Univ. of Southern California, Los Angeles, CA, USA

fYear

2012

fDate

3-6 Dec. 2012

Firstpage

1

Lastpage

6

Abstract

We consider the potential for incorporating direct or inferred speech production knowledge in speech technology development. We first review the technologies that can be used to capture speech articulation information. We then discuss how meaningful speech and speaker representations can be derived from the articulatory data thus captured, and how they can be estimated from the acoustics when such direct measurements are unavailable. We present some applications that have used speech production information to advance the state of the art in automatic speech and speaker recognition. We also offer an outlook on how such knowledge and applications can in turn inform scientific understanding of the human speech communication process.

Keywords

speaker recognition; automatic speech-speaker modeling; automatic speech-speaker recognition; human speech communication process; speech articulation information; speech production information; speech production knowledge; speech technology development; Acoustics; Magnetic resonance imaging; Production; Speaker recognition; Speech; Speech recognition

fLanguage

English

Publisher

IEEE

Conference_Titel

Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific

Conference_Location

Hollywood, CA

Print_ISBN

978-1-4673-4863-8

Type

conf

Filename

6412005