DocumentCode
590858
Title
Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities
Author
Ramanarayanan, V. ; Ghosh, P.K. ; Lammert, Adam ; Narayanan, Shrikanth S.
Author_Institution
Signal Anal. & Interpretation Lab., Univ. of Southern California, Los Angeles, CA, USA
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
1
Lastpage
6
Abstract
We consider the potential for incorporating direct, or inferred, speech production knowledge in speech technology development. We first review the technologies that can be used to capture speech articulation information. We discuss how meaningful (speech and speaker) representations can be derived from articulatory data thus captured and further how they can be estimated from the acoustics in the absence of these direct measurements. We present some applications that have used speech production information to further the state of the art in automatic speech and speaker recognition. We also offer an outlook on how such knowledge and applications can in turn inform scientific understanding of the human speech communication process.
Keywords
speaker recognition; automatic speech-speaker modeling; automatic speech-speaker recognition; human speech communication process; speech articulation information; speech production information; speech production knowledge; speech technology development; Acoustics; Magnetic resonance imaging; Production; Speaker recognition; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location
Hollywood, CA
Print_ISBN
978-1-4673-4863-8
Type
conf
Filename
6412005
Link To Document