DocumentCode
2701449
Title
Development of a Femininity Estimator using Speaker Recognition Techniques for Voice Therapy of Gender Identity Disorder Clients
Author
Minematsu, Nobuaki ; Maruyama, Kazunori ; Sakuraba, Kyoko ; Hirose, Keikichi ; Tayama, N. ; Imaizumi, Shoko ; Yamauch, T.
Author_Institution
Tokyo Univ., Japan
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper describes the development of an estimator of the perceptual femininity (PF) of an input utterance using speaker recognition techniques. The estimator was designed for clinical use, and the target speakers are gender identity disorder (GID) clients, especially MtF (male to female) transsexuals. Voice therapy for MtFs is composed of three kinds of training: 1) raising the baseline F0 range, 2) changing the baseline voice quality, and 3) enhancing F0 dynamics to produce an exaggerated intonation pattern. The first two focus on static acoustic properties of speech. Voice quality is mainly controlled by the size and shape of the articulators, which can be acoustically characterized by the spectral envelope. Gaussian mixture models (GMMs) of F0 values and of spectra were built separately for biologically male and female speakers. Using the four models, PF was estimated automatically for each of 142 utterances of 111 MtFs. The estimated values were compared with the PF values obtained through listening tests. Results showed very high correlation (R=0.86), which is comparable to the intra-rater correlation.
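The four-model setup described in the abstract can be illustrated with a minimal sketch. It assumes diagonal-covariance GMMs and an equal-weight sum of the F0 and spectral log-likelihood ratios. The abstract does not state how the four models are combined, so `femininity_score`, its weights, and all toy model parameters below are hypothetical and are not the authors' method.

```python
import math


def gmm_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            log_p = math.log(w)
            for xd, md, vd in zip(x, mu, var):
                log_p += -0.5 * (math.log(2 * math.pi * vd) + (xd - md) ** 2 / vd)
            comp_logs.append(log_p)
        # Log-sum-exp over mixture components for numerical stability.
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)


def femininity_score(f0_frames, spec_frames, models, w_f0=0.5, w_spec=0.5):
    """Hypothetical combination: weighted sum of female-vs-male log-likelihood ratios."""
    llr_f0 = (gmm_log_likelihood(f0_frames, *models["f0_female"])
              - gmm_log_likelihood(f0_frames, *models["f0_male"]))
    llr_spec = (gmm_log_likelihood(spec_frames, *models["spec_female"])
                - gmm_log_likelihood(spec_frames, *models["spec_male"]))
    return w_f0 * llr_f0 + w_spec * llr_spec


# Toy parameters (weights, means, variances), invented purely for illustration.
MODELS = {
    "f0_female": ([0.6, 0.4], [[200.0], [240.0]], [[400.0], [400.0]]),
    "f0_male": ([0.5, 0.5], [[110.0], [140.0]], [[225.0], [225.0]]),
    "spec_female": ([1.0], [[1.0, -0.5]], [[1.0, 1.0]]),
    "spec_male": ([1.0], [[-1.0, 0.5]], [[1.0, 1.0]]),
}

high = femininity_score([[210.0], [225.0], [230.0]], [[0.8, -0.4]], MODELS)
low = femininity_score([[115.0], [130.0]], [[-0.9, 0.6]], MODELS)
print(high > 0, low < 0)
```

In practice the GMM parameters would be trained on male and female speech, and the mapping from likelihood ratios to a PF value would be fitted against listener ratings. Both steps are omitted here.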
Keywords
Gaussian processes; speaker recognition; speech processing; Gaussian mixture models; exaggerated intonation pattern; femininity estimator; gender identity disorder; gender identity disorder clients; input utterance; intra-rater correlation; male to female transsexuals; perceptual femininity; speaker recognition techniques; spectral envelope; speech static acoustic properties; voice therapy; Automatic control; Biochemistry; Biological system modeling; Medical treatment; Shape control; Size control; Speaker recognition; Speech; Surgery; Testing; Femininity; GID; GMM; speaker recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366908
Filename
4218096