مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning Virtual HD Model for Bi-model Emotional Speaker Recognition

DocumentCode :

2480995

Title :

Learning Virtual HD Model for Bi-model Emotional Speaker Recognition

Author :

Huang, Ting ; Yang, Yingchun

Author_Institution :

Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China

fYear :

2010

fDate :

23-26 Aug. 2010

Firstpage :

1614

Lastpage :

1617

Abstract :

Pitch mismatch between training and testing is one of the important factors causing the performance degradation of the speaker recognition system. In this paper, we adopted the missing feature theory and specified the Unreliable Region (UR) as the parts of the utterance with high emotion induced pitch variation. To model these regions, a virtual HD (High Different from neutral, with large pitch offset) model for each target speaker was built from the virtual speech, which were converted from the neutral speech by the Pitch Transformation Algorithm (PTA). In the PTA, a polynomial transformation function was learned to model the relationship of the average pitch between the neutral and the high-pitched utterances. Compared with traditional GMM-UBM and our previous method, our new method obtained 1.88% and 0.84% identification rate (IR) increase on the MASC respectively, which are promising results.

Keywords :

emotion recognition; polynomials; speaker recognition; GMM-UBM; MASC; bimodel emotional speaker recognition; high emotion induced pitch variation; performance degradation; pitch mismatch; pitch transformation algorithm; polynomial transformation function; virtual HD model learning; High definition video; Mel frequency cepstral coefficient; Polynomials; Speaker recognition; Speech; Strontium; Training; emotional speaker recognition; missing feature theory; pitch mismatch; virtual HD model;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Pattern Recognition (ICPR), 2010 20th International Conference on

Conference_Location :

Istanbul

ISSN :

1051-4651

Print_ISBN :

978-1-4244-7542-1

Type :

conf

DOI :

10.1109/ICPR.2010.399

Filename :

5595957

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2480995