مرکز منطقه ای اطلاع رساني علوم و فناوري - Research on truncated speech in speaker verification

DocumentCode :

134278

Title :

Research on truncated speech in speaker verification

Author :

Fanhu Bie ; Dong Wang ; Zheng, Thomas Fang

Author_Institution :

Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China

fYear :

2014

fDate :

12-14 Sept. 2014

Firstpage :

425

Lastpage :

425

Abstract :

Summary form only given. The speech truncating phenomenon is a general problem is practical speaker recognition system. After the speech was truncated by amplitude, the spectral was changed during the process, resulting in the decreasing in the system`s performance. The paper describes the observation and the conclusion on the impact of the truncated segments, studies the reason of the impact on the recognition performance, gives out the ways of the truncated segments detection and reducing the decreasing of the performance. The simulation on NIST SRE08 shows that, just when the amplitude truncating ratio remains high (more than the 80% of the maximum amplitude), the performance drops sharply; the performance of traditional GMM-UBM system and I-vector system behavior familiar when the amplitude truncating is low, while I-vector gives a better robustness when is high. The paper gives out a proposal on truncating segments detection based on subspace discriminant information, which is then used to discard the truncating segments. The experiments show that this proposal could well detect the truncated segments. However, the results show that there are still speaker discriminant information in the truncated segments, when the amplitude truncated ratio remains low, it´s better to remain the data to sustain the performance, otherwise, the speaker should take another recording to keep the system performance.

Keywords :

speaker recognition; GMM-UBM system; I-vector system behavior; amplitude truncated ratio; amplitude truncating ratio; speaker discriminant information; speaker recognition system; speaker verification; speech truncating phenomenon; subspace discriminant information; truncated segments detection; truncated speech; truncating segments detection; Educational institutions; Laboratories; Proposals; Speaker recognition; Speech; Speech recognition; Technological innovation; i-vector; speaker recognition; truncated speech;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on

Conference_Location :

Singapore

Type :

conf

DOI :

10.1109/ISCSLP.2014.6936671

Filename :

6936671

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=134278