DocumentCode :
134278
Title :
Research on truncated speech in speaker verification
Author :
Fanhu Bie ; Dong Wang ; Zheng, Thomas Fang
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
425
Lastpage :
425
Abstract :
Summary form only given. Speech truncation is a common problem in practical speaker recognition systems. When speech is truncated in amplitude, its spectrum changes, which degrades system performance. The paper reports observations and conclusions on the impact of truncated segments, studies why they affect recognition performance, and presents ways to detect truncated segments and to reduce the performance loss. Simulations on NIST SRE08 show that performance drops sharply only when the amplitude truncation ratio is high (more than 80% of the maximum amplitude); the traditional GMM-UBM system and the i-vector system behave similarly when the amplitude truncation ratio is low, while the i-vector system is more robust when it is high. The paper proposes a method for detecting truncated segments based on subspace discriminant information, which is then used to discard those segments. The experiments show that this method detects truncated segments well. However, the results also show that truncated segments still carry speaker-discriminant information: when the amplitude truncation ratio is low, it is better to keep the data to sustain performance; otherwise, the speaker should make another recording to maintain system performance.
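As a rough illustration of amplitude truncation as described in the abstract, the sketch below hard-clips a synthetic sine wave and measures how many samples sit at the peak level. It is not the paper's method: the function names `truncate` and `clipped_fraction`, the `keep_ratio` parameter (clip level as a fraction of the peak amplitude), and the peak-counting heuristic are all assumptions made for this example, and the heuristic is a simple stand-in for the subspace-based detector the paper proposes.

```python
import math

def truncate(samples, keep_ratio):
    """Hard-clip samples to +/- keep_ratio * peak amplitude.

    keep_ratio is a hypothetical parameter for this sketch; the paper's
    exact definition of the truncation ratio may differ.
    """
    peak = max(abs(s) for s in samples)
    limit = keep_ratio * peak
    return [max(-limit, min(limit, s)) for s in samples]

def clipped_fraction(samples, tol=1e-9):
    """Fraction of samples whose magnitude sits at the peak level.

    A naive clipping indicator (an assumption of this sketch), not the
    subspace discriminant detector from the paper.
    """
    peak = max(abs(s) for s in samples)
    if peak == 0:
        return 0.0
    hits = sum(1 for s in samples if abs(s) >= peak - tol)
    return hits / len(samples)

# Synthetic test signal: 1 kHz sine sampled at 16 kHz (0.1 s).
sine = [math.sin(2 * math.pi * 1000 * n / 16000) for n in range(1600)]
clipped = truncate(sine, keep_ratio=0.5)

print("fraction at peak, clean:  ", clipped_fraction(sine))
print("fraction at peak, clipped:", clipped_fraction(clipped))
```

A clean signal touches its peak only briefly, while a clipped signal spends a large share of samples at the clip level, so the fraction rises as truncation becomes more severe. Flattening the waveform in this way also introduces harmonics, which is consistent with the spectral change the abstract mentions.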
Keywords :
speaker recognition; GMM-UBM system; I-vector system behavior; amplitude truncated ratio; amplitude truncating ratio; speaker discriminant information; speaker recognition system; speaker verification; speech truncating phenomenon; subspace discriminant information; truncated segments detection; truncated speech; truncating segments detection; Educational institutions; Laboratories; Proposals; Speaker recognition; Speech; Speech recognition; Technological innovation; i-vector; speaker recognition; truncated speech;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936671
Filename :
6936671