DocumentCode
2178981
Title
Language-independent constrained cepstral features for speaker recognition
Author
Shriberg, Elizabeth ; Stolcke, Andreas
Author_Institution
SRI Int., Menlo Park, CA, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5296
Lastpage
5299
Abstract
Constrained cepstral systems, which select frames to match various linguistic "constraints" in enrollment and test, have shown significant improvements for speaker verification performance. Past work, however, relied on word recognition, making the approach language dependent (LD). We develop language-independent (LI) versions of constraints and compare results to parallel LD versions for English data on the NIST 2008 interview task. Results indicate that (1) LI versions show surprisingly little degradation from associated LD versions, (2) some LI constraints outperform their LD counterparts, (3) useful constraint types include phonetic, syllable position, prosodic, and speaking-rate regions, (4) benefits generally hold for different train/test lengths, and (5) constraints provide particular benefit in reducing false alarms. Overall, we conclude that constrained cepstral modeling can benefit speaker recognition without the need for language-dependent automatic speech recognition.
Keywords
speaker recognition; NIST; language-dependent automatic speech recognition; language-independent constrained cepstral features; speaker verification; Mel frequency cepstral coefficient; NIST; Pragmatics; Speaker recognition; Speech; Training; cepstral constraints; language-independent phone recognition; speaker verification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947553
Filename
5947553
Link To Document