• DocumentCode
    3373827
  • Title

    Ideal GMM parameters & posterior log likelihood for speaker verification

  • Author

    Bhattacharyya, S. ; Srikanthan, T. ; Krishnamurthy, Pramod

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    471
  • Lastpage
    480
  • Abstract
    Personal Identity Verification using non-alterable bio-characters is fast becoming a standard add-on layer of security for access to sensitive information. This paper presents a performance centric study of our text-independent speaker verification system using the highly parametric Gaussian Mixture Modeling [GMM] technique on the KING Speaker Verification database. The system was evaluated based on Equal Error Rate [EER] scores using different parameters - namely, the number of mixtures [M] and dimensions [D], applied to the Gaussian Mixture Model [GMM]. A new scoring method, in the quest for better performance in terms of EER is also discussed. These techniques aim to pre-determine the optimum values for M and D, and apply a scoring technique that provides optimum tradeoff between complexity and performance. Simulation results obtained confirm the implementation of speaker verification algorithms with minimal real-time adaptability requirements. This aids the development of more robust and predictive voice based authentication applications. Finally, we demonstrate that the proposed Posterior Log-Likelihood based scoring does not provide significant performance gains over log-likelihood based scoring techniques
  • Keywords
    Gaussian processes; access control; authorisation; computational complexity; speaker recognition; Gaussian mixture modeling; complexity; personal identity verification; posterior log-likelihood; real-time adaptability requirements; speaker verification; voice based authentication; Authentication; Clustering algorithms; Data security; Embedded computing; Embedded system; Feature extraction; High performance computing; Poles and towers; Robustness; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks for Signal Processing XI, 2001. Proceedings of the 2001 IEEE Signal Processing Society Workshop
  • Conference_Location
    North Falmouth, MA
  • ISSN
    1089-3555
  • Print_ISBN
    0-7803-7196-8
  • Type

    conf

  • DOI
    10.1109/NNSP.2001.943151
  • Filename
    943151