• DocumentCode
    3311051
  • Title

    Speaker identification in the presence of packet losses

  • Author

    Borah, Deva K. ; DeLeon, Phillip

  • Author_Institution
    Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM, USA
  • fYear
    2004
  • fDate
    1-4 Aug. 2004
  • Firstpage
    302
  • Lastpage
    306
  • Abstract
    Gaussian mixture model (GMM)-based speaker identification systems have proved remarkably accurate for large populations using reasonable lengths of high-quality test utterances. Test utterances, however, acquired from cellular telephones or over the Internet (VoIP) may have dropouts due to packet loss. In our research, we have demonstrated that for small packet sizes, these losses can result in degraded accuracy of the speaker identification system. It is shown that by training the GMM model with lossy speech packets, corresponding to the loss rate experienced by the speaker to be identified, significant performance improvement is obtained. In order to avoid the prior estimation of the packet loss rate experienced by the test subject, we propose an algorithm to identify the user based on maximizing the a posteriori probability over the GMM models of the users, trained with several packet loss rates. It is shown that the proposed algorithm provides excellent identification performance.
  • Keywords
    Gaussian distribution; Internet telephony; feature extraction; maximum likelihood estimation; speaker recognition; GMM model a posteriori probability maximization; GMM-based speaker identification systems; Gaussian mixture model; ML parameter estimation; feature extraction; lossy speech packets; packet loss dropouts; packet loss rate models; speech analysis; test utterances; Databases; Decoding; Degradation; Feature extraction; GSM; Internet; Loudspeakers; Performance loss; Speech; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing Workshop, 2004 and the 3rd IEEE Signal Processing Education Workshop. 2004 IEEE 11th
  • Print_ISBN
    0-7803-8434-2
  • Type

    conf

  • DOI
    10.1109/DSPWS.2004.1437963
  • Filename
    1437963