• DocumentCode
    3123911
  • Title

    Discriminant local information distance preserving projection for text-independent speaker recognition

  • Author

    Liang He ; Jia Li

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    349
  • Lastpage
    352
  • Abstract
    A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated statistical models on a statistical manifold. Fisher information distance is applied to compute distance between adjacent PDFs. Discriminant local preserving projection is used to push adjacent PDFs which belong to different classes apart to further improve the recognition accuracy. Experiments were carried out on the NIST SRE08 tel-tel database. Our presented method gave an excellent performance.
  • Keywords
    feature extraction; probability; signal classification; speaker recognition; statistical analysis; Fisher information distance; NIST SRE08 tel-tel database; PDF; discriminant local information distance preserving projection; feature extraction; probability density function; sequence classification problem; statistical manifold; text-independent speaker recognition; Cepstral analysis; Databases; Manifolds; NIST; Probability density function; Speaker recognition; Vectors; Fisher information; discriminant local preserving projection; information geometry; text-independent speaker recognition; total variability model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423466
  • Filename
    6423466