• DocumentCode
    2045385
  • Title

    Unsupervised distributional anomaly detection for a self-diagnostic speech activity detector

  • Author

    Borges, Nash ; Meyer, Gerard G L

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD
  • fYear
    2008
  • fDate
    19-21 March 2008
  • Firstpage
    950
  • Lastpage
    955
  • Abstract
    One feature that classification algorithms typically lack is the ability to know what they do not know. With this knowledge an algorithm would be able to operate in any domain and only produce results when it is confident that data is within nominal conditions. Otherwise, it could generate warning messages or request more appropriate training material. We present an unsupervised approach capable of working in concert with an existing classifier to detect off-nominal conditions by estimating the divergence between the distribution of input features and a nominal world model. Using a measure of parametric divergence for a mixture of Gaussians and two different estimates for the Kullback-Leibler divergence, we significantly outperform the baseline average log probability thresholding to distinguish nominal conversational audio from a variety of structured noises and incorrectly decoded audio using features from a speech activity detector.
  • Keywords
    Gaussian processes; speech processing; Kullback-Leibler divergence; baseline average log probability thresholding; classification algorithms; nominal conversational audio; self-diagnostic speech activity detector; unsupervised distributional anomaly detection; Classification algorithms; Clustering algorithms; Detectors; Intrusion detection; Natural languages; Noise measurement; Robustness; Speech processing; Training data; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Sciences and Systems, 2008. CISS 2008. 42nd Annual Conference on
  • Conference_Location
    Princeton, NJ
  • Print_ISBN
    978-1-4244-2246-3
  • Electronic_ISBN
    978-1-4244-2247-0
  • Type

    conf

  • DOI
    10.1109/CISS.2008.4558655
  • Filename
    4558655