Title :
Universal outlier hypothesis testing: Application to anomaly detection
Author :
Yun Li ; Nitinawarat, Sirin ; Yu Su ; Veeravalli, Venugopal V.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Champaign, IL, USA
Abstract :
In outlier hypothesis testing, multiple observation sequences are collected, a small subset of which are outliers. Observations in an outlier sequence are generated by a mechanism different from that generating the observations in the majority of sequences. The goal is to best discern all the outlier sequences without any knowledge of the underlying generating mechanisms. A generalized likelihood test is considered in the fixed sample size setting. In the sequential setting, a test based on the Multihypothesis Sequential Probability Ratio Test and the repeated significance test is considered. The sequential test outperforms the generalized likelihood test when the lengths of the observation sequences exceed certain values. Applied to a real data set for spam detection, the performance of the proposed tests is shown to be superior to those based on the maximum mean discrepancy for large sample size.
Keywords :
maximum likelihood detection; probability; unsolicited e-mail; anomaly detection; fixed sample size setting; generalized likelihood test; maximum mean discrepancy; multihypothesis sequential probability ratio test; multiple observation sequences; outlier sequence; repeated significance test; sequential setting; spam detection; universal outlier hypothesis testing; Accuracy; Electronic mail; Error probability; IP networks; Manganese; Quantization (signal); Testing; anomaly detection; generalized likelihood test; maximum mean discrepancy; multihypothesis sequential probability ratio test; universal outlier hypothesis testing;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7179042