DocumentCode
2447331
Title
A configurable-hardware document-similarity classifier to detect web attacks
Author
Ulmer, Craig ; Gokhale, Maya
Author_Institution
Sandia Nat. Labs., Livermore, CA, USA
fYear
2010
fDate
19-23 April 2010
Firstpage
1
Lastpage
8
Abstract
This paper describes our approach to adapting a text document similarity classifier based on the Term Frequency Inverse Document Frequency (TFIDF) metric to reconfigurable hardware. The TFIDF classifier is used to detect web attacks in HTTP data. In our reconfigurable hardware approach, we design a streaming, real-time classifier by simplifying an existing sequential algorithm and manipulating the classifier´s model to allow decision information to be represented compactly. We have developed a set of software tools to help automate the process of converting training data to synthesizable hardware and to provide a means of trading off between accuracy and resource utilization. The Xilinx Virtex 5-LX implementation requires two orders of magnitude less memory than the original algorithm. At 166MB/s (80X the software) the hardware implementation is able to achieve Gigabit network throughput at the same accuracy as the original algorithm.
Keywords
Internet; field programmable gate arrays; pattern classification; security of data; text analysis; TFIDF metric; Web attack detection; Xilinx Virtex 5-LX; configurable-hardware document; document similarity classifier; reconfigurable hardware; term frequency inverse document frequency; Algorithm design and analysis; Frequency; Hardware; Laboratories; Network synthesis; Resource management; Software tools; Throughput; Training data; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4244-6533-0
Type
conf
DOI
10.1109/IPDPSW.2010.5470737
Filename
5470737
Link To Document