Title :
IRILD: An Information Retrieval Based Method for Information Leak Detection
Author :
Gessiou, Eleni ; Vu, Quang Hieu ; Ioannidis, Sotiris
Author_Institution :
Inst. of Comput. Sci., FORTH, Heraklion, Greece
Abstract :
The traditional approach for detecting information leaks is to generate fingerprints of sensitive data, by partitioning and hashing it, and then comparing these fingerprints against outgoing documents. Unfortunately, this approach incurs a high computation cost as every part of document needs to be checked. As a result, it is not applicable to systems with a large number of documents that need to be protected. Additionally, the approach is prone to false positives if the fingerprints are common phrases. In this paper, we propose an improvement for this approach to offer a much faster processing time with less false positives. The core idea of our solution is to eliminate common phrases and non-sensitive phrases from the fingerprinting process. Non-sensitive phrases are identified by looking at available public documents of the organization that we want to protect from information leaks and common phrases are identified with the help of a search engine. In this way, our solution both accelerates leak detection and increases the accuracy of the result. Experiments were conducted on real-world data to prove the efficiency and effectiveness of the proposed solution.
Keywords :
data privacy; document handling; fingerprint identification; information retrieval; search engines; IRILD; common phrases elimination; fingerprint generation; fingerprinting process; high computation cost; information leak detection; information retrieval-based method; nonsensitive phrases elimination; outgoing documents; processing time; public documents; real-world data; search engine; searching; sensitive data; Accuracy; Electronic mail; Google; Leak detection; Organizations; Search engines; fingerprinting; information leaks; privacy;
Conference_Titel :
Computer Network Defense (EC2ND), 2011 Seventh European Conference on
Conference_Location :
Gothenburg
Print_ISBN :
978-1-4673-2116-7
DOI :
10.1109/EC2ND.2011.21