Title :
Validation Algorithms Based on Content Characters and Internal Structure: The PDF File Carving Method
Author :
Chen, Mo ; Zheng, Ning ; Xu, Ming ; Lou, Yongjian ; Wang, Xia
Author_Institution :
Coll. of Comput., Hangzhou Dianzi Univ., Hangzhou
Abstract :
This paper presents a new carving method for automatically and effectively carving PDF files from an unstructured digital forensic image. The carving method has five validation algorithms based on the content characters and the internal structure of PDF files. These validation algorithms include header/file length/maximal offset of objects/footer validation, internal structure validation, entropy difference validation, zlib/deflate decompression validation and character table validation. Effectively reassembling PDF file fragments including out-of-order fragments, exactly carving PDF files without any manual intervention, lower "false positives" are the advantage of this method. The PDF file carving method and other carving applications are illustrated over real world data using the DFRWS 2007 carving challenge dataset. The results show that this method is better than otherpsilas.
Keywords :
data compression; document handling; system recovery; PDF file carving method; character table validation; content characters; deflate decompression validation; entropy difference validation; footer validation; internal structure validation; object validation; out-of-order fragments; unstructured digital forensic image; validation algorithms; zlib decompression validation; DFRWS 2007 carving challenge dataset; PDF validation; content characters; file carving; internal structure;
Conference_Titel :
Information Science and Engineering, 2008. ISISE '08. International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-2727-4
DOI :
10.1109/ISISE.2008.209