Title
A Proposed Approach to Compound File Fragment Identification.
Abstract
One of the biggest challenges in file fragment classification is the low classification rate of compound files known as high entropy files that contain different types of data, such as images and compressed text. It is seen that current methods for file fragment classification may not work for classifying these compound files. In this paper we propose a novel approach based on detecting deflate-encoded data in compound file fragments then decompress that data before applying a machine learning technique for classification. We apply our proposed method to classify Adobe portable document format (PDF) file type. Experiments showed high classification rate for the proposed method.
Year
DOI
Venue
2014
10.1007/978-3-319-11698-3_38
Lecture Notes in Computer Science
Keywords
Field
DocType
Digital forensics,file type classification,compound file fragment classification
File format,Network forensics,Information retrieval,Digital forensics,Computer science,Computer network,Data type,Portable document format,Classification rate,Entropy (information theory),Database
Conference
Volume
ISSN
Citations 
8792
0302-9743
1
PageRank 
References 
Authors
0.36
10
4
Name
Order
Citations
PageRank
Khoa Nguyen115913.09
Dat Tran245478.64
Wanli Ma327032.72
Dharmendra Sharma424058.91