Using Machine Learning to Predict Stealthy Watermarks in Files During Cyber Crime Investigations

Maha F. Sabir; James H. Jones; Hang Liu; Alex V. Mbaziira; Assoc Comp Machinery

doi:10.1145/3314545.3314561

Conference proceeding

PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), pp.20-25

01/01/2019

DOI: https://doi.org/10.1145/3314545.3314561

Computer Science

Computer Science, Theory & Methods

Science & Technology

Technology

Abstract

Digital evidence continues to be an integral component of cybercrime investigative and judicial processes. However, the increasing volume of digital content and files makes it challenging for forensic examiners to process evidence in a timely way. In this paper, we use machine learning to predict stealthy watermarks in various file types. We use a black-box approach, which differs from current steganographic and cryptographic methods, to find patterns of candidate file locations for hidden data. The results demonstrate that it is possible to use machine learning to build singleton models of the same file type, as well as hybrid models, to predict stealthy watermarks in files. In our experiments, the DOCX singleton models predicted stealthy watermarks with accuracies ranging from 40% to 100%. The PPTX singleton model predicted stealthy watermarks with accuracies ranging from 32.5% to 100%. Similarly, the JPEG singleton model predicted stealthy watermarks with accuracies ranging from 37.5% to 65%. We also generated four types of hybrid models: both the HYBRID3 and JPEG_PPTX models predicted stealthy watermarks with accuracies ranging from 47.5% to 92.5%, while the HYBRID_OOXML model predicted stealthy watermarks with accuracies ranging from 32.5% to 100%. In addition, the JPEG_DOCX model predicted stealthy watermarks with accuracies ranging from 47.5% to 90%.

Details

Title: Using Machine Learning to Predict Stealthy Watermarks in Files During Cyber Crime Investigations
Creators - without role: Maha F. Sabir - University of America
James H. Jones - George Mason University
Hang Liu - Pontifícia Universidade Católica de São Paulo
Alex V. Mbaziira - Marymount University
Assoc Comp Machinery
Publication Details: PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), pp.20-25
Publisher: Assoc Computing Machinery
Number of pages: 6
Grant note: King Abdulaziz University (KAU) - Saudi Arabia Culture Mission (SACM)
Identifiers: 9940389608331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding