Detecting Trojans Using Data Mining Techniques

Muazzam Siddiqui; Morgan C. Wang; Joohan Lee

doi:10.1007/978-3-540-89853-5_43

Back

Detecting Trojans Using Data Mining Techniques

Conference proceeding

Peer reviewed

Detecting Trojans Using Data Mining Techniques

Muazzam Siddiqui, Morgan C. Wang and Joohan Lee

WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, Vol.20, pp.400-411

Communications in Computer and Information Science

01/01/2008

DOI: https://doi.org/10.1007/978-3-540-89853-5_43

Abstract

Computer Science

Computer Science, Hardware & Architecture

Computer Science, Information Systems

Science & Technology

Technology

Telecommunications

A trojan horse is a program that surreptitiously performs its operation under the guise of a legitimate program. Traditional approaches using signatures to detect these programs pose little danger to new and unseen samples whose signatures are not available. The focus of malware research is shifting from using signature patterns to identifying the malicious behavior displayed by these malwares. This paper presents the novel idea of extracting variable length instruction sequences that can identify trojans from clean programs using data mining techniques. The analysis is facilitated by the program control flow information contained in the instruction sequences. Based on general statistics gathered from these instruction sequences, we formulated the problem as a binary classification problem and built random forest, bagging and support vector machine classifiers. Our approach showed a 94.0% detection rate on novel trojans whose data was not used in the model building process.

Metrics

1 Record Views

Details

Title: Detecting Trojans Using Data Mining Techniques
Creators - without role: Muazzam Siddiqui - University of Central Florida
Morgan C. Wang - University of Central Florida
Joohan Lee - Univ Cent Florida, Orlando, FL 32816 USA
Contributors - without role: DMA Hussain
AQK Rajput
B S Chowdhry
Q Gee
Publication Details: WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, Vol.20, pp.400-411
Series: Communications in Computer and Information Science
Publisher: Springer Nature
Number of pages: 12
Identifiers: 9935954208331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding