Suitability of BlackBox Dataset for Style Analysis in Detection of Source Code Plagiarism

Olfat M. Mirza; Mike Joy; Georgina Cosma

Back

Conference proceeding

Suitability of BlackBox Dataset for Style Analysis in Detection of Source Code Plagiarism

Olfat M. Mirza, Mike Joy and Georgina Cosma

2017 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2017), pp.90-94

01/01/2017

Abstract

Computer Science

Computer Science, Hardware & Architecture

Computer Science, Software Engineering

Computer Science, Theory & Methods

Science & Technology

Technology

Plagiarism is one of the most common problem that has been increasing in the field of higher education. Many research papers have highlighted the issue of plagiarism in context to its detection and source that is often obtained from the text books and online sources, there is a variety of easy ways for students to copy others' work. Coding style can be used to detect source code plagiarism because it relates to programmer personality but does not affect the logic of a program, thus offering a way to differentiate between different code authors. The immediate objective of this paper is to identify whether a data set consisting of student programming assignments is rich enough to apply coding style metrics on in order to detect similarities between code sequences, and we use the BlackBox data set as a case study.

Metrics

1 Record Views

Details

Title: Suitability of BlackBox Dataset for Style Analysis in Detection of Source Code Plagiarism
Creators - without role: Olfat M. Mirza - University of Warwick
Mike Joy - University of Warwick
Georgina Cosma - Nottingham Trent University
Contributors - without role: E Ariwa
P Pichappan
Publication Details: 2017 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2017), pp.90-94
Publisher: IEEE
Number of pages: 5
Identifiers: 9931384408331
Academic Unit: Umm Al Qura University
Language: English
Resource Type: Conference proceeding