Code plagiarism test set containing plagiarized and obfuscated source code. Can be used to test detection tools.

nordicway/SourceCode-Plagiarism-TestSets

Source Code Plagiarism Test Sets

This is a free test set for comparing plagiarism detection software. More will be added soon.

Test sets

MiniFactorial

Description: calculates the factorial of a number

Challenge: very small source code, plenty of obfuscation.

Number of programs: 21

  • Original: 1
  • Duplicates: 1
  • Type-1 clones: 2
  • Type-2 clones: 2
  • Type-3 clones: 10
  • Type-4 clones: 5
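To make the clone categories above concrete, here is a minimal sketch of what each type could look like for a factorial function. It is not taken from the test set itself: the language (Python) and all function names are assumptions chosen for illustration, and the variants carry different function names only so they can live in one module. The categories follow the common reading of Roy et al.: Type-1 differs only in layout and comments, Type-2 additionally renames identifiers or literals, Type-3 adds, removes, or changes statements, and Type-4 computes the same result with a different implementation.

```python
from functools import reduce


def factorial_original(n):
    # Hypothetical "original": iterative factorial.
    result = 1
    for i in range(2, n + 1):
        result *= i
    return result


# Type-1: same code; only layout and comments differ.
def factorial_type1(n):
    result = 1
    for i in range(2, n + 1):
        result *= i  # multiply the running product
    return result


# Type-2: same structure; identifiers are renamed.
def factorial_type2(number):
    product = 1
    for k in range(2, number + 1):
        product *= k
    return product


# Type-3: statements added or changed on top of renaming.
def factorial_type3(number):
    if number < 2:  # added guard clause
        return 1
    product = 1
    k = 2
    while k <= number:  # for loop rewritten as a while loop
        product = product * k
        k += 1
    return product


# Type-4: same behavior, syntactically different implementation.
def factorial_type4(n):
    return reduce(lambda acc, k: acc * k, range(1, n + 1), 1)
```

All five functions return the same value for any non-negative integer, which is why a detection tool that only compares text would likely miss the later variants.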

Usage

You may use these test sets free of charge to compare plagiarism detection software or for other purposes. Please see the attached license file. If you use them in a paper, please cite them as follows:

name: Source Code Plagiarism Test Sets

year: 2014

url: https://github.com/nordicway/SourceCode-Plagiarism-TestSets

Contribution

I would love to add your test sets here, so don't hesitate to contribute them.

Process

Create a new directory for each test set, describe it briefly in this README, and send a pull request. A single test set should include one directory with the original source code plus a number of directories containing clones, with or without obfuscation. Clone types are determined using the categorization by Roy et al. [1].
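Before sending a pull request, a quick check of the directory layout described above could look like the following sketch. The directory name `original` and the helper `summarize_test_set` are hypothetical; this README does not prescribe specific directory names, so adjust them to your own test set.

```python
from pathlib import Path


def summarize_test_set(root, original_name="original"):
    """Return (has_original, clone_dirs) for a test set directory.

    `original_name` is an assumed convention, not one this repository mandates.
    """
    root = Path(root)
    # Collect the names of all immediate subdirectories, sorted for stable output.
    subdirs = sorted(p.name for p in root.iterdir() if p.is_dir())
    has_original = original_name in subdirs
    # Every other subdirectory is treated as a clone directory.
    clone_dirs = [name for name in subdirs if name != original_name]
    return has_original, clone_dirs
```

Calling `summarize_test_set("MyTestSet")` on a hypothetical test set returns whether an original directory exists and the list of remaining directories, which you can compare against the clone counts you list in the README.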

License

You may add your own test sets for source code plagiarism to this repository, provided you own the rights to publish them. All code you commit to this repository will be made available under the MIT License.
