Offline Plagiarism Detector

Overview

Online plagiarism detection tools usually come with constraints: some are paid-only services, others artificially limit the number of characters that can be checked, and so on. This tool aims to fill the gap where:

  1. Plagiarism cases are usually simple copy-paste jobs of a few text phrases with minor edits,
  2. Paying for an online tool is unpalatable,
  3. The source texts that might be copied from can be put together manually by the user into a few files (i.e. the Internet is not automatically searched by the tool), or the only concern is people copying from each other, and
  4. Running a command-line tool is simple enough for the user.

The tool is written in Rust. It’s a quick-and-dirty implementation that detects plagiarism using either an equality check between sets of words or Levenshtein distance. It’s parallelized with Rayon, specifically to make the Levenshtein distance calculations faster.
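To make the two detection approaches concrete, here is a minimal sketch using only the standard library. It is not the tool's actual code: the function names (`word_set`, `same_words`, `levenshtein`, `close_matches`), the character-level distance, and the `max_distance` threshold are all assumptions made for illustration.

```rust
use std::collections::HashSet;

/// Lowercase a phrase and split it into a set of words, ignoring punctuation.
/// (Hypothetical helper; the real tool may tokenize differently.)
fn word_set(text: &str) -> HashSet<String> {
    text.split(|c: char| !c.is_alphanumeric())
        .filter(|w| !w.is_empty())
        .map(|w| w.to_lowercase())
        .collect()
}

/// True when two phrases use exactly the same set of words,
/// regardless of word order, case, or punctuation.
fn same_words(a: &str, b: &str) -> bool {
    word_set(a) == word_set(b)
}

/// Character-level Levenshtein distance, computed with two rolling rows.
fn levenshtein(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    // prev[j] holds the distance between the first i chars of `a`
    // and the first j chars of `b`.
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    let mut curr = vec![0; b.len() + 1];
    for (i, ca) in a.iter().enumerate() {
        curr[0] = i + 1;
        for (j, cb) in b.iter().enumerate() {
            let cost = if ca == cb { 0 } else { 1 };
            curr[j + 1] = (prev[j] + cost) // substitution (or match)
                .min(prev[j + 1] + 1)      // deletion
                .min(curr[j] + 1);         // insertion
        }
        std::mem::swap(&mut prev, &mut curr);
    }
    prev[b.len()]
}

/// Indices of the source phrases within `max_distance` edits of `suspect`.
/// Each comparison is independent, so this loop is the natural place
/// to parallelize.
fn close_matches(sources: &[&str], suspect: &str, max_distance: usize) -> Vec<usize> {
    sources
        .iter()
        .enumerate()
        .filter(|(_, src)| levenshtein(src, suspect) <= max_distance)
        .map(|(i, _)| i)
        .collect()
}
```

Because every source phrase is compared against the suspect phrase independently, the loop in `close_matches` is a plausible candidate for Rayon: with the `rayon` crate as a dependency and `use rayon::prelude::*;` in scope, swapping `.iter()` for `.par_iter()` should spread the distance calculations across threads. The set comparison is cheap by contrast, which would fit the note above that parallelism mainly helps the Levenshtein path.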

The Rust crates can be found on Crates.io, and the source code is in the GitHub repository.
