Plagiarism is on its rise especially in today's scenario where everything is operated and evaluated digitally. Students and teachers all over the world are facing this problem. The main reason we took topic is because of its effect on the learning and the integrity of student, researchers as well as teachers. The more we observe the clearer it becomes that when one starts using plagiarism he/she gets into the habit of passing someone's work as their own, this stops the learning process of the individual altogether, as that person gets rewarded for doing nothing at all. This also forces many people to switch to copy-paste habit as their hard work does not get appreciated meanwhile the ones' who cheated get more rewarded. Our motivation is to survey and understand the reasons and types of such practices and try to come up with a better solution.
Most of the available and proposed models are uni-dimensional and thus treat one aspect of data at a time that is either text or code. We are proposing a website to deal with online as well as peer-to-peer plagiarism. This website or prototype will not only be able to check for the textual similarity of articles but also code similarity (as implemented in MOSS checkers) at the same time. We are proposing to use python libraries for textual similarity detection (online as well as peer-to-peer) and implementation of available resources for code similarity detection.
The existing challenges of present detectors are:
Our proposed model can deal with the following problems from above:
The challenges we are predicting to occur in our model are:
Technologies used
Discussion