This code project is an attempt to replicate the results from paper "Improving plagiarism detection in text document using hybrid weighted similarity" doing a python implementation of it.
Its currently been developed on python 3.11. To run this project follow the setup.sh file or run the necessary commands to install requirements and download the PAN-PC-11 dataset.