Improving performance of a plagiarism detection system
Andrzej Sobecki, Marcin Kępa IKC 2017
Plagiarism detection problem
Text documents in unstructured form
Finding potential source documents based on the suspected document
Parsing → Hashing → Filtering → Calculating similarities
The crucial stage for performance
The important stage for accuracy
[Diagram: a hash function h(x) turns each document into a document profile, one hash per sentence. The suspected document's profile is compared with the available document profiles from the repositories by counting identical hash values.]
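The profile-and-count idea from the diagram can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the sentence splitter, the normalisation step, the choice of SHA-256 as h(x), and the names `sentence_hashes`, `count_identical`, and `rank_sources` are all hypothetical.

```python
import hashlib
import re
from collections import Counter


def sentence_hashes(text):
    """Build a document profile: one hash per sentence."""
    # Assumption: a naive splitter that ends sentences at '.', '!' or '?'.
    sentences = re.split(r"[.!?]+", text)
    profile = []
    for sentence in sentences:
        # Assumption: lower-case and collapse whitespace before hashing,
        # so trivial formatting differences do not change the hash.
        normalised = " ".join(sentence.lower().split())
        if normalised:
            digest = hashlib.sha256(normalised.encode("utf-8")).hexdigest()
            profile.append(digest)
    return profile


def count_identical(suspected_profile, doc_profile):
    """Count hash values that occur in both profiles (multiset intersection)."""
    common = Counter(suspected_profile) & Counter(doc_profile)
    return sum(common.values())


def rank_sources(suspected_text, repository):
    """Rank repository documents by the number of sentence hashes shared
    with the suspected document, highest count first."""
    suspected = sentence_hashes(suspected_text)
    scores = {
        name: count_identical(suspected, sentence_hashes(text))
        for name, text in repository.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    repository = {
        "a": "The cat sat. Dogs bark loudly.",
        "b": "Nothing in common here.",
    }
    suspected = "Dogs bark   LOUDLY! The cat sat. Something new."
    print(rank_sources(suspected, repository))
```

In a real system the repository profiles would be computed once and stored, so that only the suspected document needs hashing at query time; the sketch recomputes them for brevity.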
accuracy of the plagiarism detection process?
repositories,