Publication details

Finding Plagiarism by Evaluating Document Similarities

Authors

KASPRZAK Jan BRANDEJS Michal KŘIPAČ Miroslav

Year of publication 2009
Type Article in Proceedings
Conference Proceedings of the SEPLN'09 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse
MU Faculty or unit

Faculty of Informatics

Citation
Web
Field Informatics
Keywords Plagiarism Similar Documents Document Overlap Distributed Computing Parallelism
Description In this paper we discuss the approach we have used for finding plagiarized passages of text during the PAN'09 plagiarism detection competition. We describe the existing anti-plagiarism system we use in the Czech National Archive of Graduate Theses. We then discuss the modifications to this system which have been necessary in order to fit the results to the competition rules. We also present a performance data of the described system, and the possible improvement for our production systems, which result from the code written for the PAN'09 competition.
Related projects: