Document Plagiarism Detection Using a New Concept Similarity in Formal Concept Analysis

Bibliographic Details
Main Authors: Jirapond Muangprathub, Siriwan Kajornkasirat, Apirat Wanichsombat
Format: Article
Language: English
Published: Wiley 2021-01-01
Series: Journal of Applied Mathematics
Online Access: http://dx.doi.org/10.1155/2021/6662984
Description
Summary: This paper proposes a document plagiarism detection algorithm built on incremental knowledge construction with formal concept analysis (FCA). The incremental knowledge construction supports matching a suspect document against the source document in storage. A new concept similarity measure is proposed for retrieving formal concepts from the constructed knowledge; it is based on appearance frequencies in that structure. Because the structure represents knowledge as FCA concepts, each definable by a conjunction of properties, the approach can also be applied to retrieving relevant information. The measure is mathematically proven to be a formal similarity metric, and its performance is demonstrated in document plagiarism detection. The paper also provides an algorithm for building the information structure used in detection. Thai text test collections are used to evaluate the performance of the implemented web application.
ISSN: 1110-757X, 1687-0042
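
The summary describes formal concepts as definable by a conjunction of properties and retrieved through a similarity measure. The following is a minimal Python sketch of that general idea, not the authors' algorithm: the toy document-term context, the function names, and the use of plain Jaccard overlap are all assumptions made for illustration. The paper's own measure is based on appearance frequencies and is not specified in this record.

```python
from itertools import combinations

# Hypothetical toy context: document id -> set of index terms (not from the paper).
CONTEXT = {
    "d1": {"concept", "lattice", "similarity"},
    "d2": {"concept", "lattice"},
    "d3": {"concept", "plagiarism"},
}
ALL_TERMS = set().union(*CONTEXT.values())


def intent(docs):
    """Terms shared by every document in docs (all terms if docs is empty)."""
    term_sets = [CONTEXT[d] for d in docs]
    return set.intersection(*term_sets) if term_sets else set(ALL_TERMS)


def extent(terms):
    """Documents that contain every term in terms (a conjunction of properties)."""
    return {d for d, doc_terms in CONTEXT.items() if terms <= doc_terms}


def formal_concepts():
    """Enumerate all (extent, intent) pairs by closing every subset of documents.

    Brute force, exponential in the number of documents; fine for a toy context only.
    """
    found = set()
    docs = sorted(CONTEXT)
    for size in range(len(docs) + 1):
        for subset in combinations(docs, size):
            shared_terms = intent(subset)
            found.add((frozenset(extent(shared_terms)), frozenset(shared_terms)))
    return found


def jaccard(a, b):
    """Jaccard overlap of two sets; a stand-in, NOT the paper's frequency-based measure."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def best_concept(suspect_terms):
    """Return the concept whose intent overlaps most with the suspect document's terms."""
    return max(formal_concepts(), key=lambda c: jaccard(set(c[1]), suspect_terms))


if __name__ == "__main__":
    docs, terms = best_concept({"concept", "lattice"})
    print(sorted(docs), sorted(terms))
```

In this sketch a suspect document is reduced to a set of terms, and the stored documents in the best-matching concept's extent are the candidates for closer comparison. An incremental construction, as the summary describes, would update the concept set as documents arrive instead of re-enumerating it.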