A detection method for the illegal copying of digital documents

作者:Li Xu*; Liu Guohua; Ma Huidong; Wang Lei
来源:International Journal of Innovative Computing Information and Control, 2008, 4(3): 681-688.

摘要

Easy accessibility to digital documents via the internet makes it easy for many users to share information. However, it also leads to a perplexing problem concerning intellectual property security. To address the problem, a detection method for the illegal copying of digital documents is proposed in this paper, which can automate to detect the partial or whole overlaps between electronic documents. It is a powerful tool to protect the author's intellectual property and to improve the efficiency of information retrieval. We describe the representing method of a document and define the corresponding, overlap measure. An experiment verifies the efficiency of the proposed method and compares it with the word-frequency-based method and the sentence-based method in different data sets. The experimental results show that the proposed method is superior, and it can accurately identify any matching text of a certain length.