A Two Phases Plagiarism Detection System for the Newspaper Articles by using a Web Search and a Document Similarity Estimation


KIPS Transactions on Software and Data Engineering, Vol. 16, No. 2, pp. 181-194, Feb. 2009
10.3745/KIPSTB.2009.16.2.181, Full Text:

Abstract

With the increased interest on the document copyright, many of researches related to the document plagiarism have been done up to now. The plagiarism problem of newspaper articles has attracted much interest because the plagiarism cases of the articles having much commercial values in market are currently happened very often. Many researches related to the document plagiarism have been so hard to be applied to the newspaper articles because they have strong real-time characteristics. So to detect the plagiarism of the articles, many human detectors have to read every single thousands of articles published by hundreds of newspaper companies manually. In this paper, we firstly sorted out the articles with high possibility of being copied by utilizing OpenAPI modules supported by web search companies such as Naver and Daum. Then, we measured the document similarity between selected articles and the original article and made the system decide whether the article was plagiarized or not. In experiment, we used YonHap News articles as the original articles and we also made the system select the suspicious articles from all searched articles by Naver and Daum news search services.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
J. H. Cho, H. K. Jung and Y. S. Kim, "A Two Phases Plagiarism Detection System for the Newspaper Articles by using a Web Search and a Document Similarity Estimation," KIPS Journal B (2001 ~ 2012) , vol. 16, no. 2, pp. 181-194, 2009. DOI: 10.3745/KIPSTB.2009.16.2.181.

[ACM Style]
Jung Hyun Cho, Hyun Ki Jung, and Yu Seop Kim. 2009. A Two Phases Plagiarism Detection System for the Newspaper Articles by using a Web Search and a Document Similarity Estimation. KIPS Journal B (2001 ~ 2012) , 16, 2, (2009), 181-194. DOI: 10.3745/KIPSTB.2009.16.2.181.