Implementation of a Journal`s Table of Contents Separation System based on Contents Analysis


The KIPS Transactions:PartB , Vol. 14, No. 7, pp. 481-492, Dec. 2007
10.3745/KIPSTB.2007.14.7.481,   PDF Download:

Abstract

In this paper, a method for automatic indexing of contents to reduce effort for inputting paper information and constructing index is considered. Existing document analysis methods can't analyse various table of contents of journal paper formats efficiently because they have many exceptions. In this paper, various contents formats for journals, which have different features from those for general documents, are analysed and described. The principal elements that we want to represent are titles, authors, and pages for each papers. Thus, the three principal elements are modeled according to the order of their arrangement, and their features are extracted. And a table of content recognition system of journal is implemented, based on the proposed modeling method. The accuracy of exact extraction ratio of 91.5% on title, author, and page type on 660 published papers of various journals is obtained.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
Y. B. Kwon, "Implementation of a Journal`s Table of Contents Separation System based on Contents Analysis," The KIPS Transactions:PartB , vol. 14, no. 7, pp. 481-492, 2007. DOI: 10.3745/KIPSTB.2007.14.7.481.

[ACM Style]
Young Bin Kwon. 2007. Implementation of a Journal`s Table of Contents Separation System based on Contents Analysis. The KIPS Transactions:PartB , 14, 7, (2007), 481-492. DOI: 10.3745/KIPSTB.2007.14.7.481.