Construction Scheme of Training Data using Automated Exploring of Boundary Categories


The KIPS Transactions:PartB , Vol. 16, No. 6, pp. 479-488, Dec. 2009
10.3745/KIPSTB.2009.16.6.479,   PDF Download:

Abstract

This paper shows a reinforced construction scheme of training data for improvement of text classification by automatic search of boundary category. The documents laid on boundary area are usually misclassified as they are including multiple topics and features. which is the main factor that we focus on. In this paper, we propose an automated exploring methodology of optimal boundary category based on previous research. We consider the boundary area among target categories to new category to be required training, which are then added to the target category sementically. In experiments, we applied our method to complex documents by intentionally making errors in training process. The experimental results show that our system has high accuracy and reliability in noisy environment.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
Y. J. Choi, J. G. Jee, S. S. Park, "Construction Scheme of Training Data using Automated Exploring of Boundary Categories," The KIPS Transactions:PartB , vol. 16, no. 6, pp. 479-488, 2009. DOI: 10.3745/KIPSTB.2009.16.6.479.

[ACM Style]
Yun Jeong Choi, Jeong Gyu Jee, and Seung Soo Park. 2009. Construction Scheme of Training Data using Automated Exploring of Boundary Categories. The KIPS Transactions:PartB , 16, 6, (2009), 479-488. DOI: 10.3745/KIPSTB.2009.16.6.479.