Improving Naive Bayes Text Classifiers with Incremental Feature Weighting


The KIPS Transactions:PartB , Vol. 15, No. 5, pp. 457-464, Oct. 2008
10.3745/KIPSTB.2008.15.5.457,   PDF Download:

Abstract

In the real-world operational environment, most of text classification systems have the problems of insufficient training documents and no prior knowledge of feature space. In this regard, Naive Bayes is known to be an appropriate algorithm of operational text classification since the classification model can be evolved easily by incrementally updating its pre-learned classification model and feature space. This paper proposes the improving technique of Naive Bayes classifier through feature weighting strategy. The basic idea is that parameter estimation of Naive Bayes considers the degree of feature importance as well as feature distribution. We can develop a more accurate classification model by incorporating feature weights into Naive Bayes learning algorithm, not performing a learning process with a reduced feature set. In addition, we have extended a conventional feature update algorithm for incremental feature weighting in a dynamic operational environment. To evaluate the proposed method, we perform the experiments using the various document collections, and show that the traditional Naive Bayes classifier can be significantly improved by the proposed technique.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
H. J. Kim and J. Y. Chang, "Improving Naive Bayes Text Classifiers with Incremental Feature Weighting," The KIPS Transactions:PartB , vol. 15, no. 5, pp. 457-464, 2008. DOI: 10.3745/KIPSTB.2008.15.5.457.

[ACM Style]
Han Joon Kim and Jae Young Chang. 2008. Improving Naive Bayes Text Classifiers with Incremental Feature Weighting. The KIPS Transactions:PartB , 15, 5, (2008), 457-464. DOI: 10.3745/KIPSTB.2008.15.5.457.