Performance Improvement of Word Clustering Using Ontology

KIPS Transactions on Software and Data Engineering, Vol. 13, No. 3, pp. 337-344, Mar. 2006
10.3745/KIPSTB.2006.13.3.337, Full Text:


In this paper, we describe the design and the implementation of word clustering system using a definition of an entry word in the dictionary, called a dictionary definition. Generally word clustering needs various features like words and the performance of a system for the word clustering depends on using some kinds of features. Dictionary definition describes the meaning of an entry in detail, but words in the dictionary definition are implicative or abstractive, and then its length is not long. The word clustering using only features extracted from the dictionary definition results in a lots of small-size clusters. In order to make large-size clusters and improve the performance, we need to transform the features into more general words with keeping the original meaning of the dictionary definition as intact as possible. In this paper, we propose two methods for extending the dictionary definition using ontology. One is to extend the dictionary definition to parent words on the ontology and the other is to extend the dictionary definition to some words in fixed depth from the root of the ontology. Through our experiments, we have observed that the proposed systems outperform that without extending features, and the latter’s extending method overtakes the former’s extending method in performance. We have also observed that verbs are very useful in extending features in the case of word clustering.

Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.

Cite this article
[IEEE Style]
E. J. Park, J. H. Kim and C. Y. Ock, "Performance Improvement of Word Clustering Using Ontology," KIPS Journal B (2001 ~ 2012) , vol. 13, no. 3, pp. 337-344, 2006. DOI: 10.3745/KIPSTB.2006.13.3.337.

[ACM Style]
Eun Jin Park, Jae Hoon Kim, and Cheol Young Ock. 2006. Performance Improvement of Word Clustering Using Ontology. KIPS Journal B (2001 ~ 2012) , 13, 3, (2006), 337-344. DOI: 10.3745/KIPSTB.2006.13.3.337.