Query Expansion and Term Weighting Method for Document Filtering


The KIPS Transactions:PartB , Vol. 10, No. 7, pp. 743-750, Dec. 2003
10.3745/KIPSTB.2003.10.7.743,   PDF Download:

Abstract

In this paper, we propose a query expansion and term weighting method for document filtering to increase precision of the result of Web search engines. Query expansion for document filtering uses ConceptNet, encyclopedia and documents of top 10% high similarity. Term weighting method is used for calculation of query-documents similarity. In the first step, we expand an initial query into the first expanded query using ConceptNet and encyclopedia. And then we weight the first expanded query and calculate the first expanded query-documents similarity. Next, we create the second expanded query using documents of top 10% high similarity and calculate the second expanded query-documents similarity. We combine two similarities from the first and the second step. And then we re-rank the documents according to the combined similarities and filter off non-relevant documents with the lower similarity than the threshold. Our experiments showed that our document filtering method results in a notable improvement in the retrieval effectiveness when measured using both precision-recall and F-Measure.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
S. S. Eun, K. Y. Hwan, O. H. Jeong, J. M. Gil, P. S. Gyu, L. J. Seong, S. Y. Hun, "Query Expansion and Term Weighting Method for Document Filtering," The KIPS Transactions:PartB , vol. 10, no. 7, pp. 743-750, 2003. DOI: 10.3745/KIPSTB.2003.10.7.743.

[ACM Style]
Sin Seung Eun, Kang Yu Hwan, O Hyo Jeong, Jang Myeong Gil, Park Sang Gyu, Lee Jae Seong, and Seo Yeong Hun. 2003. Query Expansion and Term Weighting Method for Document Filtering. The KIPS Transactions:PartB , 10, 7, (2003), 743-750. DOI: 10.3745/KIPSTB.2003.10.7.743.