Relevant Keyword Collection using Click-log

KIPS Transactions on Software and Data Engineering, Vol. 19, No. 2, pp. 149-154, Feb. 2012
10.3745/KIPSTB.2012.19.2.149


The aim of this paper is to collect relevant keywords from click-log data, which records users' search keywords and the URLs accessed through them. Our main hypothesis is that two or more different keywords may be relevant to each other if users access the same URLs using them, and that the more URLs they share, the stronger their relationship. To validate this idea, we collect relevant keywords from click-log data provided by a portal site. In our experiment, the approach achieves 89.32% precision when the answer set is restricted to semantically identical words, and 99.03% when the answer set is defined in a broader sense. The approach has the advantages that it is language-independent and that it collects relevant words from real-world data.
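The hypothesis can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the scoring formula, so the sketch assumes the relationship score is simply the number of distinct URLs two keywords share. The function name `related_keywords`, the `min_shared` threshold, and the sample log entries are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def related_keywords(click_log, min_shared=1):
    """Rank keyword pairs by the number of distinct URLs they share.

    click_log is an iterable of (keyword, url) records. The scoring
    (a raw count of shared URLs) is an assumption made for illustration.
    """
    # Group keywords by the URL they led to; a set ignores repeated clicks.
    keywords_by_url = defaultdict(set)
    for keyword, url in click_log:
        keywords_by_url[url].add(keyword)

    # Every pair of keywords that reached the same URL gains one point.
    shared = defaultdict(int)
    for keywords in keywords_by_url.values():
        for a, b in combinations(sorted(keywords), 2):
            shared[(a, b)] += 1

    # Keep pairs above the threshold, strongest relationship first.
    pairs = [(pair, n) for pair, n in shared.items() if n >= min_shared]
    pairs.sort(key=lambda item: (-item[1], item[0]))
    return pairs


# Hypothetical click-log records: (keyword, accessed URL).
log = [
    ("nyc", "example.com/a"),
    ("new york", "example.com/a"),
    ("nyc", "example.com/b"),
    ("new york", "example.com/b"),
    ("big apple", "example.com/b"),
    ("pizza", "example.com/c"),
]

for (a, b), count in related_keywords(log):
    print(f"{a} <-> {b}: {count} shared URL(s)")
```

Because the sketch relies only on co-occurrence of keywords and URLs, it never inspects the keyword text itself, which is consistent with the language independence claimed above. Raising `min_shared` trades coverage for confidence in each reported pair.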

