Clustering XML Documents Considering The Weight of Large Items in Clusters


The KIPS Transactions:PartD, Vol. 14, No. 1, pp. 1-8, Feb. 2007
DOI: 10.3745/KIPSTD.2007.14.1.1

Abstract

As the number of web documents written in XML, a data exchange language for the modern Internet, increases, web documents are becoming a primary target of information retrieval. Accordingly, research has addressed the structure, integration, and retrieval of XML documents. This paper proposes a method for clustering XML documents based on frequent structures, as groundwork for efficient query processing and retrieval. First, we decompose the trees representing XML documents and extract frequent structures from them. Second, treating the frequent structures as items of transactions, we perform clustering that considers the weight of large items in order to control cluster creation and cluster cohesion. Third, we demonstrate the effectiveness of our method through experiments that compare it with previous methods.
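
The first step described in the abstract, decomposing document trees and keeping the structures that occur frequently, can be illustrated with a minimal sketch. The abstract does not specify how trees are decomposed or how the weight of large items is computed, so everything here is an assumption made for illustration: root-to-leaf tag paths stand in for "structures", and the helper names `root_to_leaf_paths` and `frequent_paths` and the `min_count` threshold are hypothetical. The clustering step itself is not shown.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def root_to_leaf_paths(xml_text):
    """Return the set of root-to-leaf tag paths in one XML document.

    Assumption: a root-to-leaf path is used as the unit of structure;
    the paper's actual decomposition may differ.
    """
    root = ET.fromstring(xml_text)
    paths = set()
    stack = [(root, root.tag)]
    while stack:
        node, path = stack.pop()
        children = list(node)
        if not children:
            paths.add(path)  # leaf reached: record the full path
        for child in children:
            stack.append((child, path + "/" + child.tag))
    return paths


def frequent_paths(docs, min_count):
    """Treat each document as a transaction of paths and keep only the
    paths that occur in at least min_count documents (hypothetical threshold)."""
    transactions = [root_to_leaf_paths(d) for d in docs]
    counts = Counter()
    for t in transactions:
        counts.update(t)  # each path counted once per document
    frequent = {p for p, c in counts.items() if c >= min_count}
    # Each transaction is reduced to its frequent items only.
    return [t & frequent for t in transactions], frequent


docs = [
    "<book><title/><author><name/></author></book>",
    "<book><title/><price/></book>",
    "<paper><title/><author><name/></author></paper>",
]
transactions, frequent = frequent_paths(docs, min_count=2)
```

In this toy input only `book/title` appears in two documents, so it is the single frequent item; the resulting transactions are what a transaction-based clustering step would then consume.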



Cite this article
[IEEE Style]
J. H. Hwang, "Clustering XML Documents Considering The Weight of Large Items in Clusters," The KIPS Transactions:PartD, vol. 14, no. 1, pp. 1-8, 2007. DOI: 10.3745/KIPSTD.2007.14.1.1.

[ACM Style]
Jeong Hee Hwang. 2007. Clustering XML Documents Considering The Weight of Large Items in Clusters. The KIPS Transactions:PartD, 14, 1, (2007), 1-8. DOI: 10.3745/KIPSTD.2007.14.1.1.