An Indexing System for Retrieving Similar Paths in XML Documents


The KIPS Transactions:PartD, Vol. 15, No. 2, pp. 171-178, Apr. 2008
10.3745/KIPSTD.2008.15.2.171,   PDF Download:

Abstract

Since the XML standard was introduced by the W3C in 1998, documents that have been written in XML have been gradually increasing. Accordingly, several systems have been developed in order to efficiently manage and retrieve massive XML documents. BitCube?a bitmap indexing system? is a representative system for this field of research. Based on the bitmap indexing technique, the path bitmap indexing system(LH06), which performs the clustering of similar paths, improved the problem that the existing BitCube system could not solve, namely, determining similar paths. The path bitmap indexing system has the advantage of a higher retrieval speed in not only exactly matched path searching but also similar path searching. However, the similarity calculation algorithm of this system has a few particular problems. Consequently, it sometimes cannot calculate the similarity even though some of two paths have extremely similar relationships; further, it results in an increment in the number of meaningless clusters. In this paper, we have proposed a novel method that calculates the similarity between the paths in order to solve these problems. The proposed system yields a stable result for clustering, and it obtains a high score in clustering precision during a performance evaluation against LH06.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
B. S. Lee and B. Y. Hwang, "An Indexing System for Retrieving Similar Paths in XML Documents," The KIPS Transactions:PartD, vol. 15, no. 2, pp. 171-178, 2008. DOI: 10.3745/KIPSTD.2008.15.2.171.

[ACM Style]
Bum Suk Lee and Byung Yeon Hwang. 2008. An Indexing System for Retrieving Similar Paths in XML Documents. The KIPS Transactions:PartD, 15, 2, (2008), 171-178. DOI: 10.3745/KIPSTD.2008.15.2.171.