A Spatial Entropy based Decision Tree Method Considering Distribution of Spatial Data


The KIPS Transactions:PartB , Vol. 13, No. 7, pp. 643-652, Dec. 2006
10.3745/KIPSTB.2006.13.7.643,   PDF Download:

Abstract

Decision trees are mainly used for the classification and prediction in data mining. The distribution of spatial data and relationships with their neighborhoods are very important when conducting classification for spatial data mining in the real world. Spatial decision trees in previous works have been designed for reflecting spatial data characteristic by rating Euclidean distance. But it only explains the distance of objects in spatial dimension so that it is hard to represent the distribution of spatial data and their relationships. This paper proposes a decision tree based on spatial entropy that represents the distribution of spatial data with the dispersion and dissimilarity. The dispersion presents the distribution of spatial objects within the belonged class. And dissimilarity indicates the distribution and its relationship with other classes. The rate of dispersion by dissimilarity presents that how related spatial distribution and classified data with non-spatial attributes are. Our experiment evaluates accuracy and building time of a decision tree as compared to previous methods. We achieve an improvement in performance by about 18%, 11%, respectively.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
Y. K. Jang, B. S. You, D. W. Lee, S. K. Cho, H. Y. Bae, "A Spatial Entropy based Decision Tree Method Considering Distribution of Spatial Data," The KIPS Transactions:PartB , vol. 13, no. 7, pp. 643-652, 2006. DOI: 10.3745/KIPSTB.2006.13.7.643.

[ACM Style]
Youn Kyung Jang, Byeong Seob You, Dong Wook Lee, Sook Kyung Cho, and Hae Young Bae. 2006. A Spatial Entropy based Decision Tree Method Considering Distribution of Spatial Data. The KIPS Transactions:PartB , 13, 7, (2006), 643-652. DOI: 10.3745/KIPSTB.2006.13.7.643.