A Genetic Algorithm for Materialized View Selection in Data Warehouses


The KIPS Transactions:PartD, Vol. 11, No. 2, pp. 325-338, Apr. 2004
10.3745/KIPSTD.2004.11.2.325,   PDF Download:

Abstract

A data warehouse stores information that is collected from multiple, heterogeneous information sources for the purpose of complex querying and analysis. Information in the warehouse is typically stored in the form of materialized views, which represent pre-computed portions of frequently asked queries. One of the most important tasks of designing a warehouse is the selection of materialized views to be maintained in the warehouse. The goal is to select a set of views so that the total query response time over all queries can be minimized while a limited amount of time for maintaining the views is given(maintenance-cost view selection problem). In this paper, we propose an efficient solution to the maintenance-cost view selection problem using a genetic algorithm for computing a near-optimal set of views. Specifically, we explore the maintenance-cost view selection problem in the context of OR view graphs. We show that our approach represents a dramatic improvement in terms of time complexity over existing search-based approaches that use heuristics. Our analysis shows that the algorithm consistently yields a solution that only has an additional 10% of query cost of over the optimal query cost while at the same time exhibits an impressive performance of only a linear increase in execution time. We have implemented a prototype version of our algorithm that is used to evaluate our approach.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
L. M. Su, "A Genetic Algorithm for Materialized View Selection in Data Warehouses," The KIPS Transactions:PartD, vol. 11, no. 2, pp. 325-338, 2004. DOI: 10.3745/KIPSTD.2004.11.2.325.

[ACM Style]
Lee Min Su. 2004. A Genetic Algorithm for Materialized View Selection in Data Warehouses. The KIPS Transactions:PartD, 11, 2, (2004), 325-338. DOI: 10.3745/KIPSTD.2004.11.2.325.