Path Materialization Revisited: An Efficient Storage Model for XML Data

Jiang, H., Lu, H., Wang, W. and Yu, J.X.

    XML is emerging as a new major standard for representing data on the world wide web. Several XML storage models have been proposed to store XML data in different database management systems. The unique feature of model-mapping- based approaches is that no DTD information is required for XML data storage. In this paper, we present a new model- mapping-based storage model, called XParent. Unlike the existing work on model-mapping-based approaches that emphasized on converting XML documents to/from database schema and translation of XML queries into SQL queries, in this pa- per, we focus ourselves on the effectiveness of storage models in terms of query processing. We study the key issues that affect query performance, namely, storage schema design (storing XML data across multiple tables) and path materialization (storing path information in databases). We show that similar but different storage models significantly affect query performance. A performance study is conducted using three data sets and query sets. The experimental results are presented.
Cite as: Jiang, H., Lu, H., Wang, W. and Yu, J.X. (2002). Path Materialization Revisited: An Efficient Storage Model for XML Data. In Proc. Thirteenth Australasian Database Conference (ADC2002), Melbourne, Australia. CRPIT, 5. Zhou, X., Ed. ACS. 85-94.
pdf (from crpit.com) pdf (local if available) BibTeX EndNote GS