XVM: A bridge between XML data and its behavior
Quanzhong Li, Michalle Y. Kim, et al.
WWW 2004
XML has been widely accepted as the de facto format for data representation and exchange. However, it is also known for the excessive information redundancy in its representation. While various compression schemes have been proposed and some of them can support query processing over compressed files, it is usually inevitable to perform partial (or full) data decompression which is expensive and in some cases may dominate the query processing time. In this paper, we propose a new XML compression scheme based on the Sequitur compression algorithm. By organizing the compression result as a set of context free grammar rules, the scheme supports efficient processing of XPath queries without decompression. The experimental results show that this scheme achieves comparable compression ratio as gzip while its query processing time is among the best of existing algorithms. Copyright 2005 ACM.
Quanzhong Li, Michalle Y. Kim, et al.
WWW 2004
Like Gao, Min Wang, et al.
SAC 2005
Andrey Balmin, Quanzhong Li, et al.
VLDB
Quanzhong Li, Michelle Y. Kim, et al.
SAC 2004