Agarwal, S ; Agrawa, R ; Deshpande, P.M ; Gupta, A ; Naughton, J.F ; Ramakrishnan, R ; Sarawagi, S (1996) On the computation of multidimensional aggregates 22nd Int'l Conference on Very Large Databases . pp. 506-521.
Full text not available from this repository.
Abstract
At the heart of all OLAP or multidimensional data analysis applications is the ability to simultaneously aggregate across many sets of dimensions. Computing multidimensional aggregates is a performance bottleneck for these applications. This chapter presents fast algorithms for computing a collection of group-bys. We focus on a special case of the aggregation problem — computation of the CUBE operator. The CUBE operator requires computing group-bys on all possible combinations of a list of attributes, and is equivalent to the union of a number of standard group-by operations. We show how the structure of CUBE computation can be viewed in terms of a hierarchy of group-by operations. Our algorithms extend sort-based and hash-based grouping methods with several optimizations, like combining common operations across multiple group-bys, caching, and using pre-computed group-bys for computing other group-bys. Empirical evaluation shows that the resulting algorithms give much better performance compared to straightforward methods. This chapter combines work done concurrently on computing the data cube by two different teams as reported in [SAG96] and [AAD+96b].
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to ResearchGate GmbH |
ID Code: | 128437 |
Deposited On: | 20 Oct 2022 10:40 |
Last Modified: | 14 Nov 2022 11:59 |
Repository Staff Only: item control page