In many application contexts, like statistical databases, transaction recording systems, scientific databases, query optimizers, OLAP, and so on, data are summarized as histograms of aggregate values. When the task of reconstructing range queries on original data from aggregate data is performed, a certain estimation error cannot be avoided, due to the loss of information in compressing data. Error size strongly depends both on how histograms partition data domains and on how estimation inside each bucket is done. We propose a new type of histogram, based on an unbalanced binary-tree partition, suitable for providing quick answers to hierarchical range queries, and we use adaptive tree-indexing for better approximating frequencies inside buckets. As the results from our experiments demonstrate, our histogram behaves considerably better than state-of-the-art histograms, showing smaller errors in all considered data sets at the same storage space.

Binary-Tree Histograms with Tree Indices

FURFARO F.;SACCA', Domenico
2002-01-01

Abstract

In many application contexts, like statistical databases, transaction recording systems, scientific databases, query optimizers, OLAP, and so on, data are summarized as histograms of aggregate values. When the task of reconstructing range queries on original data from aggregate data is performed, a certain estimation error cannot be avoided, due to the loss of information in compressing data. Error size strongly depends both on how histograms partition data domains and on how estimation inside each bucket is done. We propose a new type of histogram, based on an unbalanced binary-tree partition, suitable for providing quick answers to hierarchical range queries, and we use adaptive tree-indexing for better approximating frequencies inside buckets. As the results from our experiments demonstrate, our histogram behaves considerably better than state-of-the-art histograms, showing smaller errors in all considered data sets at the same storage space.
2002
3540441263
Theoretical Computer Science; Computer Science (all)
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11770/147505
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? ND
social impact