Catálogo de publicaciones - libros

Compartir en
redes sociales

Data Warehousing and Knowledge Discovery: 4th International Conference, DaWaK 2002 Aix-en-Provence, France, September 4-6, 2002. Proceedings

Yahiko Kambayashi ; Werner Winiwarter ; Masatoshi Arikawa (eds.)

En conferencia: 4º International Conference on Data Warehousing and Knowledge Discovery (DaWaK) . Aix-en-Provence, France . September 4, 2002 - September 6, 2002

Resumen/Descripción – provisto por la editorial

No disponible.

Palabras clave – provistas por la editorial

No disponibles.

Disponibilidad

Institución detectada	Año de publicación	Navegá	Descargá	Solicitá
No detectada	2002	SpringerLink

Información

Tipo de recurso:

libros

ISBN impreso

978-3-540-44123-6

ISBN electrónico

978-3-540-46145-6

Editor responsable

Springer Nature

País de edición

Reino Unido

Fecha de publicación

2002

Información sobre derechos de publicación

Cobertura temática

Ciencias de la computación e información

Tabla de contenidos

Verificá que desde tu institución tengas acceso para descargar o solicitar el libro completo o alguno de sus capítulos.

doi: 10.1007/3-540-46145-0_31

Striving towards Near Real-Time Data Integration for Data Warehouses

Robert M. Bruckner; Beate List; Josef Schiefer

The amount of information available to large-scale enterprises is growing rapidly. While operational systems are designed to meet well-specified (short) response time requirements, the focus of data warehouses is generally the strategic analysis of business data integrated from heterogeneous source systems. The decision making process in traditional data warehouse environments is often delayed because data cannot be propagated from the source system to the data warehouse in time. A real-time data warehouse aims at decreasing the time it takes to make business decisions and tries to attain zero latency between the cause and effect of a business decision. In this paper we present an architecture of an ETL environment for real-time data warehouses, which supports a continual near real-time data propagation. The architecture takes full advantage of existing J2EE (Java 2 Platform, Enterprise Edition) technology and enables the implementation of a distributed, scalable, near real-time ETL environment. Instead of using vendor proprietary ETL (extraction, transformation, loading) solutions, which are often hard to scale and often do not support an optimization of allocated time frames for data extracts, we propose in our approach ETLets (spoken “et-lets”) and Enterprise Java Beans (EJB) for the ETL processing tasks.

- Data Warehouse Maintenance | Pp. 317-326

doi: 10.1007/3-540-46145-0_32

Time-Interval Sampling for Improved Estimations in Data Warehouses

Pedro Furtado; João Pedro Costa

In large data warehouses it is possible to return very fast approximate answers to user queries using pre-computed sampling summaries well-fit for all types of exploration analysis. However, their usage is constrained by the fact that there must be a representative number of samples in grouping intervals to yield acceptable accuracy. In this paper we propose and evaluate a technique that deals with the representation issue by using time interval-biased stratified samples (TISS). The technique is able to deliver fast accurate analysis to the user by taking advantage of the importance of the time dimension in most user analysis. It is designed as a transparent middle layer, which analyzes and rewrites the query to use a summary instead of the base data warehouse. The estimations and error bounds returned using the technique are compared to those of traditional sampling summaries, to show that it achieves significant improvement in accuracy.

- Data Warehouse Maintenance | Pp. 327-337