Minimize Staleness and Stretch in Streaming Data Warehouses
Journal: International Journal of Science and Research (IJSR) (Vol.2, No. 9)Publication Date: 2013-09-15
Authors : S. M. Subhani M. Nagendramma;
Page : 375-377
Keywords : Data warehouse maintenance; online scheduling;
Abstract
We study scheduling algorithms for loading data feeds into real time data warehouses, which are used in applications such as IP network monitoring, online financial trading, and credit card fraud detection. In these applications, the warehouse collects a large number of streaming data feeds that are generated by external sources and arrive asynchronously. We discuss update scheduling in streaming data warehouses, which combine the features of traditional data warehouses and data stream systems. In our setting, external sources push append-only data streams into the warehouse with a wide range of inter-arrival times. While traditional data warehouses are typically refreshed during downtimes, streaming warehouses are updated as new data arrive. In this paper we develop a theory of temporal consistency for stream warehouses that allows for multiple consistency levels. We model the streaming warehouse update problem as a scheduling problem, where jobs correspond to processes that load new data into tables, and whose objective is to minimize data staleness over time.
Other Latest Articles
- Carbon Financing for Renewable Energy Projects in Zimbabwe ? A Case of Chipendeke Micro-Hydro Scheme
- Constraints and Opportunities to Rabbit Production in Zimbabwe: A Case Study of the Midlands Province, Zimbabwe
- Radiochemical Properties of Irradiated PVA\AgNO3 Film by Electron Beam
- Effectiveness of ERules in Generating Non Redundant Rule Sets in Pharmacy Database
- Voice Morphing System for People Suffering from Laryngectomy
Last modified: 2013-10-01 23:34:32