ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A Framework for Parallel Data Processing in Cloud Systems

Journal: International Journal of Science and Research (IJSR) (Vol.3, No. 8)

Publication Date:

Authors : ; ;

Page : 91-94

Keywords : Dynamic Resource Allocation for Efficient Parallel Data Processing in the Cloud;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

In recent years ad hoc parallel data processing has emerged to be one of the effective applications for Infrastructure-as-a-Service (IaaS) clouds. Major Cloud computing companies have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for customers to access these services and to deploy their programs. However, the processing frameworks which are currently used have been designed for static, homogeneous cluster setups and disregard the particular nature of a cloud. Consequently, the allocated compute resources may be inadequate for big parts of the submitted job and unnecessarily increase processing time and cost. In this project, we discuss the opportunities and challenges for efficient parallel data processing in clouds and present our research project Nephele. Nephele is the first data processing framework to explicitly exploit the dynamic resource allocation offered by todays IaaS clouds for both, task scheduling and execution. Particular tasks of a processing job can be assigned to different types of virtual machines which are automatically instantiated and terminated during the job execution.

Last modified: 2021-06-30 21:05:59