Fault Tolerance Testing for Crash and Omission Transient Failure during Resource Scheduling of Grid Computing?

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 5)

Publication Date: 2014-05-30

Authors : Inderpreet Kaur; Sarpreet Singh;

Page : 547-551

Keywords : Resource Management; Fault tolerance; Task Replication; Job Scheduling;

Source : Download Find it from : Google Scholar

Abstract

In computational Grid, fault tolerance is an imperative issue to be considered during job scheduling. Due to the widespread use of resources, systems are highly prone to errors and failures. Hence fault tolerance plays a key role in grid to avoid the problem of unreliability. The two main techniques for implementing fault tolerance in grid environment are check pointing and replication. Grid Computing involves a network of computers that are utilized together to gain large supercomputing type computing resources. Scheduling the task to the appropriate resource is a vital requirement in computational Grid. This paper presents an overview of Resource Management; its basic function and structure, fault tolerance techniques. The proposed method is to improve one of the Fault Tolerance Algorithm that is the fittest resource scheduling algorithm, by scheduling the job in coordination with job replication when crash occurs.

Main Menu

Searching By

PARTNERS

Fault Tolerance Testing for Crash and Omission Transient Failure during Resource Scheduling of Grid Computing?

Abstract

Advertisement