Batch Job Active-Active Resiliency System and Method

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.13, No. 5)

Page : 89-99

Keywords : Active-Active Resiliency; Batch Job Workflow; State Management; Automated Retries; Auditability; Observability; Workflow Orchestration

Abstract

This manuscript presents a system for orchestrating batch job workflows in active-active resiliency mode across multiple regions. The system abstracts the complexities of job and step/task state management, provides automated and customizable retries, and ensures auditability and observability. By leveraging lock marker files and a configurable scheduler, it maintains operational continuity and fault tolerance even during regional failures. This approach addresses the challenges of asynchronous workflow management, offering enhanced visibility, control, and efficiency in processing batch jobs.
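
The abstract does not describe how the lock marker files or retries are implemented, so the following is only a minimal sketch of the general idea, not the authors' design. It assumes a marker directory that every region can see, and it uses a local filesystem as a stand-in for whatever shared store the real system relies on. The names `try_acquire_marker`, `run_with_retries`, and `run_job` are hypothetical. Each region tries to create the same marker file atomically; only the region that succeeds runs the step, and a failing step is retried with exponential backoff.

```python
import os
import time
from typing import Callable, Optional


def try_acquire_marker(marker_dir: str, job_id: str, region: str) -> bool:
    """Atomically create a lock marker file; True means this region owns the job."""
    path = os.path.join(marker_dir, f"{job_id}.lock")
    try:
        # O_CREAT | O_EXCL fails if the file already exists,
        # so at most one caller can create the marker.
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as marker:
        marker.write(region)  # record the owner for later auditing
    return True


def run_with_retries(
    step: Callable[[], str], max_attempts: int = 3, base_delay: float = 0.0
) -> str:
    """Run a step, retrying on any exception with exponential backoff."""
    last_error: Optional[Exception] = None
    for attempt in range(1, max_attempts + 1):
        try:
            return step()
        except Exception as exc:  # a real system would likely narrow this
            last_error = exc
            if attempt < max_attempts:
                time.sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError(f"step failed after {max_attempts} attempts") from last_error


def run_job(
    marker_dir: str,
    job_id: str,
    region: str,
    step: Callable[[], str],
    max_attempts: int = 3,
) -> Optional[str]:
    """Run the step only if this region wins the marker; otherwise return None."""
    if not try_acquire_marker(marker_dir, job_id, region):
        return None  # another region already claimed this job
    return run_with_retries(step, max_attempts=max_attempts)
```

In this sketch, two regions calling `run_job` with the same `job_id` result in exactly one execution: the second call finds the marker and returns `None`. A production version would also need stale-marker expiry, so that a job claimed by a region that subsequently fails can be picked up elsewhere, and a storage layer that actually guarantees atomic create-if-absent across regions.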

Last modified: 2024-05-23 20:34:56