ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Hadoop2 Yarn

Journal: IPASJ International Journal of Computer Science (IIJCS) (Vol.3, No. 9)

Publication Date:

Authors : ; ;

Page : 18-26

Keywords : Keywords: Hadoop2.0; Yarn; MapReduce; Big Data; Cluster; High Availability; ResourceManager; NodeManager;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

ABSTRACT Apache Hadoop 2 (Hadoop 2.0) is the second iteration of the Hadoop framework for distributed data processing Hadoop 2 adds support for running non-batch applications through the introduction of YARN, a redesigned cluster resource manager that eliminates Hadoop's sole reliance on the MapReduce programming model. Short for Yet Another Resource Negotiator, YARN puts resource management and job scheduling functions in a separate layer beneath the data processing one, enabling Hadoop 2 to run a variety of applications. Overall, the changes made in Hadoop 2 position the framework for wider use in big data analytics and other enterprise applications. For example, it is now possible to run event processing as well as streaming, real-time and operational applications. HDFS federation brings important measures of scalability and reliability to Hadoop. YARN, the other major advance in Hadoop 2, brings significant performance improvements for some applications, supports additional processing models, and implements a more flexible execution engine. YARN is a resource manager that was created by separating the processing engine and resource management capabilities of MapReduce as it was implemented in Hadoop 1.YARN is often called the operating system of Hadoop because it is responsible for managing and monitoring workloads, maintaining a multi-tenant environment, implementing security controls, and managing high availability features of Hadoop

Last modified: 2015-10-10 15:32:54