ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

HADOOP BASED APPLICATION USING MULTINODE CLUSTERS

Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.6, No. 5)

Publication Date:

Authors : ; ; ;

Page : 706-711

Keywords : ;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

In the present era, data is considered as precious as gold for many organizations. Data management and storage is of utmost importance. In today's scenario, data is being generated in massive quantities every single day. Hence, the storage and processing of data using the conventional storing methods like RDBMS is not efficient and effective. So, new ways have been evolved to manage this massive amount of data, also termed as Big Data. This Big Data is a combination of both structured and unstructured data. Hadoop is an open source software that helps to store and process this Big Data. The Hadoop divides the data in blocks and stores them on different nodes and also does replication of these blocks for fault tolerance. The Hadoop Distribution File system (HDFS) and MapReduce are the two key components of Hadoop. MapReduce is used to process the data. In this paper 3-nodes cluster is proposed to store file and process the data for word-count application.

Last modified: 2017-06-01 20:17:58