Performance Analysis of Multi-Node Hadoop Clusters using Amazon EC2 Instances
Journal: International Journal of Science and Research (IJSR) (Vol. 4, No. 10)
Publication Date: 2015-10-05
Authors: Ruchi Mittal; Ruhi Bagga
Pages: 1646-1650
Keywords: Cloud Computing; Hadoop; MapReduce; Multi-node cluster
Abstract
Hadoop, an open-source implementation of the MapReduce model, is an effective tool for handling, processing, and analyzing the unstructured data generated by cloud applications. Hadoop assumes that the nodes in a cluster are homogeneous in processing capability. In real-world applications, however, the nodes in a cluster are heterogeneous in processing capability, and in such cases Hadoop does not yield effective performance. In this paper, we evaluate and analyze the performance of the WordCount MapReduce application using Hadoop on Amazon EC2 with different Ubuntu instances. Performance is evaluated on both single-node and multi-node clusters, where the multi-node clusters include both homogeneous and heterogeneous configurations. Performance is measured as the execution time of the application on different file sizes.
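For readers unfamiliar with the benchmark, the WordCount computation and the execution-time metric named in the abstract can be illustrated with a minimal, single-process Python sketch. This is not the authors' code and does not use Hadoop. The function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`, `timed_word_count`) are hypothetical and chosen only for illustration. The sketch mimics the map, shuffle, and reduce steps in memory and times one run.

```python
import time
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for each whitespace-separated token."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group the emitted counts by word (done by the framework in real MapReduce)."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped: Dict[str, List[int]]) -> Dict[str, int]:
    """Sum the grouped counts to get one total per word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence within a single process."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return reduce_phase(shuffle_phase(pairs))


def timed_word_count(lines: List[str]) -> Tuple[Dict[str, int], float]:
    """Return the word counts and the elapsed wall-clock time in seconds."""
    start = time.perf_counter()
    counts = word_count(lines)
    elapsed = time.perf_counter() - start
    return counts, elapsed


if __name__ == "__main__":
    sample = ["hadoop runs mapreduce", "mapreduce counts words"]
    counts, seconds = timed_word_count(sample)
    print(counts, f"{seconds:.6f} s")
```

In a real cluster the map and reduce steps run in parallel on different nodes, so the timing above says nothing about Hadoop itself. It only shows what is being computed and measured.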
Last modified: 2021-07-01 14:25:16