Performance Analysis for Optimizing Hadoop MapReduce Execution
Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 6)Publication Date: 2016-06-05
Authors : Samiksha Misal; P. S Desai;
Page : 2219-2223
Keywords : MapReduce; Hadoop; Self-tuning; Optimization; Decision tree;
Abstract
The Apache Hadoop data changing writing computer programs is doused in an intricate situation made out of gigantic machine bunches, limitless data sets, and a couple taking care of vocations. Managing a Hadoop situation is time escalated, toilsome and obliges expert customers. Likewise, nonappearance of learning may include misconfigurations adulterating the gathering execution. To address misconfiguration issues we propose an answer completed on top of Hadoop. The goal is showing a tuning toward oneself segment for Hadoop businesses on Big Data circumstances. Late years have witness the improvement of distributed computing and the huge information period, which raises difficulties to conventional choice tree calculations. In the first place, as the measure of dataset turns out to be to a great degree huge, the procedure of building a choice tree can be very tedious. Second, on the grounds that the information can't fit in memory any all the more, some calculation must be moved to the outer stockpiling and hence builds the I/O cost. To this end, we propose to execute a normal choice tree calculation, C4.5, utilizing MapReduce programming model.
Other Latest Articles
- To Study and Realization of Quad-Band Bandpass Filter using Advanced Design System (ADS) Software
- Weaving Co-operative Societies & Handloom Products in Rajnandgaon District of Chhattisgarh State
- Document Clustering using Improved K-means Algorithm
- Efficiency of Local Government Units in Northwestern Philippines as to the Attainment of the Millenium Development Goals
- Energy Efficient WSN using GPSR for Mobile Sink
Last modified: 2021-07-01 14:39:08