ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

HDFS: Erasure-Coded Information Repository System for Hadoop Clusters

Journal: International Journal of Trend in Scientific Research and Development (Vol.2, No. 5)

Publication Date:

Authors : ;

Page : 1957-1960

Keywords : Hadoop distributed ?le system; replica-based storage clusters; archival performance; erasure codes; parallel encoding;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Existing disk based recorded stockpiling frameworks are insufficient for Hadoop groups because of the obliviousness of information copies and the guide decrease programming model. To handle this issue, a deletion coded information chronicled framework called HD-FS is developed for Hadoop bunches, where codes are utilized to file information copies in the Hadoop dispersed document framework or HD-FS. Here there are two chronicled systems that HDFS-Grouping and HDFS-Pipeline in HDFS to accelerate the information documented process. HDFS-Grouping is a Map Reduce-based information chronicling plan - keeps every mapper's moderate yield Key-Value matches in a nearby key-esteem store and unions all the transitional key-esteem sets with a similar key into one single key-esteem combine, trailed by rearranging the single Key-Value match to reducers to create last equality squares. HDFS-Pipeline frames an information recorded pipeline utilizing numerous information hub in a Hadoop group. HDFS-Pipeline conveys the consolidated single key-esteem combine to an ensuing hub's nearby key-esteem store. Last hub in the pipeline is mindful to yield equality squares. HD-FS is executed in a true Hadoop group. The exploratory outcomes demonstrate that HDFS-Grouping and HDFS-Pipeline accelerate Baseline's rearrange and diminish stages by a factor of 10 and 5, individually. At the point when square size is bigger than 32 M-B, HD-FS enhances the execution of HDFS-RA-ID and HDFS-EC by roughly 31.8 and 15.7 percent, separately. Ameena Anjum | Prof. Shivleela Patil"HDFS: Erasure-Coded Information Repository System for Hadoop Clusters" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-5 , August 2018, URL: http://www.ijtsrd.com/papers/ijtsrd18206.pdf http://www.ijtsrd.com/computer-science/other/18206/hdfs-erasure-coded-information-repository-system-for-hadoop-clusters/ameena-anjum

Last modified: 2018-09-27 19:30:03