HDFS: Erasure-Coded Information Repository System for Hadoop Clusters

Abstract

Existing disk based recorded stockpiling frameworks are insufficient for Hadoop groups because of the obliviousness of information copies and the guide decrease programming model. To handle this issue, a deletion coded information chronicled framework called HD-FS is developed for Hadoop bunches, where codes are utilized to file information copies in the Hadoop dispersed document framework or HD-FS. Here there are two chronicled systems that HDFS-Grouping and HDFS-Pipeline in HDFS to accelerate the information documented process. HDFS-Grouping is a Map Reduce-based information chronicling plan - keeps every mapper's moderate yield Key-Value matches in a nearby key-esteem store and unions all the transitional key-esteem sets with a similar key into one single key-esteem combine, trailed by rearranging the single Key-Value match to reducers to create last equality squares. HDFS-Pipeline frames an information recorded pipeline utilizing numerous information hub in a Hadoop group. HDFS-Pipeline conveys the consolidated single key-esteem combine to an ensuing hub's nearby key-esteem store. Last hub in the pipeline is mindful to yield equality squares. HD-FS is executed in a true Hadoop group. The exploratory outcomes demonstrate that HDFS-Grouping and HDFS-Pipeline accelerate Baseline's rearrange and diminish stages by a factor of 10 and 5, individually. At the point when square size is bigger than 32 M-B, HD-FS enhances the execution of HDFS-RA-ID and HDFS-EC by roughly 31.8 and 15.7 percent, separately. Ameena Anjum | Prof. Shivleela Patil"HDFS: Erasure-Coded Information Repository System for Hadoop Clusters" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-5 , August 2018, URL: http://www.ijtsrd.com/papers/ijtsrd18206.pdf http://www.ijtsrd.com/computer-science/other/18206/hdfs-erasure-coded-information-repository-system-for-hadoop-clusters/ameena-anjum

Authors and Affiliations

Keywords

Related Articles

Social Media Use and Junior High School Student’s Academic Performance in the Division of Northern Samar

This descriptive correlational study was conducted to determine the social media utilization and its effects on student’s academic performance in selected secondary schools in the Division of Northern Samar. This study u...

Stability Indicating HPLC Method Development A Review

High performance liquid chromatography HPLC is an essential analytical tool for evaluating drug stability. HPLC methods must be able to isolate, detect, and quantify drug related degradation products that may form during...

An Efficient and Safe Data Sharing Scheme for Mobile Cloud Computing

As the popularity of cloud computing is increasing, mobile devices at any time can store or retrieve personal information from anywhere. As a result, the issue of data protection in the mobile cloud is becoming increasin...

A Review on use of Bituminous Pavementwastes in Cement Concrete

In general, aggregate make up 60 75 of concrete volume, so their selection is important, also they control concrete properties. Aggregate provide strength and wear resistance in these applications. Hence, the selection a...

Survey Paper on 3-D Bio-Printing for Hard Tissue

Three-dimensional bioprinting is basically for creating or formation of the natural developing which includes allocating cells into the biocompatible stage by applying a liberal layer-by-layer for dealing with the tissue...

Download PDF file
  • EP ID EP389959
  • DOI -
  • Views 66
  • Downloads 0

How To Cite

(2018). HDFS: Erasure-Coded Information Repository System for Hadoop Clusters. International Journal of Trend in Scientific Research and Development, 2(5), 1957-1960. https://www.europub.co.uk/articles/-A-389959