Survey on Schedulers Optimization to Handle Multiple Jobs in Hadoop Cluster
Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 3
Abstract
An apache Hadoop project is a good platform that supports cost-effective such as commodity hardware implementation, with scalable infrastructure called Hadoop Distributed File System HDFS with parallel processing mechanism called MapReduce. Hadoop is well known for Big Data analytics requires more resources for collecting; storing, processing petabytes of large data requires one meaningful resource management. Hadoop handles jobs in batch modes, allocating resources, scheduling to these modes is an important issue in terms of Network Bandwidth, CPU time and Memory. Resources handled by MapReduce schedulers assigning resources in the form of MapReduceTasks. The MapReduceTasks are carefully handled by MapReduce schedulers by setting some benchmarks for individual namely MapCombineReducer and different Block Sizes, which ensures schedulers are optimized to achieve maximum efficiency in storage capacity, time and cost for handling multiple batch jobs in multiple cluster while guaranteeing the effectiveness of individual scheduler in terms of job execution.
Privacy Preserving Suppression Algorithm for Anonymous Databases
Suppose a medical facility connected with a research institution and the researchers can use the medical details of a patient without knowing the personal details. Thus the research data base used by the researchers must...
Microbial Quality Indicators of Poultry Meat during Processing in Modern and Traditional Slaughterhouses - Omdurman Locality Khartoum State - Sudan
"Abstract Background: Poultry meat recently became a major source of animal protein for a lot of population, This required restricted regulations , procedures , standards and quality indicators to ensure wholesomeness an...
Pattern of Retinitis Pigmentosa in Manipur among the Patients Attending Retina Clinic Regional Institute of Medical Sciences, Imphal, Manipur, India
Purpose: To evaluate the Frequency and Pattern of Retinitis pigmentosa in Retina Clinic of a Tertiary Care Hospital Regional Institute of Medical Sciences, Imphal. Introduction: Retinitis pigmentosa (RP) comprises a grou...
Matrix Convolution using Parallel Programming
The convolution theorem is used to multiply matrices of two different sizes i.e. matrices in which the number of rows in the first matrix is not equal to the number of columns in the second matrix. In this study, the mul...
System for Secure Storage and Auditing of Cloud Data
"Cloud computing is compilation of existing technique and technologies,packaged within a new infrastructure paradigm that offers improved scalability, elasticity, business agility, faster startup time, reduced management...