Big Data Analysis and Its Scheduling Policy – Hadoop
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 1
Abstract
Abstract: This paper is deals with Parallel Distributed system. Hadoop has become a central platform tostore big data through its Hadoop Distributed File System (HDFS) as well as to run analytics on this stored bigdata using its MapReduce component. Map Reduce programming model have shown great value inprocessing huge amount of data. Map Reduce is a common framework for data-intensive distributedcomputing of batch jobs. Hadoop Distributed File System (HDFS) is a Java-based file system that providesscalable and reliable data storage that is designed to span large clusters of commodity servers. In all Hadoopimplementations, the default FIFO scheduler is available where jobs are scheduled in FIFO order with supportfor other priority based schedulers also. During this paper, we are going to study a Hadoop framework, HDFSdesign and Map reduce Programming model. And also various schedulers possible with Hadoop and providedsome behavior of the current scheduling schemes in Hadoop on a locally deployed cluster is described.
Authors and Affiliations
Divya S , Kanya Rajesh R , Rini Mary NithilaI , Vinothini M
Resource-Diversity Tolerant: Resource Allocation in the Cloud Infrastructure Services
Abstract: The cloud offers data processing, data centers to process and preserve the transactional data of the clients. Dynamic capacity provisioning is promising approach for reducing energy consumption by dynamicallych...
Privacy Preservation by Using AMDSRRC for Hiding Highly Sensitive Association Rule
Abstract: Researchers are needed for settling on the choice of information mining. In any case a few associations to help with some external counsellor for the procedure of information mining on the grounds that th...
Tool Support for the Service Oriented Modelling
Abstract: The Tool Support for the Service Orientated modelling is one of the technical project and to model the changes from the existing metamodel and to define the grammar for the internal policy and the coordination...
Performance Analysis of Hybrid (supervised and unsupervised) method for multiclass data set
Abstract: Due to the increasing demand for multivariate data analysis from the various application the dimensionality reduction becomes an important task to represent the data in low dimensional space for the robus...
Agriculture Ontology for Sustainable Development in Nigeria
Nigeria, a country of more than 160 million people; also, the biggest oil exporter in Africa [1] Nigeria with her oil wealth, food security, and unemployment remains a serious problem. Shortage and increase in...