slugReview Paper on Clustering and Validation Techniques
Journal Title: International Journal for Research in Applied Science and Engineering Technology (IJRASET) - Year 2014, Vol 2, Issue 5
Abstract
Clustering is important in data analysis and data mining applications. It is the task of grouping a set of objects so that objects in the same group are more similar to each other than to those in other groups (clusters). The overall goal of the data mining process is to extract information from a large data set and transform it into an understandable form for further use. Clustering can be done by the different no. of algorithms such as hierarchical, partitioning, grid and density based algorithms. Hierarchical clustering is the connectivity based clustering. Partitioning is the centroid based clustering, the value of k - mean is set. Clustering has been applied to serve various purposes like, to gain insight to data distribution, generate hypotheses, to observe the characteristic and find anomalies. The intension of this paper is to provide a categorization of some well known clustering algorithms. It also describes the clustering process and overview of the different clustering methods. The validation of clustering structures is the most difficult and frustrating part of cluster analysis. Validation comparing the results of two clusters and find out the best cluster.
Authors and Affiliations
Jyoti, Neha Kaushik, Rekha
Mathematical Modeling and Analysis of Different Type of Fuel Injector in Scramjet Engine Using CFD Simulation in Fluent
At Present the most promising propulsive systems, the scramjet engine has drawn the attention of many researchers. The two-dimensional coupled implicit NS equations, the standard k-ε turbulence model and the finite-rate...
Optimization of Forming Process Parameters for Appropriate Distribution of Wall Thickness in Sheet Metal Component
While obtaining the final sheet metal component the defects are occurred in sheet metal forming process which are reduced by varying the forming process parameters by trial and error method. This causes loss in terms of...
Study of the Impact of Financial Flexibility on Dividend Policy with Respect to the Life Cycle (A Study Case: Tehran Stock Exchange Listed Companies, Iran)
The main purpose of this study is determination of the relation between financial flexibility and dividend policy under the moderating effect of lifecycle among the firms listed on Tehran's stock exchange between 2008 a...
Concept-Based Document Clustering Using Bisecting K-Means Algorithm
Document Clustering has been extensively investigated as a methodology for improving document search and retrieval. Although good clustering algorithms are widely available, good solutions for labeling the clustered res...
Green Computing - An Efficient Computer Power Consumption Benchmarking
Computer power consumption is becoming a more important topic as electricity prices climb and as pollution is becoming a bigger problem in the world. It is common knowledge that most of the world's power plants emit pol...