ENHANCED NEIGHBORHOOD NORMALIZED POINTWISE MUTUAL INFORMATION ALGORITHM FOR CONSTRAINT AWARE DATA CLUSTERING
Journal Title: ICTACT Journal on Soft Computing - Year 2016, Vol 6, Issue 4
Abstract
Clustering similar data items is an important technique for mining useful patterns, and training (learning) is an important step in improving clustering performance. This paper proposes a constraint-learning, semi-supervised methodology that combines an SVM with a Normalized Pointwise Mutual Information computation strategy to increase both the relevance and the efficiency of clustering. A hard-margin SVM classifier first roughly classifies the initial set. A recursive re-clustering approach that incorporates the Enhanced Neighborhood Normalized Pointwise Mutual Information (ENNPI) algorithm then achieves a higher degree of relevance in the final clustered set. The approach reaches an overall F-Measure of 94.09%, an improvement over the existing algorithms it is compared with.
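For readers unfamiliar with the measure named in the title, the following is a minimal sketch of standard normalized pointwise mutual information (NPMI) computed from co-occurrence counts. It is not the ENNPI algorithm itself, whose neighborhood and enhancement steps are not described in this abstract. The function name `npmi_from_counts` and its parameters are hypothetical and chosen only for illustration.

```python
import math


def npmi_from_counts(n_xy: int, n_x: int, n_y: int, n_total: int) -> float:
    """Standard NPMI of two items x and y, estimated from raw counts.

    n_xy    -- observations containing both x and y
    n_x     -- observations containing x
    n_y     -- observations containing y
    n_total -- total number of observations

    Returns a value in [-1, 1]: -1 when x and y never co-occur,
    0 when they are independent, 1 when they always occur together.
    (Hypothetical helper; not taken from the paper.)
    """
    if n_total <= 0 or n_x <= 0 or n_y <= 0:
        raise ValueError("counts must be positive")
    if n_xy == 0:
        return -1.0  # limiting value when the pair never co-occurs
    p_xy = n_xy / n_total
    if p_xy >= 1.0:
        # -log p(x, y) would be 0, so the ratio is undefined here.
        raise ValueError("NPMI is undefined when p(x, y) = 1")
    p_x = n_x / n_total
    p_y = n_y / n_total
    pmi = math.log(p_xy / (p_x * p_y))  # pointwise mutual information
    return pmi / -math.log(p_xy)        # normalize by -log p(x, y)


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(npmi_from_counts(n_xy=25, n_x=50, n_y=50, n_total=100))
    print(npmi_from_counts(n_xy=20, n_x=20, n_y=20, n_total=100))
```

Normalizing by -log p(x, y) bounds the score, which makes it easier to compare across item pairs with very different frequencies than raw PMI.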
Authors and Affiliations
Pushpa C N, Gerard Deepak, Mohammed Zakir, Thriveni J, Venugopal K R
ENHANCED HYBRID PSO – ACO ALGORITHM FOR GRID SCHEDULING
Grid computing is a high performance computing environment to solve larger scale computational demands. Grid computing contains resource management, task scheduling, security problems, information management and so on. T...
FUZZY PROBABILISTIC AND FRACTAL DIMENSIONAL APPROACH FOR CHLORIDE INDUCED CORROSION TIME (CICT)
An attempt for exertion is made to utilize the capacity of fuzzy arbitrariness in dealt with instabilities to develop a generic approach for strength based administration life plan of fortified cement basic individuals....
HEART DISEASE PREDICTION USING DATA MINING TECHNIQUES
Mining is a technique that is performed on large databases for extracting hidden patterns by using combinational strategy from statistical analysis, machine learning and database technology. Further, the medical data min...
IMPLEMENTATION OF NONLINEAR FUZZY LOGIC FRACTIONAL ORDER PID CONTROLLER (NFL-FOPID) WITH FIRST ORDER TRANSFER FUNCTION
Today, the need for controllers in engineering and the process industries is growing. Among all controllers, Proportional Integral Derivative (PID) controllers are widely...
MULTI-DOCUMENT TEXT SUMMARIZATION USING CLUSTERING TECHNIQUES AND LEXICAL CHAINING
This paper investigates the use of clustering and lexical chains to produce coherent summaries of multiple documents in text format to generate an indicative, less redundant summary. The summary is designed as per user’s...