Imputation And Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach In Dementia Diagnosis
Journal Title: International Journal of Advanced Research in Artificial Intelligence (IJARAI) - Year 2012, Vol 1, Issue 4
Abstract
This paper compares several data imputation approaches for filling in missing data and proposes a combined approach to accurately estimate missing attribute values in a patient database. The study suggests a more robust technique that is likely to supply a value closer to the missing one, for effective classification and diagnosis. Initially, the data is clustered and the z-score method is used to select possible values for an instance with missing attribute values. A multiple imputation method using LSSVM (Least Squares Support Vector Machine) is then applied to select the most appropriate values for the missing attributes. Five imputed datasets have been used to demonstrate the performance of the proposed method. Experimental results show that the method outperforms conventional multiple imputation and mean substitution. Moreover, the proposed method, CZLSSVM (Clustered Z-score Least Square Support Vector Machine), has been evaluated on two classification problems with incomplete data. The efficacy of the imputation methods has been evaluated using an LSSVM classifier. Experimental results indicate that classification accuracy increases when CZLSSVM is used to estimate missing attribute values. CZLSSVM is found to outperform other data imputation approaches such as decision trees, rough sets, artificial neural networks, K-NN (K-Nearest Neighbour) and SVM. Further, it is observed that CZLSSVM yields 95 per cent accuracy and better prediction capability than the other methods included and tested in the study.
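The abstract gives only the outline of the pipeline (cluster, z-score selection, LSSVM-based imputation), so the following is a minimal sketch under stated assumptions and not the authors' implementation. It assumes that the complete records of the relevant cluster are already available, that the z-score step simply discards records whose value for the target attribute lies more than `z_max` standard deviations from the cluster mean, and that a standard LS-SVM regression with an RBF kernel produces a single estimate (the paper uses multiple imputation with five datasets, which is not reproduced here). The names `impute`, `lssvm_fit`, `lssvm_predict`, `z_max`, `gamma` and `sigma` are hypothetical.

```python
import math

def rbf(a, b, sigma=1.0):
    """RBF kernel between two feature vectors."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * sigma ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def lssvm_fit(X, y, gamma=10.0, sigma=1.0):
    """LS-SVM regression: solve [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]."""
    n = len(X)
    A = [[0.0] + [1.0] * n]
    for i in range(n):
        A.append([1.0] + [rbf(X[i], X[j], sigma) + (1.0 / gamma if i == j else 0.0)
                          for j in range(n)])
    sol = solve(A, [0.0] + list(y))
    return sol[0], sol[1:]  # bias b, coefficients alpha

def lssvm_predict(X, b, alpha, x, sigma=1.0):
    """Evaluate f(x) = b + sum_i alpha_i * K(x_i, x)."""
    return b + sum(a * rbf(xi, x, sigma) for a, xi in zip(alpha, X))

def impute(cluster_rows, incomplete, target, z_max=2.0, gamma=10.0, sigma=1.0):
    """Estimate incomplete[target] from complete rows of the same cluster (assumed given)."""
    vals = [r[target] for r in cluster_rows]
    mu = sum(vals) / len(vals)
    sd = math.sqrt(sum((v - mu) ** 2 for v in vals) / len(vals))
    # Assumed z-score step: keep rows whose target value is not an outlier.
    keep = [r for r in cluster_rows if sd == 0 or abs(r[target] - mu) / sd <= z_max]
    X = [[v for j, v in enumerate(r) if j != target] for r in keep]
    y = [r[target] for r in keep]
    b, alpha = lssvm_fit(X, y, gamma, sigma)
    x = [v for j, v in enumerate(incomplete) if j != target]
    return lssvm_predict(X, b, alpha, x, sigma)

if __name__ == "__main__":
    # Toy cluster: second attribute is twice the first, plus one outlier row.
    rows = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0], [5.0, 10.0], [3.0, 100.0]]
    print(impute(rows, [3.0, None], target=1))
```

In this toy example the outlier row is dropped by the z-score filter before the LS-SVM is fitted, so the estimate follows the remaining rows. The values of `gamma` and `sigma` are arbitrary here and would normally be tuned, for instance by cross-validation.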
Authors and Affiliations
T Sivapriya, A. R. Banu Kamal, V. Thavavel
Moving Domestic Robotics Control Method Based on Creating and Sharing Maps with Shortest Path Findings and Obstacle Avoidance
A control method for moving robotics in closed areas, based on creating and sharing maps through shortest path finding and obstacle avoidance, is proposed. Through simulation study, the validity of the proposed method is conf...
Comparison Between Linear and Nonlinear Models of Mixed Pixels in Remote Sensing Satellite Images Based on Cierniewski Surface BRDF Model by Means of Monte Carlo Ray Tracing Simulation
A comparative study on linear and nonlinear mixed pixel models, in which pixels in remote sensing satellite images are composed of several ground cover materials mixed together, is conducted for remote sensing satelli...
Automatic Melakarta Raaga Identification System: Carnatic Music
It is through experience one could ascertain that the classifier in the arsenal or machine learning technique is the Nearest Neighbour Classifier. Automatic melakarta raaga identification system is achieved by identifyi...
Relation Between Chlorophyll-A Concentration and Red Tide in the Intensive Study Area of the Ariake Sea, Japan in Winter Seasons by using MODIS Data
Relation between chlorophyll-a concentration and red tide in the intensive study area of the back of Ariake Sea, Japan in the recent winter seasons is investigated by using MODIS data. Mechanism of red tide appeara...
Sensitivity Analysis on Sea Surface Temperature Estimation Methods with Thermal Infrared Radiometer Data through Simulations
Sensitivity analysis on Sea Surface Temperature (SST) estimation with Thermal Infrared Radiometer (TIR) data through simulations is conducted. Also, a Conjugate Gradient Method (CGM) based SST estimation method is propos...