Ant Colony Optimization of Interval Type-2 Fuzzy C-Means with Subtractive Clustering and Multi-Round Sampling for Large Data

Abstract

Fuzzy C-Means (FCM) is a widely accepted clustering technique. However, it often cannot manage the different uncertainties associated with data. Interval Type-2 Fuzzy C-Means (IT2FCM) improves on FCM because it can model and minimize the effect of uncertainty efficiently. On large data, however, IT2FCM often gets trapped in local optima and fails to find optimal cluster centers. To overcome this challenge, an Ant Colony Optimization (ACO)-based approach is proposed. Another challenge is determining the number of clusters with which to perform clustering. Subtractive clustering (SC) is an efficient technique for estimating an appropriate number of clusters. For large datasets, however, the convergence time of ACO and SC becomes high, which makes it challenging to cluster the data and to evaluate the correct number of clusters. To counter the challenges of large datasets, a Multi-Round Sampling (MRS) technique is proposed. IT2FCM-ACO with SC and MRS performs clustering on subsets of the data and determines suitable cluster centers and the number of clusters. The obtained clusters are then extended to the entire dataset, which eliminates the need for IT2FCM to work on the complete dataset. Thus, the objective of this paper is to optimize IT2FCM using the ACO algorithm and to estimate the optimal number of clusters using SC, while employing MRS to handle the challenges of voluminous data. Results obtained from several clustering evaluation measures show the improved performance of IT2FCM-ACO-MRS compared to IT2FCM-ACO and IT2FCM. Speedup is computed for different sample sizes, and IT2FCM-ACO-MRS is found to be ≈1–5 times faster than IT2FCM and IT2FCM-ACO for medium datasets, whereas for large datasets it is reported to be ≈30–150 times faster.
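The sample-then-extend idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: ordinary (type-1) FCM stands in for IT2FCM, the ACO step is omitted, subtractive clustering uses a simplified stopping rule, and the sampling schedule (grow the sample each round until the centers stop moving) is an assumption, since the abstract does not specify one. All function names and parameter values (`ra`, `stop_ratio`, `sample_size`, `rounds`, `tol`) are hypothetical.

```python
import numpy as np

def subtractive_centers(X, ra=0.5, stop_ratio=0.15):
    """Simplified subtractive clustering on data scaled to [0, 1].

    Assumption: stop once the best remaining potential drops below
    stop_ratio times the first peak (no accept/reject band).
    """
    alpha, beta = 4.0 / ra**2, 4.0 / (1.5 * ra) ** 2
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)  # pairwise squared distances
    potential = np.exp(-alpha * d2).sum(axis=1)
    first_peak = potential.max()
    centers = []
    while True:
        k = int(potential.argmax())
        peak = potential[k]
        if peak < stop_ratio * first_peak:
            break
        centers.append(X[k])
        # Suppress the potential of points near the newly chosen center.
        potential = potential - peak * np.exp(-beta * d2[k])
    return np.array(centers)

def memberships(X, centers, m=2.0):
    """Standard FCM membership of every point in every cluster."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    inv = np.maximum(d2, 1e-12) ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def fcm(X, centers, m=2.0, iters=100, tol=1e-5):
    """Type-1 FCM center updates, started from the given centers."""
    for _ in range(iters):
        W = memberships(X, centers, m) ** m
        new = (W.T @ X) / W.sum(axis=0)[:, None]
        shift = np.abs(new - centers).max()
        centers = new
        if shift < tol:
            break
    return centers

def cluster_by_sampling(X, sample_size=200, rounds=5, tol=1e-2, seed=0):
    """Cluster on growing random samples, then extend to all of X."""
    rng = np.random.default_rng(seed)
    lo, hi = X.min(axis=0), X.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)
    Xs = (X - lo) / span  # scale features to [0, 1]
    n = min(sample_size, len(Xs))
    idx = rng.choice(len(Xs), size=n, replace=False)
    # Round 1: estimate the cluster count and initial centers on the sample.
    centers = fcm(Xs[idx], subtractive_centers(Xs[idx]))
    for _ in range(rounds - 1):
        extra = rng.choice(len(Xs), size=n, replace=False)
        idx = np.union1d(idx, extra)   # enlarge the sample
        new = fcm(Xs[idx], centers)    # warm start from the previous round
        moved = np.abs(new - centers).max()
        centers = new
        if moved < tol:
            break
    # Extension step: one membership pass over the full dataset.
    labels = memberships(Xs, centers).argmax(axis=1)
    return centers * span + lo, labels
```

Calling `cluster_by_sampling(X)` on an `(n, d)` NumPy array returns the estimated centers in the original feature units and a hard label per row. The iterative work touches only the samples; the full dataset is visited once, in the final membership pass.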

Authors and Affiliations

Sana Qaiyum, Izzatdin Aziz, Jafreezal Jaafar, Adam Kai Leung Wong

  • EP ID EP448671
  • DOI 10.14569/IJACSA.2019.0100106

How To Cite

Sana Qaiyum, Izzatdin Aziz, Jafreezal Jaafar, Adam Kai Leung Wong (2019). Ant Colony Optimization of Interval Type-2 Fuzzy C-Means with Subtractive Clustering and Multi-Round Sampling for Large Data. International Journal of Advanced Computer Science & Applications, 10(1), 47-57. https://www.europub.co.uk/articles/-A-448671