Feature Selection and Extraction Framework for DNA Methylation in Cancer
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2017, Vol 8, Issue 7
Abstract
Feature selection methods for cancer classification aim to overcome the high dimensionality of biomedical data, which is a challenging task. Most feature selection methods based on DNA methylation are time-consuming during the testing phase when identifying the subset of features most relevant to accurate prediction. Hybridizing feature selection with feature extraction, however, yields a method that is far faster than feature selection alone. This paper proposes a framework that combines novel feature selection methods, which employ statistical variation, standard deviation, and entropy, with extraction methods to predict cancer using three new features: Hypomethylation, Midmethylation, and Hypermethylation. These new features represent the average methylation density of the corresponding three regions and are extracted from the selected features based on an analysis of methylation behavior. The effectiveness of the proposed framework is evaluated by breast cancer classification accuracy. The framework achieves 98.85% accuracy using only three features out of 485,577. This result demonstrates the capability of the proposed approach for breast cancer diagnosis and confirms that feature selection and extraction methods are critical for practical implementation.
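The select-then-extract idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how standard deviation and entropy are combined, nor where the three methylation regions begin and end. The sketch therefore assumes that each CpG site is scored by the sum of its standard deviation and its binned Shannon entropy, and that beta values below 0.3 count as hypomethylated and values above 0.7 as hypermethylated. The function names, the scoring rule, the bin count, and both thresholds are hypothetical.

```python
import math
from statistics import pstdev


def shannon_entropy(values, bins=10):
    """Entropy in bits of beta values in [0, 1] after equal-width binning."""
    counts = [0] * bins
    for v in values:
        counts[min(int(v * bins), bins - 1)] += 1  # clamp v == 1.0 into last bin
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts if c)


def select_sites(samples, k):
    """Return the indices of the k highest-scoring CpG sites.

    Assumed score: standard deviation plus binned entropy across samples.
    """
    def score(j):
        column = [sample[j] for sample in samples]
        return pstdev(column) + shannon_entropy(column)

    ranked = sorted(range(len(samples[0])), key=score, reverse=True)
    return sorted(ranked[:k])


def extract_three_features(sample, selected, low=0.3, high=0.7):
    """Average beta value of the selected sites in each assumed region.

    Returns (hypo, mid, hyper); an empty region contributes 0.0.
    """
    hypo, mid, hyper = [], [], []
    for j in selected:
        v = sample[j]
        if v < low:
            hypo.append(v)
        elif v > high:
            hyper.append(v)
        else:
            mid.append(v)
    return tuple(sum(g) / len(g) if g else 0.0 for g in (hypo, mid, hyper))


# Toy data: 3 samples x 4 CpG sites (beta values), purely illustrative.
samples = [
    [0.10, 0.50, 0.90, 0.20],
    [0.10, 0.55, 0.10, 0.80],
    [0.10, 0.45, 0.95, 0.25],
]
selected = select_sites(samples, k=2)
features = [extract_three_features(s, selected) for s in samples]
```

In a sketch of this kind, the resulting three-value vectors would be passed to any standard classifier in place of the full set of 485,577 methylation features, which is where the speed-up claimed in the abstract would come from.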
Authors and Affiliations
Abeer A. Raweh, Mohammad Nassef, Amr Badr