Performance Analysis of Machine Learning Techniques on Software Defect Prediction using NASA Datasets

Abstract

Defect prediction at early stages of software development life cycle is a crucial activity of quality assurance process and has been broadly studied in the last two decades. The early prediction of defective modules in developing software can help the development team to utilize the available resources efficiently and effectively to deliver high quality software product in limited time. Until now, many researchers have developed defect prediction models by using machine learning and statistical techniques. Machine learning approach is an effective way to identify the defective modules, which works by extracting the hidden patterns among software attributes. In this study, several machine learning classification techniques are used to predict the software defects in twelve widely used NASA datasets. The classification techniques include: Naïve Bayes (NB), Multi-Layer Perceptron (MLP). Radial Basis Function (RBF), Support Vector Machine (SVM), K Nearest Neighbor (KNN), kStar (K*), One Rule (OneR), PART, Decision Tree (DT), and Random Forest (RF). Performance of used classification techniques is evaluated by using various measures such as: Precision, Recall, F-Measure, Accuracy, MCC, and ROC Area. The detailed results in this research can be used as a baseline for other researches so that any claim regarding the improvement in prediction through any new technique, model or framework can be compared and verified.

Authors and Affiliations

Ahmed Iqbal, Shabib Aftab, Umair Ali, Zahid Nawaz, Laraib Sana, Munir Ahmad, Arif Husen

Keywords

Related Articles

A Vertical Handover Management for Mobile Telemedicine System using Heterogeneous Wireless Networks

Application of existing mobile telemedicine system is restricted by the imperfection of network coverage, network capacity, and mobility. In this paper, a novel telemedicine based handover decision making (THODM) algorit...

Designing Graphical Data Storage Model for Gene-Protein and Gene-Gene Interaction Networks

Graph is an expressive way to represent dynamic and complex relationships in highly connected data. In today’s highly connected world, general purpose graph databases are providing opportunities to experience benefits of...

The Ethical and Social Issues of Information Technology: A Case Study

The present study is conducted among 283 students from University of Zabol to identify the harm and ethical and social issues in the field of information technology and to classify the immoral practices that students are...

 A Conceptual Framework for an Ontology-Based Examination System

There is an increasing reliance on the web for many software application deployments. Millions of services ranging from commerce, education, tourism and entertainment are now available on the web, making the web to be th...

A Novel Approach to Implement Fixed to Mobile Convergence in Mobile Adhoc Networks

Fixed to Mobile Convergence, FMC is one of the most celebrated applications of wireless networks, where a telephonic call from some fixed telephonic infrastructure is forwarded to a mobile device. Problem of extending th...

Download PDF file
  • EP ID EP578420
  • DOI 10.14569/IJACSA.2019.0100538
  • Views 123
  • Downloads 0

How To Cite

Ahmed Iqbal, Shabib Aftab, Umair Ali, Zahid Nawaz, Laraib Sana, Munir Ahmad, Arif Husen (2019). Performance Analysis of Machine Learning Techniques on Software Defect Prediction using NASA Datasets. International Journal of Advanced Computer Science & Applications, 10(5), 300-308. https://www.europub.co.uk/articles/-A-578420