Extracting Credit Rules from Imbalanced Data: The Case of an Iranian Export Development Bank

Journal Title: Journal of Information Systems and Telecommunication - Year 2015, Vol 3, Issue 1

Abstract

Credit scoring is an important topic, and banks collect different data from their loan applicant to make an appropriate and correct decision. Rule bases are of more attention in credit decision making because of their ability to explicitly distinguish between good and bad applicants. The credit scoring datasets are usually imbalanced. This is mainly because the number of good applicants in a portfolio of loan is usually much higher than the number of loans that default. This paper use previous applied rule bases in credit scoring, including RIPPER, OneR, Decision table, PART and C4.5 to study the reliability and results of sampling on its own dataset. A real database of one of an Iranian export development bank is used and, imbalanced data issues are investigated by randomly Oversampling the minority class of defaulters, and three times under sampling of majority of non-defaulters class. The performance criterion chosen to measure the reliability of rule extractors is the area under the receiver operating characteristic curve (AUC), accuracy and number of rules. Friedman’s statistic is used to test for significance differences between techniques and datasets. The results from study show that PART is better and good and bad samples of data affect its results less.

Authors and Affiliations

Seyed Mahdi Sadatrasoul, Mohammad Reza Gholamian, Kamran Shahanaghi

Keywords

Related Articles

A Low-Jitter 20-110MHz DLL Based on a Simple PD and Common-Mode Voltage Level Corrected Differential Delay Elements

In this paper, a 16-phases 20MHz to 110MHz low jitter delay locked loop, DLL, is proposed in a 0.35µm CMOS process. A sensitive open loop phase detector, PD, is introduced based on a novel idea to simply detect small pha...

A New Method for Detecting the Number of Coherent Sources in the Presence of Colored Noise

In this paper, a new method for determining the number of coherent/correlated signals in the presence of colored noise is proposed which is based on the Eigen Increment Threshold (EIT) method. First, we present a new app...

An Improved Method for TOA Estimation in TH-UWB System considering Multipath Effects and Interference

UWB ranging is usually based on the time-of-arrival (TOA) estimation of the first path. There are two major challenges in TOA estimation. One challenge is to deal with multipath channel, especially in indoor environments...

Low Complexity Median Filter Hardware for Image Impulsive Noise Reduction

Median filters are commonly used for removal of the impulse noise from images. De-noising is a preliminary step in online processing of images, thus hardware implementation of median filters is of great interest. Hence,...

PSO-Algorithm-Assisted Multiuser Detection for Multiuser and Inter-symbol Interference Suppression in CDMA Communications

Applying particle swarm optimization (PSO) algorithm has become a widespread heuristic technique in many fields of engineering. In this paper, we apply PSO algorithm in additive white Gaussian noise (AWGN) and multipath...

Download PDF file
  • EP ID EP184747
  • DOI 10.7508/jist.2015.01.004
  • Views 109
  • Downloads 0

How To Cite

Seyed Mahdi Sadatrasoul, Mohammad Reza Gholamian, Kamran Shahanaghi (2015). Extracting Credit Rules from Imbalanced Data: The Case of an Iranian Export Development Bank. Journal of Information Systems and Telecommunication, 3(1), 22-28. https://www.europub.co.uk/articles/-A-184747