banner

A novel credit scoring system in financial institutions using artificial intelligence technology

Geetha Manikanta Jakka, Amrutanshu Panigrahi, Abhilash Pati, Manmath Nath Das, Jyotsnarani Tripathy

Abstract


In order to evaluate a person’s or a company’s creditworthiness, financial institutions must use credit scoring. Traditional credit scoring algorithms frequently rely on manual and rule-based methods, which can be tedious and inaccurate. Recent developments in artificial intelligence (AI) technology have opened up possibilities for creating more reliable and effective credit rating systems. The data are pre-processed, including scaling using the 0–1 normalization method and resolving missing values by imputation. Information gain (IG), gain ratio (GR), and chi-square are three feature selection methodologies covered in the study. While GR normalizes IG by dividing it by the total entropy of the feature, IG quantifies the reduction in total entropy by adding a new feature. Based on chi-squared statistics, the most vital traits are determined using chi-square. This research employs different ML models to develop a hybrid model for credit score prediction. The ML algorithms support vector machine (SVM), neural networks (NNs), decision trees (DTs), random forest (RF), and logistic regression (LR) classifiers are employed here for experiments along with IG, GR, and chi-square feature selection methodologies for credit prediction over Australian and German datasets. The study offers an understanding of the decision-making process for informative characteristics and the functionality of machine learning (ML) in credit prediction tasks. The empirical analysis shows that in the case of the German dataset, the DT with GR feature selection and hyperparameter optimization outperforms SVM and NN with an accuracy of 99.78%. For the Australian dataset, SVM with GR feature selection outperforms NN and DT with an accuracy of 99.98%.


Keywords


credit scoring system; machine learning (ML); classification techniques; feature selection algorithms; hyperparameter optimization

Full Text:

PDF

References


1. Pławiak P, Abdar M, Acharya UR. Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring. Applied Soft Computing 2019; 84: 105740. doi: 10.1016/j.asoc.2019.105740

2. Tripathi D, Edla DR, Cheruku R, Kuppili V. A novel hybrid credit scoring model based on ensemble feature selection and multilayer ensemble classification. Computational Intelligence 2019; 35(2): 371–394. doi: 10.1111/coin.12200

3. Rodgers W, Hudson R, Economou F. Modeling credit and investment decisions based on AI algorithmic behavioral pathways. Technological Forecasting and Social Change 2023; 191: 122471. doi: 10.1016/j.techfore.2023.122471

4. Alaei F, Alaei A, Pal U, Blumenstein M. A comparative study of different texture features for document image retrieval. Expert Systems with Applications 2019; 121: 97–114. doi: 10.1016/j.eswa.2018.12.007

5. Ping Y, Yongheng L. Neighborhood rough set and SVM based hybrid credit scoring classifier. Expert Systems with Applications 2011; 38(9): 11300–11304. doi: 10.1016/j.eswa.2011.02.179

6. Zhang D, Zhou X, Leung SCH, Zheng J. Vertical bagging decision trees model for credit scoring. Expert Systems with Applications 2010; 37(12): 7838–7843. doi: 10.1016/j.eswa.2010.04.054

7. Dumitrescu E, Hué S, Hurlin C, Tokpavi S. Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research 2022; 297(3): 1178–1192. doi: 10.1016/j.ejor.2021.06.053

8. Wei S, Yang D, Zhang W, Zhang S. A novel noise-adapted two-layer ensemble model for credit scoring based on backflow learning. IEEE Access 2019; 7: 99217–99230. doi: 10.1109/ACCESS.2019.2930332

9. Feng X, Xiao Z, Zhong B, et al. Dynamic ensemble classification for credit scoring using soft probability. Applied Soft Computing 2018; 65: 139–151. doi: 10.1016/j.asoc.2018.01.021

10. Zhang W, Yang D, Zhang S, et al. A novel multi-stage ensemble model with enhanced outlier adaptation for credit scoring. Expert Systems with Applications 2021; 165(4): 113872. doi: 10.1016/j.eswa.2020.113872

11. Xia Y, Zhao J, He L, et al. A novel tree-based dynamic heterogeneous ensemble method for credit scoring. Expert Systems with Applications 2020; 159: 113615. doi: 10.1016/j.eswa.2020.113615

12. Kuppili V, Tripathi D, Edla DR. Credit score classification using spiking extreme learning machine. Computational Intelligence 2020; 36(2): 402–426. doi: 10.1111/coin.12242

13. Niu B, Ren J, Li X. Credit scoring using machine learning by combing social network information: Evidence from peer-to-peer lending. Information 2019; 10(12): 397. doi: 10.3390/info10120397

14. Faisal MF, Saqlain MNU, Bhuiyan MAS, et al. Credit approval system using machine learning: Challenges and future directions. In: 2021 International Conference on Computing, Networking, Telecommunications & Engineering Sciences Applications (CoNTESA); 9–10 December 2021; Tirana, Albania. pp. 76–82.

15. Xia Y, Liu C, Da B, Xie F. A novel heterogeneous ensemble credit scoring model based on bstacking approach. Expert Systems with Applications 2018; 93: 182–199. doi: 10.1016/j.eswa.2017.10.022

16. Trivedi SK. A study on credit scoring modeling with different feature selection and machine learning approaches. Technology in Society 2020; 63: 101413. doi: 10.1016/j.techsoc.2020.101413

17. Pati A, Parhi M, Pattanayak BK. A review on prediction of diabetes using machine learning and data mining classification techniques. International Journal of Biomedical Engineering and Technology 2023; 41(1): 83–109. doi: 10.1504/IJBET.2023.128514

18. Pati A, Parhi M, Pattanayak BK. IHDPM: An integrated heart disease prediction model for heart disease prediction. International Journal of Medical Engineering and Informatics 2022; 14(6): 564–577. doi: 10.1504/IJMEI.2022.126526

19. Ghodselahi A, Amirmadhi A. Application of artificial intelligence techniques for credit risk evaluation. International Journal of Modeling and Optimization 2011; 1(3): 243–249. doi: 10.7763/IJMO.2011.V1.43

20. Rout SK, Sahu B, Panigrahi A, et al. Early detection of sepsis using LSTM neural network with electronic health record. In: Ambient Intelligence in Health Care: Proceedings of ICAIHC 2022; 15–16 April 2022; Bhubaneswar, India. Springer; 2022. pp. 201–207.

21. Sahu B, Panigrahi A, Rout SK, Pati A. Hybrid multiple filter embedded political optimizer for feature selection. In: 2022 International Conference on Intelligent Controller and Computing for Smart Power (ICICCSP); 21–23 July 2022; Hyderabad, India. pp. 1–6.




DOI: https://doi.org/10.32629/jai.v6i2.824

Refbacks

  • There are currently no refbacks.


Copyright (c) 2023 Geetha Manikanta Jakka, Amrutanshu Panigrahi, Abhilash Pati, Manmath Nath Das, Jyotsnarani Tripathy

License URL: https://creativecommons.org/licenses/by-nc/4.0