Leveraging L-BFGS Optimization and Activation Functions in Multi-Layer Perceptron for Small-Scale Credit Risk Assessment Dataset
DOI:
https://doi.org/10.24090/tids.v3i1.14945Keywords:
Credit Risk, Multi-Layer Perceptron, Small Data, Optimizer and Activation PerformanceAbstract
The urgent need for accurate credit risk models often clashes with the practical reality of operating with small, imbalanced datasets, where standard deep learning configurations can be inefficient. This research addresses the critical gap in optimizing models for such common yet understudied scenarios. We conduct a rigorous empirical study using a Multi-Layer Perceptron on a 381-instance mortgage dataset, systematically comparing the performance of a quasi-Newton optimizer (L-BFGS) against first-order methods (Adam, SGD) across four distinct activation functions. The methodology is grounded in the theory that L-BFGS is better suited for deterministic, full-batch optimization. The results show that while the Adam optimizer with a Logistic function achieved the highest F1-Score (0.8966), the L-BFGS optimizer produced a highly competitive F1-Score (0.8849). Critically, L-BFGS converged in just 55 iterations compared to Adam's 1556, validating its theoretical efficiency for this problem scale and revealing a significant performance-versus-cost trade-off. This study concludes that L-BFGS provides a superior practical solution for small-scale credit risk assessment, offering a near-optimal balance between high predictive accuracy and exceptional computational efficiency, especially in resource-constrained environments where rapid convergence is essential and model interpretability, robustness, and reproducibility are equally prioritized for deployment.References
[1] W. Zhiqin, "Artificial intelligence and machine learning in credit risk assessment: Enhancing accuracy and ensuring fairness," vol. 12, no. 11, pp. 19-34, 2024.
[2] N. Suhadolnik, J. Ueyama, and S. Da Silva, "Machine learning for enhanced credit risk assessment: An empirical approach," vol. 16, no. 12, p. 496, 2023.
[3] J. Aria, C. Diego, D. Jose, and E. Ok, "Machine Learning in Credit Risk Management: Evaluating the Role of AI and Blockchain," 2024.
[4] S. Mestiri, S. J. D. S. i. F. Mestiri, and Economics, "Credit scoring using machine learning and deep Learning-Based models," vol. 2, p. 236, 2024.
[5] W. A. Addy, A. O. Ajayi-Nifise, B. G. Bello, S. T. Tula, O. Odeyemi, and T. Falaiye, "AI in credit scoring: A comprehensive review of models and predictive analytics," vol. 18, no. 2, pp. 118-129, 2024.
[6] G. S. da S. Gomes, T. B. Ludermir, and L. M. Lima, "Comparison of new activation functions in neural network for forecasting financial time series," vol. 20, no. 3, pp. 417-439, 2011.
[7] O. Gbeminiyi Deborah, "The role of machine learning in enhancing credit scoring models for financial inclusion," World Journal of Advanced Research and Reviews, vol. 17, no. 3, pp. 1095-1106, 2023.
[8] Z. Askar and K. Akmaral, "COMPARATIVE OVERVIEW OF MACHINE LEARNING MODELS FOR CREDIT SCORING," vol. 10, no. 5 (134), pp. 36-40, 2025.
[9] T. Xu, "Comparative Analysis of Machine Learning Algorithms for Consumer Credit Risk Assessment," vol. 4, pp. 60-67, 2024.
[10] J. P. Noriega, L. A. Rivera, and J. A. Herrera, "Machine learning for credit risk prediction: A systematic literature review," vol. 8, no. 11, p. 169, 2023.
[11] J. W. Wanjohi, B. N. Mpinda, and O. O. Awe, "Comparing logistic regression and neural networks for predicting skewed credit score: a LIME-based explainability approach," vol. 25, no. 3, pp. 49-67, 2024.
[12] F. Zocco and S. McLoone, "An adaptive memory multi-batch l-bfgs algorithm for neural network training," vol. 53, no. 2, pp. 8199-8204, 2020.
[13] E. Jafarigol and T. Trafalis, "A review of machine learning techniques in imbalanced data and future trends," 2023.
[14] M. X. Low et al., "Comparison of label encoding and evidence counting for malware classification," vol. 12, no. 6, pp. 17-30, 2022.
[15] R. Mohammed, J. Rawashdeh, and M. Abdullah, "Machine learning with oversampling and undersampling techniques: overview study and experimental results," in 2020 11th international conference on information and communication systems (ICICS), 2020, pp. 243-248: IEEE.
[16] N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," vol. 16, pp. 321-357, 2002.
[17] R. Bollapragada, J. Nocedal, D. Mudigere, H.-J. Shi, and P. T. P. Tang, "A progressive batching L-BFGS method for machine learning," in International Conference on Machine Learning, 2018, pp. 620-629: PMLR.
[18] N. Mohammadi and M. Zangeneh, “Customer Credit Risk Assessment using Artificial Neural Networks,” International Journal of Information Technology and Computer Science, vol. 8, no. 3, pp. 58–66, Mar. 2016, doi: 10.5815/ijitcs.2016.03.07.
[19] O. Shchur, M. Mumme, A. Bojchevski, and S. Günnemann, “Pitfalls of Graph Neural Network Evaluation,” Jun. 2019, [Online]. Available: http://arxiv.org/abs/1811.05868
[20] R. Rofik, R. Aulia, K. Musaadah, S. S. F. Ardyani, and A. A. Hakim, “Optimization of Credit Scoring Model Using Stacking Ensemble Learning and Oversampling Techniques,” Journal of Information System Exploration and Research, vol. 2, no. 1, Dec. 2023, doi: 10.52465/joiser.v2i1.203.
Downloads
Published
How to Cite
License
Copyright (c) 2026 Nguyen N. Thiện

This work is licensed under a Creative Commons Attribution 4.0 International License.

