Leveraging Ensemble Machine Learning for Credit Risk Assessment in Underserved U.S. Small Businesses

Authors

  • Zaolin Zhang City University of New York, New York, New York, 10022, USA Author

DOI:

https://doi.org/10.71222/5tj1w207

Keywords:

ensemble learning, credit risk, underserved businesses, boosting models, financial inclusion, machine learning interpretability

Abstract

This study investigates the role of ensemble machine learning techniques in improving credit risk assessment for underserved small businesses in the United States. Conventional credit evaluation systems rely heavily on formal financial documentation and collateral requirements, excluding a large proportion of early-stage, minority-owned, or cash-based enterprises. Drawing on the uploaded document's empirical modeling structure, the study integrates gradient boosting frameworks, random forest classifiers, and macro-behavioral feature engineering to construct a multidimensional risk assessment model. The methodology incorporates structured preprocessing, cross-validated ensemble training, and interpretability analysis using SHAP values. Results show that boosting-based models consistently outperform traditional approaches, achieving stronger precision, recall, and AUC scores while capturing nuanced behavioral and macroeconomic interactions. The findings highlight the pot ential of ensemble learning to expand access to credit, reduce misclassification bias, and support inclusive economic development. The study concludes with recommendations for lenders, policymakers, and researchers regarding the deployment of ensemble analytics and the future integration of fairness-aware modeling.

References

1. L. Breiman, "Random forests," Machine learning, vol. 45, no. 1, pp. 5-32, 2001. doi: 10.1023/a:1010933404324

2. T. Chen, "XGBoost: A Scalable Tree Boosting System," Cornell University, 2016.

3. G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, and T. Y. Liu, "Lightgbm: A highly efficient gradient boosting decision tree," Advances in neural information processing systems, vol. 30, 2017.

4. FR Banks, "Small business credit survey," Report on Minority-Owned, 2017.

5. K. Langenbucher, "Responsible AI-based credit scoring-a legal framework," European Business Law Review, vol. 31, no. 4, 2020.

Downloads

Published

12 January 2026

Issue

Section

Article

How to Cite

Zhang, Z. (2026). Leveraging Ensemble Machine Learning for Credit Risk Assessment in Underserved U.S. Small Businesses. European Journal of AI, Computing & Informatics, 2(1), 14-20. https://doi.org/10.71222/5tj1w207