Covariate Feature Importance and Cross-Indication Transferability Analysis for Phase III Oncology Trial Outcome Prediction
Keywords:
clinical trial outcome prediction, covariate feature importance, cross-indication transferability, technical success probabilityAbstract
Oncology drug development suffers from the lowest clinical success rates among all therapeutic areas, with the likelihood of approval (LOA) from Phase I to regulatory approval estimated at 3.4%--5.3%. Accurate prediction of Phase III trial outcomes is essential for optimizing resource allocation and reducing late-stage attrition costs. This study presents a comparative regression analysis framework designed to identify and rank covariate features that influence Phase III oncology trial success, and to evaluate the temporal stability and cross-indication transferability of these features. Using 1,203 Phase III oncology trials registered between 2005 and 2023, sourced from ClinicalTrials.gov and the BioMedTracker database, 42 candidate covariates spanning trial design, molecular characteristics, patient baseline demographics, and historical performance metrics were extracted and analyzed. SHAP-based importance ranking revealed that prior Phase II efficacy endpoints, biomarker-driven patient selection, and sponsor therapeutic area experience constituted the three most influential predictive features. A comparison of five regression techniques demonstrated that gradient boosted regression trees achieved the highest discriminative performance (AUC = 0.823, 95% CI: 0.791--0.855) on the hold-out validation cohort. Cross-indication transferability analysis across six major cancer types showed moderate to strong covariate consistency for trial design features (Spearman ρ = 0.71--0.89) and lower consistency for molecular target features (ρ = 0.43--0.62). These findings provide quantitative evidence supporting the development of more robust technical success probability (PTS) calculation methods.References
1. I. Kola and J. Landis, "Can the pharmaceutical industry reduce attrition rates?" Nature Reviews Drug Discovery, vol. 3, no. 8, pp. 711--716, 2004.
2. J. A. DiMasi, H. G. Grabowski, and R. W. Hansen, "Innovation in the pharmaceutical industry: New estimates of R&D costs," Journal of Health Economics, vol. 47, pp. 20--33, 2016.
3. M. Hay, D. W. Thomas, J. L. Craighead, C. Economides, and J. Rosenthal, "Clinical development success rates for investigational drugs," Nature Biotechnology, vol. 32, no. 1, pp. 40--51, 2014.
4. C. H. Wong, K. W. Siah, and A. W. Lo, "Estimation of clinical trial success rates and related parameters," Biostatistics, vol. 20, no. 2, pp. 273--286, 2019.
5. D. Sun, W. Gao, H. Hu, and S. Zhou, "Why 90% of clinical drug development fails and how to improve it?" Acta Pharmaceutica Sinica B, vol. 12, no. 7, pp. 3049--3062, 2022.
6. A. W. Lo, K. W. Siah, and C. H. Wong, "Machine learning with statistical imputation for predicting drug approvals," Harvard Data Science Review, vol. 1, no. 1, 2019.
7. K. M. Gayvert, N. S. Madhukar, and O. Elemento, "A data-driven approach to predicting successes and failures of clinical trials," Cell Chemical Biology, vol. 23, no. 10, pp. 1294--1301, 2016.
8. K. W. Siah, N. W. Kelley, S. Engstrom, B. Filinger, B. Tsao, and A. W. Lo, "Predicting drug approvals: The Novartis data science and artificial intelligence challenge," Patterns, vol. 2, no. 8, p. 100312, 2021.
9. T. Fu, K. Huang, C. Xiao, L. M. Glass, and J. Sun, "HINT: Hierarchical interaction network for clinical-trial-outcome predictions," Patterns, vol. 3, no. 4, p. 100445, 2022.
10. J. W. Mandema, E. Cox, and J. Alderman, "Therapeutic benefit of eletriptan compared to sumatriptan for the acute relief of migraine pain---results of a model-based meta-analysis that accounts for encapsulation," Cephalalgia, vol. 25, no. 9, pp. 715--725, 2005.
11. M. Boucher and M. Bennetts, "The many flavors of model-based meta-analysis: Part I---Introduction and landmark data," CPT: Pharmacometrics & Systems Pharmacology, vol. 5, no. 2, pp. 54--64, 2016.
12. V. V. Upreti and K. Venkatakrishnan, "Model-based meta-analysis: Optimizing research, development, and utilization of therapeutics using the totality of evidence," Clinical Pharmacology & Therapeutics, vol. 106, no. 5, pp. 981--992, 2019.
13. D. W. Thomas, J. Burns, J. Audette, A. Carroll, C. Dow-Hygelund, and M. Hay, "Clinical development success rates and contributing factors 2011--2020," BIO Industry Analysis, Informa Pharma Intelligence, QLS Advisors, 2021.
14. F. Feijoo, M. Palopoli, J. Bernstein, S. Siddiqui, and T. E. Albright, "Key indicators of phase transition for clinical trials through machine learning," Drug Discovery Today, vol. 25, no. 2, pp. 414--421, 2020.
15. A. Aliper et al., "Prediction of clinical trials outcomes based on target choice and clinical trial design with multi-modal artificial intelligence," Clinical Pharmacology & Therapeutics, vol. 114, no. 5, pp. 972--980, 2023.
16. A. V. Schperberg, P. F. Engstrom, and R. Kurzrock, "Machine learning model to predict oncologic outcomes for drugs in randomized clinical trials," International Journal of Cancer, vol. 147, no. 9, pp. 2537--2549, 2020.
17. Z. Teng, J. Hu, G. L. Faltys, and A. V. Bhattaram, "Model-based meta-analysis for multiple myeloma: A quantitative drug-independent framework for efficient decisions in oncology drug development," Clinical and Translational Science, vol. 11, no. 2, pp. 218--225, 2018.
18. P. Chan, K. Peskov, and X. Song, "Applications of model-based meta-analysis in drug development," Pharmaceutical Research, vol. 39, no. 8, pp. 1761--1777, 2022.
19. L. V. Hampson et al., "A new comprehensive approach to assess the probability of success of development programs before pivotal trials," Clinical Pharmacology & Therapeutics, vol. 111, no. 5, pp. 1050--1060, 2022.
20. T. Otsuka, R. Wada, and T. Shiroyama, "Factors associated with successful phase III trials for solid tumors: A systematic review," Contemporary Clinical Trials Communications, vol. 24, p. 100859, 2021.
21. G. Carrigan et al., "Using electronic health records to derive control arms for early phase single-arm lung cancer trials: Proof-of-concept in randomized controlled trials," Clinical Pharmacology & Therapeutics, vol. 107, no. 2, pp. 369--377, 2020.
22. A. Cetinyurek Yavuz et al., "On the concepts, methods, and use of 'probability of success' for drug development decision-making: A scoping review," Clinical Pharmacology & Therapeutics, vol. 117, no. 4, pp. 928--940, 2025.
23. T. J. Hwang et al., "Failure of investigational drugs in late-stage clinical development and publication of trial results," JAMA Internal Medicine, vol. 176, no. 12, pp. 1826--1833, 2016.
24. U.S. Food and Drug Administration, "Considerations for the use of artificial intelligence to support regulatory decision-making for drug and biological products (FDA-2024-D-4689)," U.S. Department of Health and Human Services, 2025.

