TY - JOUR
T1 - Development and Optimization of a Machine-Learning Prediction Model for Acute Desquamation After Breast Radiation Therapy in the Multicenter REQUITE Cohort
AU - REQUITE consortium
AU - Aldraimli, Mahmoud
AU - Osman, Sarah
AU - Grishchuck, Diana
AU - Ingram, Samuel
AU - Lyon, Robert
AU - Mistry, Anil
AU - Oliveira, Jorge
AU - Samuel, Robert
AU - Shelley, Leila E.A.
AU - Soria, Daniele
AU - Dwek, Miriam V.
AU - Aguado-Barrera, Miguel E.
AU - Azria, David
AU - Chang-Claude, Jenny
AU - Dunning, Alison
AU - Giraldo, Alexandra
AU - Green, Sheryl
AU - Gutiérrez-Enríquez, Sara
AU - Herskind, Carsten
AU - van Hulle, Hans
AU - Lambrecht, Maarten
AU - Lozza, Laura
AU - Rancati, Tiziana
AU - Reyes, Victoria
AU - Rosenstein, Barry S.
AU - de Ruysscher, Dirk
AU - de Santis, Maria C.
AU - Seibold, Petra
AU - Sperk, Elena
AU - Symonds, R. Paul
AU - Stobart, Hilary
AU - Taboada-Valadares, Begoña
AU - Talbot, Christopher J.
AU - Vakaet, Vincent J.L.
AU - Vega, Ana
AU - Veldeman, Liv
AU - Veldwijk, Marlon R.
AU - Webb, Adam
AU - Weltens, Caroline
AU - West, Catharine M.
AU - Chaussalet, Thierry J.
AU - Rattay, Tim
N1 - Publisher Copyright:
© 2022 The Authors
PY - 2022/5/1
Y1 - 2022/5/1
N2 - Purpose: Some patients with breast cancer treated by surgery and radiation therapy experience clinically significant toxicity, which may adversely affect cosmesis and quality of life. There is a paucity of validated clinical prediction models for radiation toxicity. We used machine learning (ML) algorithms to develop and optimise a clinical prediction model for acute breast desquamation after whole breast external beam radiation therapy in the prospective multicenter REQUITE cohort study. Methods and Materials: Using demographic and treatment-related features (m = 122) from patients (n = 2058) at 26 centers, we trained 8 ML algorithms with 10-fold cross-validation in a 50:50 random-split data set with class stratification to predict acute breast desquamation. Based on performance in the validation data set, the logistic model tree, random forest, and naïve Bayes models were taken forward to cost-sensitive learning optimisation. Results: One hundred and ninety-two patients experienced acute desquamation. Resampling and cost-sensitive learning optimisation facilitated an improvement in classification performance. Based on maximising sensitivity (true positives), the “hero” model was the cost-sensitive random forest algorithm with a false-negative: false-positive misclassification penalty of 90:1 containing m = 114 predictive features. Model sensitivity and specificity were 0.77 and 0.66, respectively, with an area under the curve of 0.77 in the validation cohort. Conclusions: ML algorithms with resampling and cost-sensitive learning generated clinically valid prediction models for acute desquamation using patient demographic and treatment features. Further external validation and inclusion of genomic markers in ML prediction models are worthwhile, to identify patients at increased risk of toxicity who may benefit from supportive intervention or even a change in treatment plan.
AB - Purpose: Some patients with breast cancer treated by surgery and radiation therapy experience clinically significant toxicity, which may adversely affect cosmesis and quality of life. There is a paucity of validated clinical prediction models for radiation toxicity. We used machine learning (ML) algorithms to develop and optimise a clinical prediction model for acute breast desquamation after whole breast external beam radiation therapy in the prospective multicenter REQUITE cohort study. Methods and Materials: Using demographic and treatment-related features (m = 122) from patients (n = 2058) at 26 centers, we trained 8 ML algorithms with 10-fold cross-validation in a 50:50 random-split data set with class stratification to predict acute breast desquamation. Based on performance in the validation data set, the logistic model tree, random forest, and naïve Bayes models were taken forward to cost-sensitive learning optimisation. Results: One hundred and ninety-two patients experienced acute desquamation. Resampling and cost-sensitive learning optimisation facilitated an improvement in classification performance. Based on maximising sensitivity (true positives), the “hero” model was the cost-sensitive random forest algorithm with a false-negative: false-positive misclassification penalty of 90:1 containing m = 114 predictive features. Model sensitivity and specificity were 0.77 and 0.66, respectively, with an area under the curve of 0.77 in the validation cohort. Conclusions: ML algorithms with resampling and cost-sensitive learning generated clinically valid prediction models for acute desquamation using patient demographic and treatment features. Further external validation and inclusion of genomic markers in ML prediction models are worthwhile, to identify patients at increased risk of toxicity who may benefit from supportive intervention or even a change in treatment plan.
UR - http://www.scopus.com/inward/record.url?scp=85124294257&partnerID=8YFLogxK
U2 - 10.1016/j.adro.2021.100890
DO - 10.1016/j.adro.2021.100890
M3 - Article
AN - SCOPUS:85124294257
VL - 7
JO - Advances in Radiation Oncology
JF - Advances in Radiation Oncology
SN - 2452-1094
IS - 3
M1 - 100890
ER -