TY - JOUR
T1 - Cluster model incorporating heterogeneous dose distribution of partial parotid irradiation for radiotherapy induced xerostomia prediction with machine learning methods
AU - Chao, Ming
AU - El Naqa, Issam
AU - Bakst, Richard L.
AU - Lo, Yeh Chi
AU - Peñagarícano, José A.
N1 - Publisher Copyright:
© 2022 Acta Oncologica Foundation.
PY - 2022
Y1 - 2022
N2 - Purpose: A cluster model incorporating heterogeneous dose distribution within the parotid gland was developed and validated retrospectively for radiotherapy (RT) induced xerostomia prediction with machine learning (ML) techniques. Methods: Sixty clusters were obtained at 1 Gy step size with threshold doses ranging from 1 to 60 Gy, for each of the enrolled 155 patients with HNC from three institutions. Feature clusters were selected with the neighborhood component analysis (NCA) and subsequently fed into four supervised ML models for xerostomia prediction comparison: support vector machines (SVM), k-nearest neighbor (kNN), naïve Bayes (NB), and random forest (RF). The predictive performance of each model was evaluated using cross validation resampling with the area-under-the-curves (AUC) of the receiver-operating-characteristic (ROC). The xerostomia predicting capacity using testing data was assessed with accuracy, sensitivity, and specificity for these models and three cluster connectivity choices. Mean dose based logistic regression served as the benchmark for evaluation. Results: Feature clusters identified by NCA fell in three threshold dose ranges: 5–15Gy, 25–35Gy, and 45–50Gy. Mean dose predictive power was 15% lower than that of the cluster model using the logistic regression classifier. Model validation demonstrated that kNN model outperformed slightly other three models but no substantial difference was observed. Applying the fine-tuned models to testing data yielded that the mean accuracy from SVM, kNN and NB models were between 0.68 and 0.7 while that of RF was ∼0.6. SVM model yielded the best sensitivity (0.76) and kNN model delivered consistent sensitivity and specificity. This is consistent with cross validation. Clusters calculated with three connectivity choices exhibited minimally different predictions. Conclusion: Compared to mean dose, the proposed cluster model has shown its improvement as the xerostomia predictor. When combining with ML techniques, it could provide a clinically useful tool for xerostomia prediction and facilitate decision making during radiotherapy planning for patients with HNC.
AB - Purpose: A cluster model incorporating heterogeneous dose distribution within the parotid gland was developed and validated retrospectively for radiotherapy (RT) induced xerostomia prediction with machine learning (ML) techniques. Methods: Sixty clusters were obtained at 1 Gy step size with threshold doses ranging from 1 to 60 Gy, for each of the enrolled 155 patients with HNC from three institutions. Feature clusters were selected with the neighborhood component analysis (NCA) and subsequently fed into four supervised ML models for xerostomia prediction comparison: support vector machines (SVM), k-nearest neighbor (kNN), naïve Bayes (NB), and random forest (RF). The predictive performance of each model was evaluated using cross validation resampling with the area-under-the-curves (AUC) of the receiver-operating-characteristic (ROC). The xerostomia predicting capacity using testing data was assessed with accuracy, sensitivity, and specificity for these models and three cluster connectivity choices. Mean dose based logistic regression served as the benchmark for evaluation. Results: Feature clusters identified by NCA fell in three threshold dose ranges: 5–15Gy, 25–35Gy, and 45–50Gy. Mean dose predictive power was 15% lower than that of the cluster model using the logistic regression classifier. Model validation demonstrated that kNN model outperformed slightly other three models but no substantial difference was observed. Applying the fine-tuned models to testing data yielded that the mean accuracy from SVM, kNN and NB models were between 0.68 and 0.7 while that of RF was ∼0.6. SVM model yielded the best sensitivity (0.76) and kNN model delivered consistent sensitivity and specificity. This is consistent with cross validation. Clusters calculated with three connectivity choices exhibited minimally different predictions. Conclusion: Compared to mean dose, the proposed cluster model has shown its improvement as the xerostomia predictor. When combining with ML techniques, it could provide a clinically useful tool for xerostomia prediction and facilitate decision making during radiotherapy planning for patients with HNC.
KW - Xerostomia; cluster model; machine learning; radiation therapy; head and neck cancer
UR - http://www.scopus.com/inward/record.url?scp=85129763943&partnerID=8YFLogxK
U2 - 10.1080/0284186X.2022.2073187
DO - 10.1080/0284186X.2022.2073187
M3 - Article
C2 - 35527717
AN - SCOPUS:85129763943
SN - 0284-186X
VL - 61
SP - 842
EP - 848
JO - Acta Oncologica
JF - Acta Oncologica
IS - 7
ER -