TY - JOUR
T1 - Automated Lugano Metabolic Response Assessment in 18F-Fluorodeoxyglucose–Avid Non-Hodgkin Lymphoma With Deep Learning on 18F-Fluorodeoxyglucose–Positron Emission Tomography
AU - Jemaa, Skander
AU - Ounadjela, Souhila
AU - Wang, Xiaoyong
AU - El-Galaly, Tarec C.
AU - Kostakoglu, Lale
AU - Knapp, Andrea
AU - Ku, Grace
AU - Musick, Lisa
AU - Sahin, Denis
AU - Wei, Michael C.
AU - Yin, Shen
AU - Bengtsson, Thomas
AU - De Crespigny, Alex
AU - Carano, Richard A.D.
N1 - Publisher Copyright:
© 2024 by American Society of Clinical Oncology.
PY - 2024/9/1
Y1 - 2024/9/1
N2 - PURPOSE Artificial intelligence can reduce the time used by physicians on radiological assessments. For 18F-fluorodeoxyglucose–avid lymphomas, obtaining complete metabolic response (CMR) by end of treatment is prognostic. METHODS Here, we present a deep learning–based algorithm for fully automated treatment response assessments according to the Lugano 2014 classification. The proposed four-stage method, trained on a multicountry clinical trial (ClinicalTrials.gov identifier: NCT01287741) and tested in three independent multicenter and multicountry test sets on different non-Hodgkin lymphoma subtypes and different lines of treatment (ClinicalTrials.gov identifiers NCT02257567, NCT02500407; 20% holdout in ClinicalTrials.gov identifier NCT01287741), outputs the detected lesions at baseline and follow-up to enable focused radiologist review. RESULTS The method’s response assessment achieved high agreement with the adjudicated radiologic responses (eg, agreement for overall response assessment of 93%, 87%, and 85% in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, respectively) similar to inter-radiologist agreement and was strongly prognostic of outcomes with a trend toward higher accuracy for death risk than adjudicated radiologic responses (hazard ratio for end of treatment by-model CMR of 0.123, 0.054, and 0.205 in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, compared with, respectively, 0.226, 0.292, and 0.272 for CMR by the adjudicated responses). Furthermore, a radiologist review of the algorithm’s assessments was conducted. The radiologist median review time was 1.38 minutes/assessment, and no statistically significant differences were observed in the level of agreement of the radiologist with the model’s response compared with the level of agreement of the radiologist with the adjudicated responses. CONCLUSION These results suggest that the proposed method can be incorporated into radiologic response assessment workflows in cancer imaging for significant time savings and with performance similar to trained medical experts.
AB - PURPOSE Artificial intelligence can reduce the time used by physicians on radiological assessments. For 18F-fluorodeoxyglucose–avid lymphomas, obtaining complete metabolic response (CMR) by end of treatment is prognostic. METHODS Here, we present a deep learning–based algorithm for fully automated treatment response assessments according to the Lugano 2014 classification. The proposed four-stage method, trained on a multicountry clinical trial (ClinicalTrials.gov identifier: NCT01287741) and tested in three independent multicenter and multicountry test sets on different non-Hodgkin lymphoma subtypes and different lines of treatment (ClinicalTrials.gov identifiers NCT02257567, NCT02500407; 20% holdout in ClinicalTrials.gov identifier NCT01287741), outputs the detected lesions at baseline and follow-up to enable focused radiologist review. RESULTS The method’s response assessment achieved high agreement with the adjudicated radiologic responses (eg, agreement for overall response assessment of 93%, 87%, and 85% in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, respectively) similar to inter-radiologist agreement and was strongly prognostic of outcomes with a trend toward higher accuracy for death risk than adjudicated radiologic responses (hazard ratio for end of treatment by-model CMR of 0.123, 0.054, and 0.205 in ClinicalTrials.gov identifiers NCT01287741, NCT02500407, and NCT02257567, compared with, respectively, 0.226, 0.292, and 0.272 for CMR by the adjudicated responses). Furthermore, a radiologist review of the algorithm’s assessments was conducted. The radiologist median review time was 1.38 minutes/assessment, and no statistically significant differences were observed in the level of agreement of the radiologist with the model’s response compared with the level of agreement of the radiologist with the adjudicated responses. CONCLUSION These results suggest that the proposed method can be incorporated into radiologic response assessment workflows in cancer imaging for significant time savings and with performance similar to trained medical experts.
UR - https://www.scopus.com/pages/publications/85202550014
U2 - 10.1200/JCO.23.01978
DO - 10.1200/JCO.23.01978
M3 - Article
C2 - 38843483
AN - SCOPUS:85202550014
SN - 0732-183X
VL - 42
SP - 2966
EP - 2977
JO - Journal of Clinical Oncology
JF - Journal of Clinical Oncology
IS - 25
ER -