TY - JOUR
T1 - From Bench-to-Bedside
T2 - How Artificial Intelligence is Changing Thyroid Nodule Diagnostics, a Systematic Review
AU - Sant, Vivek R.
AU - Radhachandran, Ashwath
AU - Ivezic, Vedrana
AU - Lee, Denise T.
AU - Livhits, Masha J.
AU - Wu, James X.
AU - Masamed, Rinat
AU - Arnold, Corey W.
AU - Yeh, Michael W.
AU - Speier, William
N1 - Publisher Copyright:
© The Author(s) 2024. Published by Oxford University Press on behalf of the Endocrine Society. All rights reserved.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - Context: Use of artificial intelligence (AI) to predict clinical outcomes in thyroid nodule diagnostics has grown exponentially over the past decade. The greatest challenge is in understanding the best model to apply to one’s own patient population, and how to operationalize such a model in practice. Evidence Acquisition: A literature search of PubMed and IEEE Xplore was conducted for English-language publications between January 1, 2015 and January 1, 2023, studying diagnostic tests on suspected thyroid nodules that used AI. We excluded articles without prospective or external validation, nonprimary literature, duplicates, focused on nonnodular thyroid conditions, not using AI, and those incidentally using AI in support of an experimental diagnostic outside standard clinical practice. Quality was graded by Oxford level of evidence. Evidence Synthesis: A total of 61 studies were identified; all performed external validation, 16 studies were prospective, and 33 compared a model to physician prediction of ground truth. Statistical validation was reported in 50 papers. A diagnostic pipeline was abstracted, yielding 5 high-level outcomes: (1) nodule localization, (2) ultrasound (US) risk score, (3) molecular status, (4) malignancy, and (5) long-term prognosis. Seven prospective studies validated a single commercial AI; strengths included automating nodule feature assessment from US and assisting the physician in predicting malignancy risk, while weaknesses included automated margin prediction and interobserver variability. Conclusion: Models predominantly used US images to predict malignancy. Of 4 Food and Drug Administration–approved products, only S-Detect was extensively validated. Implementing an AI model locally requires data sanitization and revalidation to ensure appropriate clinical performance.
AB - Context: Use of artificial intelligence (AI) to predict clinical outcomes in thyroid nodule diagnostics has grown exponentially over the past decade. The greatest challenge is in understanding the best model to apply to one’s own patient population, and how to operationalize such a model in practice. Evidence Acquisition: A literature search of PubMed and IEEE Xplore was conducted for English-language publications between January 1, 2015 and January 1, 2023, studying diagnostic tests on suspected thyroid nodules that used AI. We excluded articles without prospective or external validation, nonprimary literature, duplicates, focused on nonnodular thyroid conditions, not using AI, and those incidentally using AI in support of an experimental diagnostic outside standard clinical practice. Quality was graded by Oxford level of evidence. Evidence Synthesis: A total of 61 studies were identified; all performed external validation, 16 studies were prospective, and 33 compared a model to physician prediction of ground truth. Statistical validation was reported in 50 papers. A diagnostic pipeline was abstracted, yielding 5 high-level outcomes: (1) nodule localization, (2) ultrasound (US) risk score, (3) molecular status, (4) malignancy, and (5) long-term prognosis. Seven prospective studies validated a single commercial AI; strengths included automating nodule feature assessment from US and assisting the physician in predicting malignancy risk, while weaknesses included automated margin prediction and interobserver variability. Conclusion: Models predominantly used US images to predict malignancy. Of 4 Food and Drug Administration–approved products, only S-Detect was extensively validated. Implementing an AI model locally requires data sanitization and revalidation to ensure appropriate clinical performance.
KW - artificial intelligence
KW - diagnostics
KW - machine learning
KW - thyroid nodules
UR - http://www.scopus.com/inward/record.url?scp=85196326441&partnerID=8YFLogxK
U2 - 10.1210/clinem/dgae277
DO - 10.1210/clinem/dgae277
M3 - Review article
C2 - 38679750
AN - SCOPUS:85196326441
SN - 0021-972X
VL - 109
SP - 1684
EP - 1693
JO - Journal of Clinical Endocrinology and Metabolism
JF - Journal of Clinical Endocrinology and Metabolism
IS - 7
ER -