TY - JOUR
T1 - Uncertainty-informed deep learning models enable high-confidence predictions for digital histopathology
AU - Dolezal, James M.
AU - Srisuwananukorn, Andrew
AU - Karpeyev, Dmitry
AU - Ramesh, Siddhi
AU - Kochanny, Sara
AU - Cody, Brittany
AU - Mansfield, Aaron S.
AU - Rakshit, Sagar
AU - Bansal, Radhika
AU - Bois, Melanie C.
AU - Bungum, Aaron O.
AU - Schulte, Jefree J.
AU - Vokes, Everett E.
AU - Garassino, Marina Chiara
AU - Husain, Aliya N.
AU - Pearson, Alexander T.
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - A model’s ability to express its own predictive uncertainty is an essential attribute for maintaining clinical user confidence as computational biomarkers are deployed into real-world medical settings. In the domain of cancer digital histopathology, we describe a clinically-oriented approach to uncertainty quantification for whole-slide images, estimating uncertainty using dropout and calculating thresholds on training data to establish cutoffs for low- and high-confidence predictions. We train models to identify lung adenocarcinoma vs. squamous cell carcinoma and show that high-confidence predictions outperform predictions without uncertainty, in both cross-validation and testing on two large external datasets spanning multiple institutions. Our testing strategy closely approximates real-world application, with predictions generated on unsupervised, unannotated slides using predetermined thresholds. Furthermore, we show that uncertainty thresholding remains reliable in the setting of domain shift, with accurate high-confidence predictions of adenocarcinoma vs. squamous cell carcinoma for out-of-distribution, non-lung cancer cohorts.
AB - A model’s ability to express its own predictive uncertainty is an essential attribute for maintaining clinical user confidence as computational biomarkers are deployed into real-world medical settings. In the domain of cancer digital histopathology, we describe a clinically-oriented approach to uncertainty quantification for whole-slide images, estimating uncertainty using dropout and calculating thresholds on training data to establish cutoffs for low- and high-confidence predictions. We train models to identify lung adenocarcinoma vs. squamous cell carcinoma and show that high-confidence predictions outperform predictions without uncertainty, in both cross-validation and testing on two large external datasets spanning multiple institutions. Our testing strategy closely approximates real-world application, with predictions generated on unsupervised, unannotated slides using predetermined thresholds. Furthermore, we show that uncertainty thresholding remains reliable in the setting of domain shift, with accurate high-confidence predictions of adenocarcinoma vs. squamous cell carcinoma for out-of-distribution, non-lung cancer cohorts.
UR - http://www.scopus.com/inward/record.url?scp=85141088516&partnerID=8YFLogxK
U2 - 10.1038/s41467-022-34025-x
DO - 10.1038/s41467-022-34025-x
M3 - Article
AN - SCOPUS:85141088516
SN - 2041-1723
VL - 13
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 6572
ER -