Skip to main navigation Skip to search Skip to main content

Pathologist-Read vs AI-Driven Assessment of Tumor-Infiltrating Lymphocytes in Melanoma

  • Thazin N. Aung
  • , Matthew Liu
  • , David Su
  • , Saba Shafi
  • , Ceren Boyaci
  • , Sanna Steen
  • , Nikolaos Tsiknakis
  • , Joan Martinez Vidal
  • , Nigel Maher
  • , Goran Micevic
  • , Samuel X. Tan
  • , Matthew D. Vesely
  • , Saeed Nourmohammadi
  • , Yalai Bai
  • , Dijana Djureinovic
  • , Pok Fai Wong
  • , Katherine Bates
  • , Nay N.N. Chan
  • , Niki Gavirelatou
  • , Mengni He
  • Sneha Burela, Robert Barna, Martina Bosic, Konstantin Bräutigam, Irineu Illabochaca, Zhou Chenhao, Joao Gama, Bianca Kreis, Reka Mohacsi, Nir Pillar, Joao Pinto, Christos Poulios, Maria Angeliki Toli, Evangelos Tzoras, Yadriel Bracero, Francesca Bosisio, Gábor Cserni, Alis Dema, Francesco Fortarezza, Mercedes Solorzano Gonzalez, Irene Gullo, Francisco Javier Queipo Gutiérrez, Ezgi Hacihasanoglu, Viktor Jovic, Bianca Lazar, Maria Olinca, Christina Neppl, Rui Caetano Oliveira, Federica Pezzuto, Daniel Gomes Pinto, Vanda Plotar, Ovidiu Pop, Tilman Rau, Kristijan Skok, Wenwen Sun, Ezgi Dicle Serbes, Wiebke Solass, Olga Stanowska, Marcell Szasz, Krzysztof Szymonski, Franziska Thimm, Danielle Vignati, Alon Vigdorovits, Victor Prieto, Tobias Sinnberg, James Wilmott, Shawn Cowper, Jonathan Warrell, Yvonne Saenger, Johan Hartman, Jasmine Plummer, Iman Osman, David L. Rimm, Balazs Acs

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

IMPORTANCE Tumor-infiltrating lymphocytes (TILs) are a provocative biomarker in melanoma, influencing diagnosis, prognosis, and immunotherapy outcomes; however, traditional pathologist-read TIL assessment on hematoxylin and eosin-stained slides is prone to interobserver variability, leading to inconsistent clinical decisions. Therefore, development of newer TIL scoring approaches that produce more reliable and consistent readouts is important. OBJECTIVE To evaluate the analytical and clinical validity of a machine learning algorithm for TIL quantification in melanoma compared with traditional pathologist-read methods. DESIGN, SETTING, AND PARTICIPANTS This multioperator, global, multi-institutional prognostic study compared TIL scoring reproducibility between traditional pathologist-read methods and an artificial intelligence (AI)-driven approach. The study was conducted using retrospective cohorts of patients with melanoma between January 2022 and June 2023 across 45 institutions, with tissue evaluated by participants from academic, clinical, and research institutions. Participants were selected to ensure diverse expertise and professional backgrounds. MAIN OUTCOMES AND MEASURES Intraclass correlation coefficient (ICC) values were calculated for the manual and AI-assisted arms using log-transformed data. Kendall W values were calculated for Clark scores (brisk = 3, nonbrisk = 2, and sparse = 1). Reliabilities of ICC and W values were classified as moderate (0.40-0.60), good (0.61-0.80), or excellent (>0.80). AI TIL measurements were dichotomized using the 16.6 and median cutoffs. Univariable and multivariable Cox regression analyses assessed the prognostic value of TIL scores adjusted for clinicopathologic variables. RESULTS There were 111 patients with melanoma in the independent testing cohort (median [range] age at diagnosis, 61.0 [25.0-87.0] years; 56 [50.5%] male) who contributed melanoma whole tissue sections. A total of 98 participants evaluated TILs on 60 hematoxylin and eosin-stained melanoma tissue sections. All 40 participants in the manual arm were pathologists, while the AI-assisted arm included 11 pathologists and 47 nonpathologists (scientists). The AI algorithm demonstrated superior reproducibility, with ICCs higher than 0.90 for all machine learning TIL variables, significantly outperforming manual assessments (ICC, 0.61 for AI-derived stromal TILs vs Kendall W, 0.44 for manual Clark TIL scoring). AI-based TIL scores showed prognostic associations with patient outcomes (n = 111) using the median cutoff approach with a hazard ratio (HR) of 0.45 (95% CI, 0.26-0.80; P = .005), and using the cutoff of 16.6, with an HR of 0.56 (95% CI, 0.32-0.98; P = .04). CONCLUSIONS AND RELEVANCE In this prognostic study of TIL quantification in melanoma, the AI algorithm demonstrated superior reproducibility and prognostic associations compared with traditional methods. Although the retrospective nature of the cohorts limits demonstration of clinical utility, the publicly available dataset and open-source AI tool offer a foundation for future validation and integration into melanoma management.

Original languageEnglish
Article number2518906
JournalJAMA network open
Volume8
Issue number7
DOIs
StatePublished - Jul 2025
Externally publishedYes

Fingerprint

Dive into the research topics of 'Pathologist-Read vs AI-Driven Assessment of Tumor-Infiltrating Lymphocytes in Melanoma'. Together they form a unique fingerprint.

Cite this