TY - JOUR
T1 - Deep Learning Modeling to Differentiate Multiple Sclerosis from MOG Antibody-Associated Disease
AU - MAGNIMS Study Group
AU - Cortese, Rosa
AU - Sforazzini, Francesco
AU - Gentile, Giordano
AU - De Mauro, Anna
AU - Luchetti, Ludovico
AU - Amato, Maria Pia
AU - Apóstolos-Pereira, Samira Luisa
AU - Arrambide, Georgina
AU - Bellenberg, Barbara
AU - Bianchi, Alessia
AU - Bisecco, Alvino
AU - Bodini, Benedetta
AU - Calabrese, Massimiliano
AU - Camera, Valentina
AU - Celius, Elisabeth G.
AU - De Medeiros Rimkus, Carolina
AU - Duan, Yunyun
AU - Durand-Dubief, Françoise
AU - Filippi, Massimo
AU - Gallo, Antonio
AU - Gasperini, Claudio
AU - Granziera, Cristina
AU - Groppa, Sergiu
AU - Grothe, Matthias
AU - Gueye, Mor
AU - Inglese, Matilde
AU - Jacob, Anu
AU - Lapucci, Caterina
AU - Lazzarotto, Andrea
AU - Liu, Yaou
AU - Llufriu, Sara
AU - Lukas, Carsten
AU - Marignier, Romain
AU - Messina, Silvia
AU - Müller, Jannis
AU - Palace, Jacqueline
AU - Pastó, Luisa
AU - Paul, Friedemann
AU - Prados, Ferran
AU - Pröbstel, Anne Katrin
AU - Rovira, Àlex
AU - Rocca, Maria Assunta
AU - Ruggieri, Serena
AU - Sastre-Garriga, Jaume
AU - Sato, Douglas Kazutoshi
AU - Schneider, Ruth
AU - Sepulveda, Maria
AU - Sowa, Piotr
AU - Stankoff, Bruno
AU - Tortorella, Carla
N1 - Publisher Copyright:
© 2025 American Academy of Neurology.
PY - 2025/9/23
Y1 - 2025/9/23
N2 - Background and ObjectivesMultiple sclerosis (MS) is common in adults while myelin oligodendrocyte glycoprotein antibody-associated disease (MOGAD) is rare. Our previous machine-learning algorithm, using clinical variables, ≤6 brain lesions, and no Dawson fingers, achieved 79% accuracy, 78% sensitivity, and 80% specificity in distinguishing MOGAD from MS but lacked validation. The aim of this study was to (1) evaluate the clinical/MRI algorithm for distinguishing MS from MOGAD, (2) develop a deep learning (DL) model, (3) assess the benefit of combining both, and (4) identify key differentiators using probability attention maps (PAMs).MethodsThis multicenter, retrospective, cross-sectional MAGNIMS study included scans from 19 centers. Inclusion criteria were as follows: adults with non-acute MS and MOGAD, with high-quality T2-fluid-attenuated inversion recovery and T1-weighted scans. Brain scans were scored by 2 readers to assess the performance of the clinical/MRI algorithm on the validation data set. A DL-based classifier using a ResNet-10 convolutional neural network was developed and tested on an independent validation data set. PAMs were generated by averaging correctly classified attention maps from both groups, identifying key differentiating regions.ResultsWe included 406 MRI scans (218 with relapsing remitting MS [RRMS], mean age: 39 years ±11, 69% F; 188 with MOGAD, age: 41 years ±14, 61% F), split into 2 data sets: a training/testing set (n = 265: 150 with RRMS, age: 39 years ±10, 72% F; 115 with MOGAD, age: 42 years ±13, 61% F) and an independent validation set (n = 141: 68 with RRMS, age: 40 years ±14, 65% F; 73 with MOGAD, age: 40 years ±15, 63% F). The clinical/MRI algorithm predicted RRMS over MOGAD with 75% accuracy (95% CI 67-82), 96% sensitivity (95% CI 88-99), and specificity 56% (95% CI 44-68) in the validation cohort. The DL model achieved 77% accuracy (95% CI 64-89), 73% sensitivity (95% CI 57-89), and 83% specificity (95% CI 65-96) in the training/testing cohort, and 70% accuracy (95% CI 63-77), 67% sensitivity (95% CI 55-79), and 73% specificity (95% CI 61-83) in the validation cohort without retraining. When combined, the classifiers reached 86% accuracy (95% CI 81-92), 84% sensitivity (95% CI 75-92), and 89% specificity (95% CI 81-96). PAMs identified key region volumes: corpus callosum (1872 mm3), left precentral gyrus (341 mm3), right thalamus (193 mm3), and right cingulate cortex (186 mm3) for identifying RRMS and brainstem (629 mm3), hippocampus (234 mm3), and parahippocampal gyrus (147 mm3) for identifying MOGAD.DiscussionBoth classifiers effectively distinguished RRMS from MOGAD. The clinical/MRI model showed higher sensitivity while the DL model offered higher specificity, suggesting complementary roles. Their combination improved diagnostic accuracy, and PAMs revealed distinct damage patterns. Future prospective studies should validate these models in diverse, real-world settings.Classification of EvidenceThis study provides Class III evidence that both a clinical/MRI algorithm and an MRI-based DL model accurately distinguish RRMS from MOGAD.
AB - Background and ObjectivesMultiple sclerosis (MS) is common in adults while myelin oligodendrocyte glycoprotein antibody-associated disease (MOGAD) is rare. Our previous machine-learning algorithm, using clinical variables, ≤6 brain lesions, and no Dawson fingers, achieved 79% accuracy, 78% sensitivity, and 80% specificity in distinguishing MOGAD from MS but lacked validation. The aim of this study was to (1) evaluate the clinical/MRI algorithm for distinguishing MS from MOGAD, (2) develop a deep learning (DL) model, (3) assess the benefit of combining both, and (4) identify key differentiators using probability attention maps (PAMs).MethodsThis multicenter, retrospective, cross-sectional MAGNIMS study included scans from 19 centers. Inclusion criteria were as follows: adults with non-acute MS and MOGAD, with high-quality T2-fluid-attenuated inversion recovery and T1-weighted scans. Brain scans were scored by 2 readers to assess the performance of the clinical/MRI algorithm on the validation data set. A DL-based classifier using a ResNet-10 convolutional neural network was developed and tested on an independent validation data set. PAMs were generated by averaging correctly classified attention maps from both groups, identifying key differentiating regions.ResultsWe included 406 MRI scans (218 with relapsing remitting MS [RRMS], mean age: 39 years ±11, 69% F; 188 with MOGAD, age: 41 years ±14, 61% F), split into 2 data sets: a training/testing set (n = 265: 150 with RRMS, age: 39 years ±10, 72% F; 115 with MOGAD, age: 42 years ±13, 61% F) and an independent validation set (n = 141: 68 with RRMS, age: 40 years ±14, 65% F; 73 with MOGAD, age: 40 years ±15, 63% F). The clinical/MRI algorithm predicted RRMS over MOGAD with 75% accuracy (95% CI 67-82), 96% sensitivity (95% CI 88-99), and specificity 56% (95% CI 44-68) in the validation cohort. The DL model achieved 77% accuracy (95% CI 64-89), 73% sensitivity (95% CI 57-89), and 83% specificity (95% CI 65-96) in the training/testing cohort, and 70% accuracy (95% CI 63-77), 67% sensitivity (95% CI 55-79), and 73% specificity (95% CI 61-83) in the validation cohort without retraining. When combined, the classifiers reached 86% accuracy (95% CI 81-92), 84% sensitivity (95% CI 75-92), and 89% specificity (95% CI 81-96). PAMs identified key region volumes: corpus callosum (1872 mm3), left precentral gyrus (341 mm3), right thalamus (193 mm3), and right cingulate cortex (186 mm3) for identifying RRMS and brainstem (629 mm3), hippocampus (234 mm3), and parahippocampal gyrus (147 mm3) for identifying MOGAD.DiscussionBoth classifiers effectively distinguished RRMS from MOGAD. The clinical/MRI model showed higher sensitivity while the DL model offered higher specificity, suggesting complementary roles. Their combination improved diagnostic accuracy, and PAMs revealed distinct damage patterns. Future prospective studies should validate these models in diverse, real-world settings.Classification of EvidenceThis study provides Class III evidence that both a clinical/MRI algorithm and an MRI-based DL model accurately distinguish RRMS from MOGAD.
UR - https://www.scopus.com/pages/publications/105015578418
U2 - 10.1212/WNL.0000000000214075
DO - 10.1212/WNL.0000000000214075
M3 - Article
C2 - 40906978
AN - SCOPUS:105015578418
SN - 0028-3878
VL - 105
JO - Neurology
JF - Neurology
IS - 6
M1 - e214075
ER -