TY - JOUR
T1 - CAGI6 ID panel challenge
T2 - assessment of phenotype and variant predictions in 415 children with neurodevelopmental disorders (NDDs)
AU - Aspromonte, Maria Cristina
AU - Del Conte, Alessio
AU - Zhu, Shaowen
AU - Tan, Wuwei
AU - Shen, Yang
AU - Zhang, Yexian
AU - Li, Qi
AU - Wang, Maggie Haitian
AU - Babbi, Giulia
AU - Bovo, Samuele
AU - Martelli, Pier Luigi
AU - Casadio, Rita
AU - Althagafi, Azza
AU - Toonsi, Sumyyah
AU - Kulmanov, Maxat
AU - Hoehndorf, Robert
AU - Katsonis, Panagiotis
AU - Williams, Amanda
AU - Lichtarge, Olivier
AU - Xian, Su
AU - Surento, Wesley
AU - Pejaver, Vikas
AU - Mooney, Sean D.
AU - Sunderam, Uma
AU - Srinivasan, Rajgopal
AU - Murgia, Alessandra
AU - Piovesan, Damiano
AU - Tosatto, Silvio C.E.
AU - Leonardi, Emanuela
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025
Y1 - 2025
N2 - The Genetics of Neurodevelopmental Disorders Lab in Padua provided a new intellectual disability (ID) Panel challenge for computational methods to predict patient phenotypes and their causal variants in the context of the Critical Assessment of the Genome Interpretation, 6th edition (CAGI6). Eight research teams submitted a total of 30 models to predict phenotypes based on the sequences of 74 genes (VCF format) in 415 pediatric patients affected by Neurodevelopmental Disorders (NDDs). NDDs are clinically and genetically heterogeneous conditions, with onset in infant age. Here, we assess the ability and accuracy of computational methods to predict comorbid phenotypes based on clinical features described in each patient and their causal variants. We also evaluated predictions for possible genetic causes in patients without a clear genetic diagnosis. Like the previous ID Panel challenge in CAGI5, seven clinical features (ID, ASD, ataxia, epilepsy, microcephaly, macrocephaly, hypotonia), and variants (Pathogenic/Likely Pathogenic, Variants of Uncertain Significance and Risk Factors) were provided. The phenotypic traits and variant data of 150 patients from the CAGI5 ID Panel Challenge were provided as training set for predictors. The CAGI6 challenge confirms CAGI5 results that predicting phenotypes from gene panel data is highly challenging, with AUC values close to random, and no method able to predict relevant variants with both high accuracy and precision. However, a significant improvement is noted for the best method, with recall increasing from 66% to 82%. Several groups also successfully predicted difficult-to-detect variants, emphasizing the importance of variants initially excluded by the Padua NDD Lab.
AB - The Genetics of Neurodevelopmental Disorders Lab in Padua provided a new intellectual disability (ID) Panel challenge for computational methods to predict patient phenotypes and their causal variants in the context of the Critical Assessment of the Genome Interpretation, 6th edition (CAGI6). Eight research teams submitted a total of 30 models to predict phenotypes based on the sequences of 74 genes (VCF format) in 415 pediatric patients affected by Neurodevelopmental Disorders (NDDs). NDDs are clinically and genetically heterogeneous conditions, with onset in infant age. Here, we assess the ability and accuracy of computational methods to predict comorbid phenotypes based on clinical features described in each patient and their causal variants. We also evaluated predictions for possible genetic causes in patients without a clear genetic diagnosis. Like the previous ID Panel challenge in CAGI5, seven clinical features (ID, ASD, ataxia, epilepsy, microcephaly, macrocephaly, hypotonia), and variants (Pathogenic/Likely Pathogenic, Variants of Uncertain Significance and Risk Factors) were provided. The phenotypic traits and variant data of 150 patients from the CAGI5 ID Panel Challenge were provided as training set for predictors. The CAGI6 challenge confirms CAGI5 results that predicting phenotypes from gene panel data is highly challenging, with AUC values close to random, and no method able to predict relevant variants with both high accuracy and precision. However, a significant improvement is noted for the best method, with recall increasing from 66% to 82%. Several groups also successfully predicted difficult-to-detect variants, emphasizing the importance of variants initially excluded by the Padua NDD Lab.
UR - http://www.scopus.com/inward/record.url?scp=85217180047&partnerID=8YFLogxK
U2 - 10.1007/s00439-024-02722-w
DO - 10.1007/s00439-024-02722-w
M3 - Article
C2 - 39786577
AN - SCOPUS:85217180047
SN - 0340-6717
JO - Human Genetics
JF - Human Genetics
ER -