TY - JOUR
T1 - Comprehensive identification of somatic nucleotide variants in human brain tissue
AU - Brain Somatic Mosaicism Network
AU - Wang, Yifan
AU - Bae, Taejeong
AU - Thorpe, Jeremy
AU - Sherman, Maxwell A.
AU - Jones, Attila G.
AU - Cho, Sean
AU - Daily, Kenneth
AU - Dou, Yanmei
AU - Ganz, Javier
AU - Galor, Alon
AU - Lobon, Irene
AU - Pattni, Reenal
AU - Rosenbluh, Chaggai
AU - Tomasi, Simone
AU - Tomasini, Livia
AU - Yang, Xiaoxu
AU - Zhou, Bo
AU - Akbarian, Schahram
AU - Ball, Laurel L.
AU - Bizzotto, Sara
AU - Emery, Sarah B.
AU - Doan, Ryan
AU - Fasching, Liana
AU - Jang, Yeongjun
AU - Juan, David
AU - Lizano, Esther
AU - Luquette, Lovelace J.
AU - Moldovan, John B.
AU - Narurkar, Rujuta
AU - Oetjens, Matthew T.
AU - Rodin, Rachel E.
AU - Sekar, Shobana
AU - Shin, Joo Heon
AU - Soriano, Eduardo
AU - Straub, Richard E.
AU - Zhou, Weichen
AU - Chess, Andrew
AU - Gleeson, Joseph G.
AU - Marquès-Bonet, Tomas
AU - Park, Peter J.
AU - Peters, Mette A.
AU - Pevsner, Jonathan
AU - Walsh, Christopher A.
AU - Weinberger, Daniel R.
AU - Vaccarino, Flora M.
AU - Moran, John V.
AU - Urban, Alexander E.
AU - Kidd, Jeffrey M.
AU - Mills, Ryan E.
AU - Abyzov, Alexej
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Background: Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. Results: Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. Conclusions: This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
AB - Background: Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. Results: Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. Conclusions: This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
UR - http://www.scopus.com/inward/record.url?scp=85103562667&partnerID=8YFLogxK
U2 - 10.1186/s13059-021-02285-3
DO - 10.1186/s13059-021-02285-3
M3 - Article
C2 - 33781308
AN - SCOPUS:85103562667
SN - 1474-7596
VL - 22
JO - Genome Biology
JF - Genome Biology
IS - 1
M1 - 92
ER -