A literature review at genome scale: improving clinical variant assessment

  • Christopher A. Cassa
  • , Daniel M. Jordan
  • , Ivan Adzhubei
  • , Shamil Sunyaev

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Purpose: Over 150,000 variants have been reported to cause Mendelian disease in the medical literature. It is still difficult to leverage this knowledge base in clinical practice, as many reports lack strong statistical evidence or may include false associations. Clinical laboratories assess whether these variants (along with newly observed variants that are adjacent to these published ones) underlie clinical disorders. Methods: We investigated whether citation data—including journal impact factor and the number of cited variants (NCV) in each gene with published disease associations—can be used to improve variant assessment. Results: Surprisingly, we found that impact factor is not predictive of pathogenicity, but the NCV score for each gene can provide statistical support for prediction of pathogenicity. When this gene-level citation metric is combined with variant-level evolutionary conservation and structural features, classification accuracy reaches 89.5%. Further, variants identified in clinical exome sequencing cases have higher NCVs than do simulated rare variants from the Exome Aggregation Consortium database within the same set of genes and functional consequences (P < 2.22 × 10−16). Conclusion: Aggregate citation data can complement existing variant-based predictive algorithms, and can boost their performance without the need to access and review large numbers of papers. The NCV is a slow-growing metric of scientific knowledge about each gene’s association with disease.

Original languageEnglish
Pages (from-to)936-941
Number of pages6
JournalGenetics in Medicine
Volume20
Issue number9
DOIs
StatePublished - 1 Sep 2018

Fingerprint

Dive into the research topics of 'A literature review at genome scale: improving clinical variant assessment'. Together they form a unique fingerprint.

Cite this