Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing

NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r2 > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.

Original languageEnglish
Pages (from-to)1653-1666
Number of pages14
JournalAmerican Journal of Human Genetics
Volume109
Issue number9
DOIs
StatePublished - 1 Sep 2022

Keywords

  • genotype imputation
  • genotyping array
  • whole-genome sequencing

Fingerprint

Dive into the research topics of 'Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing'. Together they form a unique fingerprint.

Cite this