Skip to main navigation Skip to search Skip to main content

RFMix: A discriminative modeling approach for rapid and robust local-ancestry inference

Research output: Contribution to journalArticlepeer-review

649 Scopus citations

Abstract

Local-ancestry inference is an important step in the genetic analysis of fully sequenced human genomes. Current methods can only detect continental-level ancestry (i.e., European versus African versus Asian) accurately even when using millions of markers. Here, we present RFMix, a powerful discriminative modeling approach that is faster (∼30×) and more accurate than existing methods. We accomplish this by using a conditional random field parameterized by random forests trained on reference panels. RFMix is capable of learning from the admixed samples themselves to boost performance and autocorrect phasing errors. RFMix shows high sensitivity and specificity in simulated Hispanics/Latinos and African Americans and admixed Europeans, Africans, and Asians. Finally, we demonstrate that African Americans in HapMap contain modest (but nonzero) levels of Native American ancestry (∼0.4%).

Original languageEnglish
Pages (from-to)278-288
Number of pages11
JournalAmerican Journal of Human Genetics
Volume93
Issue number2
DOIs
StatePublished - 8 Aug 2013

Fingerprint

Dive into the research topics of 'RFMix: A discriminative modeling approach for rapid and robust local-ancestry inference'. Together they form a unique fingerprint.

Cite this