TY - JOUR
T1 - Population genetic inference from personal genome data
T2 - Impact of ancestry and admixture on human genomic variation
AU - Kidd, Jeffrey M.
AU - Gravel, Simon
AU - Byrnes, Jake
AU - Moreno-Estrada, Andres
AU - Musharoff, Shaila
AU - Bryc, Katarzyna
AU - Degenhardt, Jeremiah D.
AU - Brisbin, Abra
AU - Sheth, Vrunda
AU - Chen, Rong
AU - McLaughlin, Stephen F.
AU - Peckham, Heather E.
AU - Omberg, Larsson
AU - Bormann Chung, Christina A.
AU - Stanley, Sarah
AU - Pearlstein, Kevin
AU - Levandowsky, Elizabeth
AU - Acevedo-Acevedo, Suehelay
AU - Auton, Adam
AU - Keinan, Alon
AU - Acuña-Alonzo, Victor
AU - Barquera-Lozano, Rodrigo
AU - Canizales-Quinteros, Samuel
AU - Eng, Celeste
AU - Burchard, Esteban G.
AU - Russell, Archie
AU - Reynolds, Andy
AU - Clark, Andrew G.
AU - Reese, Martin G.
AU - Lincoln, Stephen E.
AU - Butte, Atul J.
AU - De La Vega, Francisco M.
AU - Bustamante, Carlos D.
N1 - Funding Information:
We thank Melisa Barker, Fiona Hyland, Clarence Lee, Kirk Lohmueller, Kevin McKernan and Koni Wright for advice about this project. J.M.K was supported by National Institutes of Health training grant T32HG000044. Additional support was provided by grants R01ES015794 and R01HL088133 (E.G.B.) and 1R44HG3667 (M.G.R). Vrunda Sheth, Stephen F. McLaughlin, Heather E. Peckham, Christina A. Bormann Chung, Sarah Stanley, Kevin Pearlstein, Elizabeth Levandowsky, and Francisco M. De La Vega are or were employees of Life Technologies during this study. Archie Russell and Martin Reese are employees of Omicia. The Varimed database has been licensed to Personalis. Atul Butte is a founder and consults for Personalis. Carlos D. Bustamante also consults for Personalis, as well as Ancestry.com , Locus Development, and the 23andMe.com project “Roots into the Future.” Stephen E. Lincoln was an employee of Complete Genomics during this study. Jake Byrnes is an employee of Ancestry.com .
PY - 2012/10/5
Y1 - 2012/10/5
N2 - Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas - 70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago.
AB - Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas - 70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago.
UR - http://www.scopus.com/inward/record.url?scp=84867241644&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2012.08.025
DO - 10.1016/j.ajhg.2012.08.025
M3 - Article
C2 - 23040495
AN - SCOPUS:84867241644
SN - 0002-9297
VL - 91
SP - 660
EP - 671
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 4
ER -