Inference of human population history from individual whole-genome sequences - PubMed (original) (raw)
Inference of human population history from individual whole-genome sequences
Heng Li et al. Nature. 2011.
Abstract
The history of human population size is important for understanding human evolution. Various studies have found evidence for a founder event (bottleneck) in East Asian and European populations, associated with the human dispersal out-of-Africa event around 60 thousand years (kyr) ago. However, these studies have had to assume simplified demographic models with few parameters, and they do not provide a precise date for the start and stop times of the bottleneck. Here, with fewer assumptions on population size changes, we present a more detailed history of human population sizes between approximately ten thousand and a million years ago, using the pairwise sequentially Markovian coalescent model applied to the complete diploid genome sequences of a Chinese male (YH), a Korean male (SJK), three European individuals (J. C. Venter, NA12891 and NA12878 (ref. 9)) and two Yoruba males (NA18507 (ref. 10) and NA19239). We infer that European and Chinese populations had very similar population-size histories before 10-20 kyr ago. Both populations experienced a severe bottleneck 10-60 kyr ago, whereas African populations experienced a milder bottleneck from which they recovered earlier. All three populations have an elevated effective population size between 60 and 250 kyr ago, possibly due to population substructure. We also infer that the differentiation of genetically modern humans may have started as early as 100-120 kyr ago, but considerable genetic exchanges may still have occurred until 20-40 kyr ago.
Figures
Figure 1. Illustration of the PSMC model and its application to simulated data
(a) The PSMC infers the local time to the most recent common ancestor (TMRCA) based on the local density of heterozygotes, using a Hidden Markov Model, where the observation is a diploid sequence, the hidden states are discretized TMRCA and the transitions represent ancestral recombination events. (b) We used the ms software to simulate the TMRCA relating the two alleles of an individual across a 200kb region (the thick red line), and inferred the local TMRCA at each locus using the PSMC (the heat map). The inference usually includes the correct time, with the greatest errors at transition points.
Figure 2. PSMC estimate on simulated data
(a) PSMC estimate on data simulated by msHOT. The blue curve is the population size history used in simulation; the red curve is the PSMC estimate on the originally simulated sequence; the 100 thin green curves are the PSMC estimates on 100 sequences randomly resampled from the original sequence. (b) PSMC estimate on data with variable mutation rate or with hotspots.
Figure 3. PSMC estimate on real data
(a) The population sizes inferred from autosomes of six individuals. 5%, 10% and 29% of heterozygotes are assumed to be missing in CHN.A, KOR.A and EUR1.A, respectively. (b) The population sizes inferred from male-combined X chromosomes and the simulated African-Asian combined sequences from the best-fit model by Schaffner et al. Sizes inferred from X chromosome data are scaled by 4/3; the neutral mutation rate on X, which is used in time scaling, is estimated with the ratio of male-to-female mutation rate α equal to 2 (Methods).
Similar articles
- PSMC analysis of effective population sizes in molecular ecology and its application to black-and-white Ficedula flycatchers.
Nadachowska-Brzyska K, Burri R, Smeds L, Ellegren H. Nadachowska-Brzyska K, et al. Mol Ecol. 2016 Mar;25(5):1058-72. doi: 10.1111/mec.13540. Epub 2016 Feb 15. Mol Ecol. 2016. PMID: 26797914 Free PMC article. - Demographic inference in barn swallows using whole-genome data shows signal for bottleneck and subspecies differentiation during the Holocene.
Smith CCR, Flaxman SM, Scordato ESC, Kane NC, Hund AK, Sheta BM, Safran RJ. Smith CCR, et al. Mol Ecol. 2018 Nov;27(21):4200-4212. doi: 10.1111/mec.14854. Epub 2018 Sep 22. Mol Ecol. 2018. PMID: 30176075 - Estimating variable effective population sizes from multiple genomes: a sequentially markov conditional sampling distribution approach.
Sheehan S, Harris K, Song YS. Sheehan S, et al. Genetics. 2013 Jul;194(3):647-62. doi: 10.1534/genetics.112.149096. Epub 2013 Apr 22. Genetics. 2013. PMID: 23608192 Free PMC article. - Ancestral Population Genomics.
Dutheil JY, Hobolth A. Dutheil JY, et al. Methods Mol Biol. 2019;1910:555-589. doi: 10.1007/978-1-4939-9074-0_18. Methods Mol Biol. 2019. PMID: 31278677 Review. - [The origin and evolution history of East Asian populations from genetic perspectives].
Tian JY, Li YC, Kong QP, Zhang YP. Tian JY, et al. Yi Chuan. 2018 Oct 20;40(10):814-824. doi: 10.16288/j.yczz.18-202. Yi Chuan. 2018. PMID: 30369466 Review. Chinese.
Cited by
- Genomic and fitness consequences of a near-extinction event in the northern elephant seal.
Hoffman JI, Vendrami DLJ, Hench K, Chen RS, Stoffel MA, Kardos M, Amos W, Kalinowski J, Rickert D, Köhrer K, Wachtmeister T, Goebel ME, Bonin CA, Gulland FMD, Dasmahapatra KK. Hoffman JI, et al. Nat Ecol Evol. 2024 Sep 27. doi: 10.1038/s41559-024-02533-2. Online ahead of print. Nat Ecol Evol. 2024. PMID: 39333394 - Contemporary intergeneric hybridization and backcrossing among birds-of-paradise.
Thörn F, Soares AER, Müller IA, Päckert M, Frahnert S, van Grouw H, Kamminga P, Peona V, Suh A, Blom MPK, Irestedt M. Thörn F, et al. Evol Lett. 2024 Jun 8;8(5):680-694. doi: 10.1093/evlett/qrae023. eCollection 2024 Sep. Evol Lett. 2024. PMID: 39328285 Free PMC article. - Exact Decoding of a Sequentially Markov Coalescent Model in Genetics.
Ki C, Terhorst J. Ki C, et al. J Am Stat Assoc. 2024;119(547):2242-2255. doi: 10.1080/01621459.2023.2252570. Epub 2023 Oct 3. J Am Stat Assoc. 2024. PMID: 39323740 - Genomic divergence and mutation load in the Begonia masoniana complex from limestone karsts.
Chen Y, Dong L, Yi H, Kidner C, Kang M. Chen Y, et al. Plant Divers. 2024 Apr 15;46(5):575-584. doi: 10.1016/j.pld.2024.04.001. eCollection 2024 Sep. Plant Divers. 2024. PMID: 39290887 Free PMC article. - Whole Genomes Inform Genetic Rescue Strategy for Montane Red Foxes in North America.
Quinn CB, Preckler-Quisquater S, Buchalski MR, Sacks BN. Quinn CB, et al. Mol Biol Evol. 2024 Sep 4;41(9):msae193. doi: 10.1093/molbev/msae193. Mol Biol Evol. 2024. PMID: 39288165 Free PMC article.
References
References for online methods
- Miyata T, Hayashida H, Kuma K, Mitsuyasu K, Yasunaga T. Male-driven molecular evolution: a model and nucleotide sequence analysis. Cold Spring Harb Symp Quant Biol. 1987;52:863–867. - PubMed
References
- Reich DE, et al. Linkage disequilibrium in the human genome. Nature. 2001;411:199–204. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources