Pedro Soares | Universidade do Minho (original) (raw)

Papers by Pedro Soares

Research paper thumbnail of Phylogeography and Ethnogenesis of Aboriginal Southeast Asians

Molecular Biology and Evolution, 2006

Studying the genetic history of the Orang Asli of Peninsular Malaysia can provide crucial clues t... more Studying the genetic history of the Orang Asli of Peninsular Malaysia can provide crucial clues to the peopling of Southeast Asia as a whole. We have analyzed mitochondrial DNA (mtDNAs) control-region and coding-region markers in 447 mtDNAs from the region, including 260 Orang Asli, representative of each of the traditional groupings, the Semang, the Senoi, and the Aboriginal Malays, allowing us to test hypotheses about their origins. All of the Orang Asli groups have undergone high levels of genetic drift, but phylogeographic traces nevertheless remain of the ancestry of their maternal lineages. The Semang have a deep ancestry within the Malay Peninsula, dating to the initial settlement from Africa .50,000 years ago. The Senoi appear to be a composite group, with approximately half of the maternal lineages tracing back to the ancestors of the Semang and about half to Indochina. This is in agreement with the suggestion that they represent the descendants of early Austroasiatic speaking agriculturalists, who brought both their language and their technology to the southern part of the peninsula ;4,000 years ago and coalesced with the indigenous population. The Aboriginal Malays are more diverse, and although they show some connections with island Southeast Asia, as expected, they also harbor haplogroups that are either novel or rare elsewhere. Contrary to expectations, complete mtDNA genome sequences from one of these, R9b, suggest an ancestry in Indochina around the time of the Last Glacial Maximum, followed by an early-Holocene dispersal through the Malay Peninsula into island Southeast Asia.

Research paper thumbnail of Climate Change and Postglacial Human Dispersals in Southeast Asia

Molecular Biology and Evolution, 2008

Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely... more Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely because of the influence of linguistic studies, however, which have a shallow time depth, the attention of archaeologists and geneticists has usually been focused on the last 6,000 years-in particular, on a proposed Neolithic dispersal from China and Taiwan. Here we use complete mitochondrial DNA (mtDNA) genome sequencing to spotlight some earlier processes that clearly had a major role in the demographic history of the region but have hitherto been unrecognized. We show that haplogroup E, an important component of mtDNA diversity in the region, evolved in situ over the last 35,000 years and expanded dramatically throughout ISEA around the beginning of the Holocene, at the time when the ancient continent of Sundaland was being broken up into the present-day archipelago by rising sea levels. It reached Taiwan and Near Oceania more recently, within the last ;8,000 years. This suggests that global warming and sea-level rises at the end of the Ice Age, 15,000-7,000 years ago, were the main forces shaping modern human diversity in the region.

Research paper thumbnail of Extensive Admixture and Selective Pressure Across the Sahel Belt

Genome-wide studies of African populations have the potential to reveal powerful insights into th... more Genome-wide studies of African populations have the potential to reveal powerful insights into the evolution of our species, as these diverse populations have been exposed to intense selective pressures imposed by infectious diseases, diet, and environmental factors. Within Africa, the Sahel Belt extensively overlaps the geographical center of several endemic infections such as malaria, trypanoso-miasis, meningitis, and hemorrhagic fevers. We screened 2.5 million single nucleotide polymorphisms in 161 individuals from 13 Sahelian populations, which together with published data cover Western, Central, and Eastern Sahel, and include both nomadic and sedentary groups. We confirmed the role of this Belt as a main corridor for human migrations across the continent. Strong admixture was observed in both Central and Eastern Sahelian populations, with North Africans and Near Eastern/Arabians, respectively, but it was inexistent in Western Sahelian populations. Genome-wide local ancestry inference in admixed Sahelian populations revealed several candidate regions that were significantly enriched for non-autochthonous haplotypes, and many showed to be under positive selection. The DARC gene region in Arabs and Nubians was enriched for African ancestry, whereas the RAB3GAP1/LCT/MCM6 region in Oromo, the TAS2R gene family in Fulani, and the ALMS1/NAT8 in Turkana and Samburu were enriched for non-African ancestry. Signals of positive selection varied in terms of geographic amplitude. Some genomic regions were selected across the Belt, the most striking example being the malaria-related DARC gene. Others were Western-specific (oxytocin, calcium, and heart pathways), Eastern-specific (lipid pathways), or even population-restricted (TAS2R genes in Fulani, which may reflect sexual selection).

Research paper thumbnail of Quantifying the legacy of the Chinese Neolithic on the maternal genetic heritage of Taiwan and Island Southeast Asia

Human Genetics, 2016

There has been a long-standing debate concerning the extent to which the spread of Neolithic cera... more There has been a long-standing debate concerning the extent to which the spread of Neolithic ceramics and Malay-Polynesian languages in Island Southeast Asia (ISEA) were coupled to an agriculturally driven demic dispersal out of Taiwan 4000 years ago (4 ka). We previously addressed this question using founder analysis of mitochondrial DNA (mtDNA) control-region sequences to identify major lineage clusters most likely to have dispersed from Taiwan into ISEA, proposing that the dispersal had a relatively minor impact on the extant genetic structure of ISEA, and that the role of agriculture in the expansion of the Austronesian languages was therefore likely to have been correspondingly minor. Here we test these conclusions by sequencing whole mtDNAs from across Taiwan and ISEA, using their higher chronological precision to resolve the overall proportion that participated in the "out-of-Taiwan" mid-Holocene dispersal as opposed to earlier, postglacial expansions in the Early Holocene. We show that, in total, about 20 % of mtDNA lineages in the modern ISEA pool result from the "out-of-Taiwan" dispersal, with most of the remainder signifying earlier processes, mainly due to sea-level rises after the Last Glacial Maximum. Notably, we show that every one of these founder clusters previously entered Taiwan from China, 6-7 ka, where rice-farming originated, and remained distinct from the indigenous Taiwanese population until after the subsequent dispersal into ISEA.

Research paper thumbnail of Early Holocenic and Historic mtDNA African Signatures in the Iberian Peninsula: The Andalusian Region as a Paradigm

PloS one, 2015

Determining the timing, identity and direction of migrations in the Mediterranean Basin, the role... more Determining the timing, identity and direction of migrations in the Mediterranean Basin, the role of "migratory routes" in and among regions of Africa, Europe and Asia, and the effects of sex-specific behaviors of population movements have important implications for our understanding of the present human genetic diversity. A crucial component of the Mediterranean world is its westernmost region. Clear features of transcontinental ancient contacts between North African and Iberian populations surrounding the maritime region of Gibraltar Strait have been identified from archeological data. The attempt to discern origin and dates of migration between close geographically related regions has been a challenge in the field of uniparental-based population genetics. Mitochondrial DNA (mtDNA) studies have been focused on surveying the H1, H3 and V lineages when trying to ascertain north-south migrations, and U6 and L in the opposite direction, assuming that those lineages are good ...

Research paper thumbnail of Fine Time-Scaling of Purifying Selection on Human Non-Synonymous mtDNA Mutations Based on Worldwide Population Tree and Mother-Child Pairs

Human mutation, Jan 7, 2015

A high-resolution mtDNA phylogenetic tree allowed us to look backward in time to investigate puri... more A high-resolution mtDNA phylogenetic tree allowed us to look backward in time to investigate purifying selection. Purifying selection was very strong in the last 2,500 years, continuously eliminating pathogenic mutations back until the end of the Younger Dryas (∼11,000 years ago), when a large population expansion likely relaxed selection pressure. This was preceded by a phase of stable selection until another relaxation occurred in the out-of-Africa migration. Demography and selection are closely related: expansions led to relaxation of selection and higher pathogenicity mutations significantly decreased the growth of descendants. The only detectible positive selection was the recurrence of highly pathogenic non-synonymous mutations (m.3394T>C-m.3397A>G-m.3398T>C) at interior branches of the tree, preventing the formation of a dinucleotide STR (TATATA) in the MT-ND1 gene. At the most recent timescale in 124 mother-children transmissions, purifying selection was detectable ...

Research paper thumbnail of The genetic impact of the lake chad basin population in north africa as documented by mitochondrial diversity and internal variation of the L3e5 haplogroup

Annals of Human Genetics, 2013

Research paper thumbnail of Evaluating Purifying Selection in the Mitochondrial DNA of Various Mammalian Species

PLoS ONE, 2013

Mitochondrial DNA (mtDNA), the circular DNA molecule inside the mitochondria of all eukaryotic ce... more Mitochondrial DNA (mtDNA), the circular DNA molecule inside the mitochondria of all eukaryotic cells, has been shown to be under the effect of purifying selection in several species. Traditional testing of purifying selection has been based simply on ratios of nonsynonymous to synonymous mutations, without considering the relative age of each mutation, which can be determined by phylogenetic analysis of this non-recombining molecule. The incorporation of a mutation time-ordering from phylogeny and of predicted pathogenicity scores for nonsynonymous mutations allow a quantitative evaluation of the effects of purifying selection in human mtDNA. Here, by using this additional information, we show that purifying selection undoubtedly acts upon the mtDNA of other mammalian species/genera, namely Bos sp., Canis lupus, Mus musculus, Orcinus orca, Pan sp. and Sus scrofa. The effects of purifying selection were comparable in all species, leading to a significant major proportion of nonsynonymous variants with higher pathogenicity scores in the younger branches of the tree. We also derive recalibrated mutation rates for age estimates of ancestors of these various species and proposed a correction curve in order to take into account the effects of selection. Understanding this selection is fundamental to evolutionary studies and to the identification of deleterious mutations.

Research paper thumbnail of The First Modern Human Dispersals across Africa

PLoS ONE, 2013

The emergence of more refined chronologies for climate change and archaeology in prehistoric Afri... more The emergence of more refined chronologies for climate change and archaeology in prehistoric Africa, and for the evolution of human mitochondrial DNA (mtDNA), now make it feasible to test more sophisticated models of early modern human dispersals suggested by mtDNA distributions. Here we have generated 42 novel whole-mtDNA genomes belonging to haplogroup L0, the most divergent clade in the maternal line of descent, and analysed them alongside the growing database of African lineages belonging to L0's sister clade, L1'6. We propose that the last common ancestor of modern human mtDNAs (carried by "mitochondrial Eve") possibly arose in central Africa ~180 ka, at a time of low population size. By ~130 ka two distinct groups of anatomically modern humans co-existed in Africa: broadly, the ancestors of many modern-day Khoe and San populations in the south and a second central/ eastern African group that includes the ancestors of most extant worldwide populations. Early modern human dispersals correlate with climate changes, particularly the tropical African "megadroughts" of MIS 5 (marine isotope stage 5, 135-75 ka) which paradoxically may have facilitated expansions in central and eastern Africa, ultimately triggering the dispersal out of Africa of people carrying haplogroup L3 ~60 ka. Two south to east migrations are discernible within haplogroup LO. One, between 120 and 75 ka, represents the first unambiguous long-range modern human dispersal detected by mtDNA and might have allowed the dispersal of several markers of modernity. A second one, within the last 20 ka signalled by L0d, may have been responsible for the spread of southern clickconsonant languages to eastern Africa, contrary to the view that these eastern examples constitute relicts of an ancient, much wider distribution.

Research paper thumbnail of Mutational Spectrum and Linkage Disequilibrium Patterns at the Ornithine Transcarbamylase Gene (OTC)

Annals of Human Genetics, 2006

Ornithine transcarbamylase (OTC; EC 2.1.3.3) is a hepatic enzyme involved in ammonia elimination ... more Ornithine transcarbamylase (OTC; EC 2.1.3.3) is a hepatic enzyme involved in ammonia elimination via the urea cycle. Since the sequence of the OTC gene was reported many types of mutations continue to be found in OTC deficiency patients, continuing to increase the already wide mutational spectrum known for this gene. In this study we present the clinical, biochemical and molecular features of thirteen late-onset OTC deficiency patients. Mutations were identified in all these patients, among which six were novel point substitutions (L59R, A137P, L148S, Y176L, L186P, and K210N) and one was a 2-bp deletion at exon 4 (341-342delAA). In addition, a de novo genomic deletion of maternal origin encompassing exons 1 to 5 was also identified by the analysis of LD patterns using intragenic polymorphic markers. This work exemplifies the potential value of population genetic studies for the detection of large deletions.

Research paper thumbnail of Mitochondrial DNA Signals of Late Glacial Recolonization of Europe from Near Eastern Refugia

The American Journal of Human Genetics, 2012

Research paper thumbnail of Comparing Phylogeny and the Predicted Pathogenicity of Protein Variations Reveals Equal Purifying Selection across the Global Human mtDNA Diversity

The American Journal of Human Genetics, 2011

We used detailed phylogenetic trees for human mtDNA, combined with pathogenicity predictions for ... more We used detailed phylogenetic trees for human mtDNA, combined with pathogenicity predictions for each amino acid change, to evaluate selection on mtDNA-encoded protein variants. Protein variants with high pathogenicity scores were significantly rarer in the older branches of the tree. Variants that have formed and survived multiple times in the human phylogenetics tree had significantly lower pathogenicity scores than those that only appear once in the tree. We compared the distribution of pathogenicity scores observed on the human phylogenetic tree to the distribution of all possible protein variations to define a measure of the effect of selection on these protein variations. The measured effect of selection increased exponentially with increasing pathogenicity score. We found no measurable difference in this measure of purifying selection in mtDNA across the global population, represented by the macrohaplogroups L, M, and N. We provide a list of all possible single amino acid variations for the human mtDNA-encoded proteins with their predicted pathogenicity scores and our measured selection effect as a tool for assessing novel protein variations that are often reported in patients with mitochondrial disease of unknown origin or for assessing somatic mutations acquired through aging or detected in tumors.

Research paper thumbnail of Resolving the ancestry of Austronesian-speaking populations

There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), w... more There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), with genetic evidence invoked in support of both. The “out-of-Taiwan” model proposes a major Late Holocene expansion of Neolithic Austronesian speakers from Taiwan. An alternative, proposing that Late Glacial/postglacial sea-level rises triggered largely autochthonous dispersals, accounts for some otherwise enigmatic genetic patterns, but fails to explain the Austronesian language dispersal. Combining mitochondrial DNA (mtDNA), Y-chromosome and genome-wide data, we performed the most comprehensive analysis of the region to date, obtaining highly consistent results across all three systems and allowing us to reconcile the models. We infer a primarily common ancestry for Taiwan/ISEA populations established before the Neolithic, but also detected clear signals of two minor Late Holocene migrations, probably representing Neolithic input from both Mainland Southeast Asia and South China, via Taiwan. This latter may therefore have mediated the Austronesian language dispersal, implying small-scale migration and language shift rather than large-scale expansion.

Research paper thumbnail of Pleistocene-Holocene boundary in southern Arabia from the perspective of human mtDNA variation

American Journal of Physical Anthropology, 2012

It is now known that several population movements have taken place at different times through- ou... more It is now known that several population movements have taken place at different times through- out southern Arabian prehistory. One of the principal questions under debate is if the Early Holocene peopling of southern Arabia was mainly due to input from the Le- vant during the Pre-Pottery Neolithic B, to the expansion of an autochthonous population, or some combination of these demographic processes. Since previous genetic stud- ies have not been able to include all parts of southern Arabia, we have helped fill this lacuna by collecting new population datasets from Oman (Dhofar) and Yemen (Al- Mahra and Bab el-Mandab). We identified several new haplotypes belonging to haplogroup R2 and generated its whole genome mtDNA tree with age estimates under- taken by different methods. R2, together with other considerably frequent southern Arabian mtDNA haplogroups (R0a, HV1, summing up more than 20% of the South Ara- bian gene pool) were used to infer the past effective popu- lation size through Bayesian skyline plots. These data indicate that the southern Arabian population underwent a large expansion already some 12 ka. A founder analysis of these haplogroups shows that this expansion is largely attributed to demographic input from the Near East. These results support thus the spread of a population coming from the north, but at a significantly earlier date than presently considered by archaeologists. Our data suggest that some of the mtDNA lineages found in south- ern Arabia have persisted in the region since the end of the Last Ice Age.

Research paper thumbnail of 60,000 years of interactions between Central and Eastern Africa documented by major African mitochondrial haplogroup L2

Scientific reports, 2015

Mitochondrial DNA (mtDNA) haplogroup L2 originated in Western Africa but is nowadays spread acros... more Mitochondrial DNA (mtDNA) haplogroup L2 originated in Western Africa but is nowadays spread across the entire continent. L2 movements were previously postulated to be related to the Bantu expansion, but L2 expansions eastwards probably occurred much earlier. By reconstructing the phylogeny of L2 (44 new complete sequences) we provide insights on the complex net of within-African migrations in the last 60 thousand years (ka). Results show that lineages in Southern Africa cluster with Western/Central African lineages at a recent time scale, whereas, eastern lineages seem to be substantially more ancient. Three moments of expansion from a Central African source are associated to L2: (1) one migration at 70-50 ka into Eastern or Southern Africa, (2) postglacial movements (15-10 ka) into Eastern Africa; and (3) the southward Bantu Expansion in the last 5 ka. The complementary population and L0a phylogeography analyses indicate no strong evidence of mtDNA gene flow between eastern and sou...

Research paper thumbnail of A substantial prehistoric European ancestry amongst Ashkenazi maternal lineages

Nature Communications (2013) 4:2543

The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA is pas... more The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA is passed along the maternal line. Its variation in the Ashkenazim is highly distinctive, with four major and numerous minor founders. However, due to their rarity in the general population, these founders have been difficult to trace to a source. Here we show that all four major founders, ~40% of Ashkenazi mtDNA variation, have ancestry in prehistoric Europe, rather than the Near East or Caucasus. Furthermore, most of the remaining minor founders share a similar deep European ancestry. Thus the great majority of Ashkenazi maternal lineages were not brought from the Levant, as commonly supposed, nor recruited in the Caucasus, as sometimes suggested, but assimilated within Europe. These results point to a significant role for the conversion of women in the formation of Ashkenazi communities, and provide the foundation for a detailed reconstruction of Ashkenazi genealogical history.

Research paper thumbnail of Relative Y-STR mutation rates estimated from the variance inside SNP defined lineages

International Congress Series, 2006

Y specific microsatellites (STRs) have been widely used in forensic and population genetics in ag... more Y specific microsatellites (STRs) have been widely used in forensic and population genetics in age estimates of human male lineages. Previously, estimates of mutation rates from father-son pairs have given quite variable results in different studies, essentially due to the rarity of mutations. We propose an indirect approach for determining relative mutation rates of Ychromosome microsatellites based on STR allele size intra-lineage variance. Indeed, the present distribution of STR alleles offers us an insight into the mechanisms that have generated that diversity. D

Research paper thumbnail of Climate Change and Postglacial Human Dispersals in Southeast Asia

Molecular Biology and Evolution, 2008

Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely... more Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely because of the influence of linguistic studies, however, which have a shallow time depth, the attention of archaeologists and geneticists has usually been focused on the last 6,000 years-in particular, on a proposed Neolithic dispersal from China and Taiwan. Here we use complete mitochondrial DNA (mtDNA) genome sequencing to spotlight some earlier processes that clearly had a major role in the demographic history of the region but have hitherto been unrecognized. We show that haplogroup E, an important component of mtDNA diversity in the region, evolved in situ over the last 35,000 years and expanded dramatically throughout ISEA around the beginning of the Holocene, at the time when the ancient continent of Sundaland was being broken up into the present-day archipelago by rising sea levels. It reached Taiwan and Near Oceania more recently, within the last ;8,000 years. This suggests that global warming and sea-level rises at the end of the Ice Age, 15,000-7,000 years ago, were the main forces shaping modern human diversity in the region.

Research paper thumbnail of The Arabian Cradle: Mitochondrial Relicts of the First Steps along the Southern Route out of Africa

The American Journal of Human Genetics, 2012

A major unanswered question regarding the dispersal of modern humans around the world concerns th... more A major unanswered question regarding the dispersal of modern humans around the world concerns the geographical site of the first human steps outside of Africa. The ''southern coastal route'' model predicts that the early stages of the dispersal took place when people crossed the Red Sea to southern Arabia, but genetic evidence has hitherto been tenuous. We have addressed this question by analyzing the three minor west-Eurasian haplogroups, N1, N2, and X. These lineages branch directly from the first non-African founder node, the root of haplogroup N, and coalesce to the time of the first successful movement of modern humans out of Africa,~60 thousand years (ka) ago. We sequenced complete mtDNA genomes from 85 Southwest Asian samples carrying these haplogroups and compared them with a database of 300 European examples. The results show that these minor haplogroups have a relict distribution that suggests an ancient ancestry within the Arabian Peninsula, and they most likely spread from the Gulf Oasis region toward the Near East and Europe during the pluvial period 55-24 ka ago. This pattern suggests that Arabia was indeed the first staging post in the spread of modern humans around the world.

Research paper thumbnail of Musilová et al. - 2011 - Population history of the Red Sea-genetic exchanges between the Arabian Peninsula and East Africa signaled in t

Archaeological studies have revealed cultural connections between the two sides of the Red Sea da... more Archaeological studies have revealed cultural connections between the two sides of the Red Sea dating to prehistory. The issue has still not been properly addressed, however, by archaeogenetics. We focus our attention here on the mitochondrial haplogroup HV1 that is present in both the Arabian Peninsula and East Africa. The internal variation of 38 complete mitochondrial DNA sequences (20 of them presented here for the first time) affiliated into this haplogroup testify to its emergence during the late glacial maximum, most probably in the Near East, with subsequent dispersion via in Wiley Online Library (wileyonlinelibrary.com).

Research paper thumbnail of Phylogeography and Ethnogenesis of Aboriginal Southeast Asians

Molecular Biology and Evolution, 2006

Studying the genetic history of the Orang Asli of Peninsular Malaysia can provide crucial clues t... more Studying the genetic history of the Orang Asli of Peninsular Malaysia can provide crucial clues to the peopling of Southeast Asia as a whole. We have analyzed mitochondrial DNA (mtDNAs) control-region and coding-region markers in 447 mtDNAs from the region, including 260 Orang Asli, representative of each of the traditional groupings, the Semang, the Senoi, and the Aboriginal Malays, allowing us to test hypotheses about their origins. All of the Orang Asli groups have undergone high levels of genetic drift, but phylogeographic traces nevertheless remain of the ancestry of their maternal lineages. The Semang have a deep ancestry within the Malay Peninsula, dating to the initial settlement from Africa .50,000 years ago. The Senoi appear to be a composite group, with approximately half of the maternal lineages tracing back to the ancestors of the Semang and about half to Indochina. This is in agreement with the suggestion that they represent the descendants of early Austroasiatic speaking agriculturalists, who brought both their language and their technology to the southern part of the peninsula ;4,000 years ago and coalesced with the indigenous population. The Aboriginal Malays are more diverse, and although they show some connections with island Southeast Asia, as expected, they also harbor haplogroups that are either novel or rare elsewhere. Contrary to expectations, complete mtDNA genome sequences from one of these, R9b, suggest an ancestry in Indochina around the time of the Last Glacial Maximum, followed by an early-Holocene dispersal through the Malay Peninsula into island Southeast Asia.

Research paper thumbnail of Climate Change and Postglacial Human Dispersals in Southeast Asia

Molecular Biology and Evolution, 2008

Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely... more Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely because of the influence of linguistic studies, however, which have a shallow time depth, the attention of archaeologists and geneticists has usually been focused on the last 6,000 years-in particular, on a proposed Neolithic dispersal from China and Taiwan. Here we use complete mitochondrial DNA (mtDNA) genome sequencing to spotlight some earlier processes that clearly had a major role in the demographic history of the region but have hitherto been unrecognized. We show that haplogroup E, an important component of mtDNA diversity in the region, evolved in situ over the last 35,000 years and expanded dramatically throughout ISEA around the beginning of the Holocene, at the time when the ancient continent of Sundaland was being broken up into the present-day archipelago by rising sea levels. It reached Taiwan and Near Oceania more recently, within the last ;8,000 years. This suggests that global warming and sea-level rises at the end of the Ice Age, 15,000-7,000 years ago, were the main forces shaping modern human diversity in the region.

Research paper thumbnail of Extensive Admixture and Selective Pressure Across the Sahel Belt

Genome-wide studies of African populations have the potential to reveal powerful insights into th... more Genome-wide studies of African populations have the potential to reveal powerful insights into the evolution of our species, as these diverse populations have been exposed to intense selective pressures imposed by infectious diseases, diet, and environmental factors. Within Africa, the Sahel Belt extensively overlaps the geographical center of several endemic infections such as malaria, trypanoso-miasis, meningitis, and hemorrhagic fevers. We screened 2.5 million single nucleotide polymorphisms in 161 individuals from 13 Sahelian populations, which together with published data cover Western, Central, and Eastern Sahel, and include both nomadic and sedentary groups. We confirmed the role of this Belt as a main corridor for human migrations across the continent. Strong admixture was observed in both Central and Eastern Sahelian populations, with North Africans and Near Eastern/Arabians, respectively, but it was inexistent in Western Sahelian populations. Genome-wide local ancestry inference in admixed Sahelian populations revealed several candidate regions that were significantly enriched for non-autochthonous haplotypes, and many showed to be under positive selection. The DARC gene region in Arabs and Nubians was enriched for African ancestry, whereas the RAB3GAP1/LCT/MCM6 region in Oromo, the TAS2R gene family in Fulani, and the ALMS1/NAT8 in Turkana and Samburu were enriched for non-African ancestry. Signals of positive selection varied in terms of geographic amplitude. Some genomic regions were selected across the Belt, the most striking example being the malaria-related DARC gene. Others were Western-specific (oxytocin, calcium, and heart pathways), Eastern-specific (lipid pathways), or even population-restricted (TAS2R genes in Fulani, which may reflect sexual selection).

Research paper thumbnail of Quantifying the legacy of the Chinese Neolithic on the maternal genetic heritage of Taiwan and Island Southeast Asia

Human Genetics, 2016

There has been a long-standing debate concerning the extent to which the spread of Neolithic cera... more There has been a long-standing debate concerning the extent to which the spread of Neolithic ceramics and Malay-Polynesian languages in Island Southeast Asia (ISEA) were coupled to an agriculturally driven demic dispersal out of Taiwan 4000 years ago (4 ka). We previously addressed this question using founder analysis of mitochondrial DNA (mtDNA) control-region sequences to identify major lineage clusters most likely to have dispersed from Taiwan into ISEA, proposing that the dispersal had a relatively minor impact on the extant genetic structure of ISEA, and that the role of agriculture in the expansion of the Austronesian languages was therefore likely to have been correspondingly minor. Here we test these conclusions by sequencing whole mtDNAs from across Taiwan and ISEA, using their higher chronological precision to resolve the overall proportion that participated in the "out-of-Taiwan" mid-Holocene dispersal as opposed to earlier, postglacial expansions in the Early Holocene. We show that, in total, about 20 % of mtDNA lineages in the modern ISEA pool result from the "out-of-Taiwan" dispersal, with most of the remainder signifying earlier processes, mainly due to sea-level rises after the Last Glacial Maximum. Notably, we show that every one of these founder clusters previously entered Taiwan from China, 6-7 ka, where rice-farming originated, and remained distinct from the indigenous Taiwanese population until after the subsequent dispersal into ISEA.

Research paper thumbnail of Early Holocenic and Historic mtDNA African Signatures in the Iberian Peninsula: The Andalusian Region as a Paradigm

PloS one, 2015

Determining the timing, identity and direction of migrations in the Mediterranean Basin, the role... more Determining the timing, identity and direction of migrations in the Mediterranean Basin, the role of "migratory routes" in and among regions of Africa, Europe and Asia, and the effects of sex-specific behaviors of population movements have important implications for our understanding of the present human genetic diversity. A crucial component of the Mediterranean world is its westernmost region. Clear features of transcontinental ancient contacts between North African and Iberian populations surrounding the maritime region of Gibraltar Strait have been identified from archeological data. The attempt to discern origin and dates of migration between close geographically related regions has been a challenge in the field of uniparental-based population genetics. Mitochondrial DNA (mtDNA) studies have been focused on surveying the H1, H3 and V lineages when trying to ascertain north-south migrations, and U6 and L in the opposite direction, assuming that those lineages are good ...

Research paper thumbnail of Fine Time-Scaling of Purifying Selection on Human Non-Synonymous mtDNA Mutations Based on Worldwide Population Tree and Mother-Child Pairs

Human mutation, Jan 7, 2015

A high-resolution mtDNA phylogenetic tree allowed us to look backward in time to investigate puri... more A high-resolution mtDNA phylogenetic tree allowed us to look backward in time to investigate purifying selection. Purifying selection was very strong in the last 2,500 years, continuously eliminating pathogenic mutations back until the end of the Younger Dryas (∼11,000 years ago), when a large population expansion likely relaxed selection pressure. This was preceded by a phase of stable selection until another relaxation occurred in the out-of-Africa migration. Demography and selection are closely related: expansions led to relaxation of selection and higher pathogenicity mutations significantly decreased the growth of descendants. The only detectible positive selection was the recurrence of highly pathogenic non-synonymous mutations (m.3394T>C-m.3397A>G-m.3398T>C) at interior branches of the tree, preventing the formation of a dinucleotide STR (TATATA) in the MT-ND1 gene. At the most recent timescale in 124 mother-children transmissions, purifying selection was detectable ...

Research paper thumbnail of The genetic impact of the lake chad basin population in north africa as documented by mitochondrial diversity and internal variation of the L3e5 haplogroup

Annals of Human Genetics, 2013

Research paper thumbnail of Evaluating Purifying Selection in the Mitochondrial DNA of Various Mammalian Species

PLoS ONE, 2013

Mitochondrial DNA (mtDNA), the circular DNA molecule inside the mitochondria of all eukaryotic ce... more Mitochondrial DNA (mtDNA), the circular DNA molecule inside the mitochondria of all eukaryotic cells, has been shown to be under the effect of purifying selection in several species. Traditional testing of purifying selection has been based simply on ratios of nonsynonymous to synonymous mutations, without considering the relative age of each mutation, which can be determined by phylogenetic analysis of this non-recombining molecule. The incorporation of a mutation time-ordering from phylogeny and of predicted pathogenicity scores for nonsynonymous mutations allow a quantitative evaluation of the effects of purifying selection in human mtDNA. Here, by using this additional information, we show that purifying selection undoubtedly acts upon the mtDNA of other mammalian species/genera, namely Bos sp., Canis lupus, Mus musculus, Orcinus orca, Pan sp. and Sus scrofa. The effects of purifying selection were comparable in all species, leading to a significant major proportion of nonsynonymous variants with higher pathogenicity scores in the younger branches of the tree. We also derive recalibrated mutation rates for age estimates of ancestors of these various species and proposed a correction curve in order to take into account the effects of selection. Understanding this selection is fundamental to evolutionary studies and to the identification of deleterious mutations.

Research paper thumbnail of The First Modern Human Dispersals across Africa

PLoS ONE, 2013

The emergence of more refined chronologies for climate change and archaeology in prehistoric Afri... more The emergence of more refined chronologies for climate change and archaeology in prehistoric Africa, and for the evolution of human mitochondrial DNA (mtDNA), now make it feasible to test more sophisticated models of early modern human dispersals suggested by mtDNA distributions. Here we have generated 42 novel whole-mtDNA genomes belonging to haplogroup L0, the most divergent clade in the maternal line of descent, and analysed them alongside the growing database of African lineages belonging to L0's sister clade, L1'6. We propose that the last common ancestor of modern human mtDNAs (carried by "mitochondrial Eve") possibly arose in central Africa ~180 ka, at a time of low population size. By ~130 ka two distinct groups of anatomically modern humans co-existed in Africa: broadly, the ancestors of many modern-day Khoe and San populations in the south and a second central/ eastern African group that includes the ancestors of most extant worldwide populations. Early modern human dispersals correlate with climate changes, particularly the tropical African "megadroughts" of MIS 5 (marine isotope stage 5, 135-75 ka) which paradoxically may have facilitated expansions in central and eastern Africa, ultimately triggering the dispersal out of Africa of people carrying haplogroup L3 ~60 ka. Two south to east migrations are discernible within haplogroup LO. One, between 120 and 75 ka, represents the first unambiguous long-range modern human dispersal detected by mtDNA and might have allowed the dispersal of several markers of modernity. A second one, within the last 20 ka signalled by L0d, may have been responsible for the spread of southern clickconsonant languages to eastern Africa, contrary to the view that these eastern examples constitute relicts of an ancient, much wider distribution.

Research paper thumbnail of Mutational Spectrum and Linkage Disequilibrium Patterns at the Ornithine Transcarbamylase Gene (OTC)

Annals of Human Genetics, 2006

Ornithine transcarbamylase (OTC; EC 2.1.3.3) is a hepatic enzyme involved in ammonia elimination ... more Ornithine transcarbamylase (OTC; EC 2.1.3.3) is a hepatic enzyme involved in ammonia elimination via the urea cycle. Since the sequence of the OTC gene was reported many types of mutations continue to be found in OTC deficiency patients, continuing to increase the already wide mutational spectrum known for this gene. In this study we present the clinical, biochemical and molecular features of thirteen late-onset OTC deficiency patients. Mutations were identified in all these patients, among which six were novel point substitutions (L59R, A137P, L148S, Y176L, L186P, and K210N) and one was a 2-bp deletion at exon 4 (341-342delAA). In addition, a de novo genomic deletion of maternal origin encompassing exons 1 to 5 was also identified by the analysis of LD patterns using intragenic polymorphic markers. This work exemplifies the potential value of population genetic studies for the detection of large deletions.

Research paper thumbnail of Mitochondrial DNA Signals of Late Glacial Recolonization of Europe from Near Eastern Refugia

The American Journal of Human Genetics, 2012

Research paper thumbnail of Comparing Phylogeny and the Predicted Pathogenicity of Protein Variations Reveals Equal Purifying Selection across the Global Human mtDNA Diversity

The American Journal of Human Genetics, 2011

We used detailed phylogenetic trees for human mtDNA, combined with pathogenicity predictions for ... more We used detailed phylogenetic trees for human mtDNA, combined with pathogenicity predictions for each amino acid change, to evaluate selection on mtDNA-encoded protein variants. Protein variants with high pathogenicity scores were significantly rarer in the older branches of the tree. Variants that have formed and survived multiple times in the human phylogenetics tree had significantly lower pathogenicity scores than those that only appear once in the tree. We compared the distribution of pathogenicity scores observed on the human phylogenetic tree to the distribution of all possible protein variations to define a measure of the effect of selection on these protein variations. The measured effect of selection increased exponentially with increasing pathogenicity score. We found no measurable difference in this measure of purifying selection in mtDNA across the global population, represented by the macrohaplogroups L, M, and N. We provide a list of all possible single amino acid variations for the human mtDNA-encoded proteins with their predicted pathogenicity scores and our measured selection effect as a tool for assessing novel protein variations that are often reported in patients with mitochondrial disease of unknown origin or for assessing somatic mutations acquired through aging or detected in tumors.

Research paper thumbnail of Resolving the ancestry of Austronesian-speaking populations

There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), w... more There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), with genetic evidence invoked in support of both. The “out-of-Taiwan” model proposes a major Late Holocene expansion of Neolithic Austronesian speakers from Taiwan. An alternative, proposing that Late Glacial/postglacial sea-level rises triggered largely autochthonous dispersals, accounts for some otherwise enigmatic genetic patterns, but fails to explain the Austronesian language dispersal. Combining mitochondrial DNA (mtDNA), Y-chromosome and genome-wide data, we performed the most comprehensive analysis of the region to date, obtaining highly consistent results across all three systems and allowing us to reconcile the models. We infer a primarily common ancestry for Taiwan/ISEA populations established before the Neolithic, but also detected clear signals of two minor Late Holocene migrations, probably representing Neolithic input from both Mainland Southeast Asia and South China, via Taiwan. This latter may therefore have mediated the Austronesian language dispersal, implying small-scale migration and language shift rather than large-scale expansion.

Research paper thumbnail of Pleistocene-Holocene boundary in southern Arabia from the perspective of human mtDNA variation

American Journal of Physical Anthropology, 2012

It is now known that several population movements have taken place at different times through- ou... more It is now known that several population movements have taken place at different times through- out southern Arabian prehistory. One of the principal questions under debate is if the Early Holocene peopling of southern Arabia was mainly due to input from the Le- vant during the Pre-Pottery Neolithic B, to the expansion of an autochthonous population, or some combination of these demographic processes. Since previous genetic stud- ies have not been able to include all parts of southern Arabia, we have helped fill this lacuna by collecting new population datasets from Oman (Dhofar) and Yemen (Al- Mahra and Bab el-Mandab). We identified several new haplotypes belonging to haplogroup R2 and generated its whole genome mtDNA tree with age estimates under- taken by different methods. R2, together with other considerably frequent southern Arabian mtDNA haplogroups (R0a, HV1, summing up more than 20% of the South Ara- bian gene pool) were used to infer the past effective popu- lation size through Bayesian skyline plots. These data indicate that the southern Arabian population underwent a large expansion already some 12 ka. A founder analysis of these haplogroups shows that this expansion is largely attributed to demographic input from the Near East. These results support thus the spread of a population coming from the north, but at a significantly earlier date than presently considered by archaeologists. Our data suggest that some of the mtDNA lineages found in south- ern Arabia have persisted in the region since the end of the Last Ice Age.

Research paper thumbnail of 60,000 years of interactions between Central and Eastern Africa documented by major African mitochondrial haplogroup L2

Scientific reports, 2015

Mitochondrial DNA (mtDNA) haplogroup L2 originated in Western Africa but is nowadays spread acros... more Mitochondrial DNA (mtDNA) haplogroup L2 originated in Western Africa but is nowadays spread across the entire continent. L2 movements were previously postulated to be related to the Bantu expansion, but L2 expansions eastwards probably occurred much earlier. By reconstructing the phylogeny of L2 (44 new complete sequences) we provide insights on the complex net of within-African migrations in the last 60 thousand years (ka). Results show that lineages in Southern Africa cluster with Western/Central African lineages at a recent time scale, whereas, eastern lineages seem to be substantially more ancient. Three moments of expansion from a Central African source are associated to L2: (1) one migration at 70-50 ka into Eastern or Southern Africa, (2) postglacial movements (15-10 ka) into Eastern Africa; and (3) the southward Bantu Expansion in the last 5 ka. The complementary population and L0a phylogeography analyses indicate no strong evidence of mtDNA gene flow between eastern and sou...

Research paper thumbnail of A substantial prehistoric European ancestry amongst Ashkenazi maternal lineages

Nature Communications (2013) 4:2543

The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA is pas... more The origins of Ashkenazi Jews remain highly controversial. Like Judaism, mitochondrial DNA is passed along the maternal line. Its variation in the Ashkenazim is highly distinctive, with four major and numerous minor founders. However, due to their rarity in the general population, these founders have been difficult to trace to a source. Here we show that all four major founders, ~40% of Ashkenazi mtDNA variation, have ancestry in prehistoric Europe, rather than the Near East or Caucasus. Furthermore, most of the remaining minor founders share a similar deep European ancestry. Thus the great majority of Ashkenazi maternal lineages were not brought from the Levant, as commonly supposed, nor recruited in the Caucasus, as sometimes suggested, but assimilated within Europe. These results point to a significant role for the conversion of women in the formation of Ashkenazi communities, and provide the foundation for a detailed reconstruction of Ashkenazi genealogical history.

Research paper thumbnail of Relative Y-STR mutation rates estimated from the variance inside SNP defined lineages

International Congress Series, 2006

Y specific microsatellites (STRs) have been widely used in forensic and population genetics in ag... more Y specific microsatellites (STRs) have been widely used in forensic and population genetics in age estimates of human male lineages. Previously, estimates of mutation rates from father-son pairs have given quite variable results in different studies, essentially due to the rarity of mutations. We propose an indirect approach for determining relative mutation rates of Ychromosome microsatellites based on STR allele size intra-lineage variance. Indeed, the present distribution of STR alleles offers us an insight into the mechanisms that have generated that diversity. D

Research paper thumbnail of Climate Change and Postglacial Human Dispersals in Southeast Asia

Molecular Biology and Evolution, 2008

Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely... more Modern humans have been living in Island Southeast Asia (ISEA) for at least 50,000 years. Largely because of the influence of linguistic studies, however, which have a shallow time depth, the attention of archaeologists and geneticists has usually been focused on the last 6,000 years-in particular, on a proposed Neolithic dispersal from China and Taiwan. Here we use complete mitochondrial DNA (mtDNA) genome sequencing to spotlight some earlier processes that clearly had a major role in the demographic history of the region but have hitherto been unrecognized. We show that haplogroup E, an important component of mtDNA diversity in the region, evolved in situ over the last 35,000 years and expanded dramatically throughout ISEA around the beginning of the Holocene, at the time when the ancient continent of Sundaland was being broken up into the present-day archipelago by rising sea levels. It reached Taiwan and Near Oceania more recently, within the last ;8,000 years. This suggests that global warming and sea-level rises at the end of the Ice Age, 15,000-7,000 years ago, were the main forces shaping modern human diversity in the region.

Research paper thumbnail of The Arabian Cradle: Mitochondrial Relicts of the First Steps along the Southern Route out of Africa

The American Journal of Human Genetics, 2012

A major unanswered question regarding the dispersal of modern humans around the world concerns th... more A major unanswered question regarding the dispersal of modern humans around the world concerns the geographical site of the first human steps outside of Africa. The ''southern coastal route'' model predicts that the early stages of the dispersal took place when people crossed the Red Sea to southern Arabia, but genetic evidence has hitherto been tenuous. We have addressed this question by analyzing the three minor west-Eurasian haplogroups, N1, N2, and X. These lineages branch directly from the first non-African founder node, the root of haplogroup N, and coalesce to the time of the first successful movement of modern humans out of Africa,~60 thousand years (ka) ago. We sequenced complete mtDNA genomes from 85 Southwest Asian samples carrying these haplogroups and compared them with a database of 300 European examples. The results show that these minor haplogroups have a relict distribution that suggests an ancient ancestry within the Arabian Peninsula, and they most likely spread from the Gulf Oasis region toward the Near East and Europe during the pluvial period 55-24 ka ago. This pattern suggests that Arabia was indeed the first staging post in the spread of modern humans around the world.

Research paper thumbnail of Musilová et al. - 2011 - Population history of the Red Sea-genetic exchanges between the Arabian Peninsula and East Africa signaled in t

Archaeological studies have revealed cultural connections between the two sides of the Red Sea da... more Archaeological studies have revealed cultural connections between the two sides of the Red Sea dating to prehistory. The issue has still not been properly addressed, however, by archaeogenetics. We focus our attention here on the mitochondrial haplogroup HV1 that is present in both the Arabian Peninsula and East Africa. The internal variation of 38 complete mitochondrial DNA sequences (20 of them presented here for the first time) affiliated into this haplogroup testify to its emergence during the late glacial maximum, most probably in the Near East, with subsequent dispersion via in Wiley Online Library (wileyonlinelibrary.com).