Mitochondrial and Y-chromosome diversity of the Tharus (Nepal): a reservoir of genetic variation - PubMed (original) (raw)

Mitochondrial and Y-chromosome diversity of the Tharus (Nepal): a reservoir of genetic variation

Simona Fornarino et al. BMC Evol Biol. 2009.

Abstract

Background: Central Asia and the Indian subcontinent represent an area considered as a source and a reservoir for human genetic diversity, with many markers taking root here, most of which are the ancestral state of eastern and western haplogroups, while others are local. Between these two regions, Terai (Nepal) is a pivotal passageway allowing, in different times, multiple population interactions, although because of its highly malarial environment, it was scarcely inhabited until a few decades ago, when malaria was eradicated. One of the oldest and the largest indigenous people of Terai is represented by the malaria resistant Tharus, whose gene pool could still retain traces of ancient complex interactions. Until now, however, investigations on their genetic structure have been scarce mainly identifying East Asian signatures.

Results: High-resolution analyses of mitochondrial-DNA (including 34 complete sequences) and Y-chromosome (67 SNPs and 12 STRs) variations carried out in 173 Tharus (two groups from Central and one from Eastern Terai), and 104 Indians (Hindus from Terai and New Delhi and tribals from Andhra Pradesh) allowed the identification of three principal components: East Asian, West Eurasian and Indian, the last including both local and inter-regional sub-components, at least for the Y chromosome.

Conclusion: Although remarkable quantitative and qualitative differences appear among the various population groups and also between sexes within the same group, many mitochondrial-DNA and Y-chromosome lineages are shared or derived from ancient Indian haplogroups, thus revealing a deep shared ancestry between Tharus and Indians. Interestingly, the local Y-chromosome Indian component observed in the Andhra-Pradesh tribals is present in all Tharu groups, whereas the inter-regional component strongly prevails in the two Hindu samples and other Nepalese populations.The complete sequencing of mtDNAs from unresolved haplogroups also provided informative markers that greatly improved the mtDNA phylogeny and allowed the identification of ancient relationships between Tharus and Malaysia, the Andaman Islands and Japan as well as between India and North and East Africa. Overall, this study gives a paradigmatic example of the importance of genetic isolates in revealing variants not easily detectable in the general population.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Geographic map of Nepal. Sampled areas, in circles.

Figure 2

Figure 2

Phylogeny and frequencies (%) of mtDNA haplogroups in the populations studied. Haplogroups (East Asian in grey; West Eurasian in white; Indian in black) were assigned on the basis of both the control-region motifs and the coding-region polymorphisms [see Additional file 1] following published criteria (see Materials and Methods). Coding-region markers are reported as mutated nucleotide positions according to the rCRS [17] Mutations are transitions unless a base change is explicitly indicated. The 9-bp polymorphism: deletion = del; insertion = ins. Haplogroups with an asterix (*) include samples negative for the examined sub-groups.

Figure 3

Figure 3

Phylogeny and frequencies (%) of Y-chromosome haplogroups in the populations studied. Haplogroups: East Asian in grey; West Eurasian in white; Indian in black. The nomenclature and the hierarchical order of the mutations are according to the Y-Chromosome Consortium [62,67,73], updated with more recent markers: M429 [38]; M481 and P31 T-del (present study). The nomenclature of haplogroup H differs from that presented by Karafet et al. [73], in that all of our M82 samples were also M370 positive. H: intra-population haplogroup diversity, according to Nei [41]. In italics: markers not found. In parentheses: markers inferred. Haplogroups with an asterix* include samples negative for the examined sub-groups.

Figure 4

Figure 4

Phylogenetic tree of 51 mtDNA sequences. Mutations are scored relative to the rCRS [17]For the tree construction, the length variation in the poly-C stretch at np 16193 was not used, while the variation at np 309 is indicated only when phylogenetically relevant. Mutations are shown on the branches. They are transitions unless a base change is explicitly indicated; insertions are suffixed with a plus sign (+) and the inserted nucleotide(s), and deletions are characterized by "d"; back mutations are highlighted by "@"; recurrent mutations are underlined. Dating is reported in kilo years. Some sequences are incomplete: * from 3694 to 3738 nps; ** from 8353 to 8472 nps. For the ethnic/geographic origins of the samples, see Additional file 2. Population codes: Th-CI and Th-CII: Central Tharus; Th-E: Eastern Tharus; H-ND: Hindus from New Delhi; H-Te: Hindus from Terai; AP: Andhra Pradesh; UP: Uttar Pradesh; N-In, NE-In, S-In: North, North-East, South Indians, respectively; PK: Pakistan. Symbols surrounded by a circle are from literature. (a) Nomenclature different from that (M45) reported in the “mtDNA tree Build 5 (8 Jul 2009)” (

).

Figure 5

Figure 5

Phylogenetic tree of 30 mtDNA sequences. Mutations are scored relative to the rCRS [17]For the tree construction, the length variation in the poly-C stretch at np 16193 was not used, while the variation at np 309 is indicated only when phylogenetically relevant. Mutations are shown on the branches. They are transitions unless a base change is explicitly indicated; insertions are suffixed with a plus sign (+) and the inserted nucleotide(s), and deletions are characterized by “d”; back mutations are highlighted by “@”; recurrent mutations (considered in the global phylogeny of the 81 mtDNAs) are underlined. Dating is reported in kilo years. * Sequence incomplete from 411 to 628 and from 16189 to 16290 nps. For the ethnic/geographic origins of the samples, see Additional file 2. Population codes: Th-CI and Th-CII: Central Tharus; Th-E: Eastern Tharus; AP: Andhra Pradesh; UP: Uttar Pradesh; Pun: Punjab; NE-In: North-East Indians; E-In: East Indians; PK: Pakistan; Mal: Malaysia; And: Andaman Islands. Symbols surrounded by a circle are from literature. (a) Nomenclature different from that (M13b) reported in the “mtDNA tree Build 5 (8 Jul 2009)” (

).

Figure 6

Figure 6

Histograms of the mtDNA (a) and Y-chromosome (b) components observed in the populations studied. Sample sizes are in parentheses.

Figure 7

Figure 7

Principal component analysis of mtDNA haplogroup frequencies. Comparison samples from Western Eurasia (Iran): Irn-W, Irn-E, Irn-C, Irn-SW, Irn-SE [12]; Indian subcontinent: AP, Andhra Pradesh [55]; WB-1, Castes from Bengal; WB-2, Kurmis from West Bengal; WB-3, Lodhas from West Bengal; Pj, Punjab; Rj, Rajput; Pa, Parsi; Gj, Gujarat; UP-2, Brahmins from Uttar Pradesh [12]; Th-UP, Tharus from Uttar Pradesh [12,56]; UP-1, Uttar Pradesh; Lb, Lobana group [56]; Pk, Karachis [42]; East Asia: Han-SE, Guandong [58]; Uzb, Uzbek; Uyg, Uygur; Kaz, Kazak; Mong, Mongolia; Hui, Xinjiang [59,60]; and Indonesia: Su, Sumatra; Bo, Borneo; Jv, Java; Ba, Bali; Lk, Lombok; Sm, Sumba; Am, Ambon [44]. Data have been normalized to the common level of analysis. On the whole, 26% of the total variance is represented: 15% by the first PC and 11% by the second PC.

Figure 8

Figure 8

Principal component analysis of Y-chromosome haplogroup frequencies. (a) Comparison with Nepalese and Tibetan groups [37]; (b) Comparison with some Indian caste and tribal groups [15] where our data have been normalized to the Sengupta level of resolution. Populations: Kat, Kathmandu; New, Newar; Tam, Tamang; Tib, Tibet; DC Dravidian castes; IEC, Indo-European castes; AAT, Austro-Asiatic tribals; TBT, Tibeto-Burman tribals; DT, Dravidian tribals; IET, Indo-European tribals.

Figure 9

Figure 9

Histograms of the Indian local (Hgs: C5, F, H, L1, O2a1a1) and inter-regional (Hgs: L* and R) components observed in the populations of the present study compared with other Nepalese groups. Sample sizes are in parentheses. (a) Gayden et al. [37]

Similar articles

Cited by

References

    1. Terrenato L, Shrestha S, Dixit KA, Luzzatto L, Modiano G, Morpurgo G, Arese P. Decreased malaria morbidity in the Tharu people compared to sympatric populations in Nepal. Ann Trop Med Parasitol. 1988;82:1–11. - PubMed
    1. Chopra VP. Studies on serum groups in the Kumaon region, India. Humangenetik. 1970;10:35–43. doi: 10.1007/BF00297638. - DOI - PubMed
    1. Bista DB. People of Nepal. Kathmandu, Nepal: Ratna Pustak Bhandar; 1980.
    1. Brega A, Gardella R, Semino O, Morpurgo G, Astaldi Ricotti GB, Wallace DC, Santachiara-Benerecetti AS. Genetic studies on the Tharu population of Nepal, restriction endonuclease polymorphisms of mitochondrial DNA. Am J Hum Genet. 1986;39:502–512. - PMC - PubMed
    1. Passarino G, Semino O, Pepe G, Shrestha SL, Modiano G, Santachiara Benerecetti AS. MtDNA polymorphisms among Tharus of eastern Terai (Nepal) Gene Geography. 1992;6:139–147. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources