The Human Genome Project Reveals a Continuous Transfer of Large Mitochondrial Fragments to the Nucleus (original) (raw)

Journal Article

,

Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Copenhagen, Denmark

Search for other works by this author on:

,

Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Copenhagen, Denmark

Search for other works by this author on:

,

Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Copenhagen, Denmark

Search for other works by this author on:

Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Copenhagen, Denmark

Search for other works by this author on:

Published:

01 September 2001

Cite

Tobias Mourier, Anders J. Hansen, Eske Willerslev, Peter Arctander, The Human Genome Project Reveals a Continuous Transfer of Large Mitochondrial Fragments to the Nucleus, Molecular Biology and Evolution, Volume 18, Issue 9, September 2001, Pages 1833–1837, https://doi.org/10.1093/oxfordjournals.molbev.a003971
Close

Navbar Search Filter Mobile Enter search term Search

Mitochondrial genomes are believed to gradually transfer DNA fragments (numts) into the nuclear chromosomes of eukaryotic cells during evolution (reviewed in Zhang and Hewitt 1996 ). This assumption relies on hybridization studies of mitochondrial DNA sequences (mtDNA) (Tsuzuki et al. 1983 ), sequencing of numts (e.g., Lopez et al. 1994 ; Arctander 1995 ; Zischler et al. 1995 ; Herrnstadt et al. 1999 ), and similarity searches in sequence databases (Blanchard and Schmidt 1996 ; Bensasson et al. 2001 ). Here we present the first extensive analysis of numts in the human nuclear genome. Through a combination of conventional BLAST alignment (Altschul et al. 1997 ) and a DNA block aligning (DBA) algorithm (Jareborg, Birney, and Durbin 1999 ), we searched roughly 93.5% of the human genome (http://www.ncbi.nlm.nih.gov/genome/seq/) for numts. This approach revealed three notable findings. First, several numts exceed the size of the longest human numt reported to date (Herrnstadt et al. 1999 ). Second, all parts of the mitochondrial DNA are represented in the nuclear genome. Finally, the integration of mtDNAs into the nucleus is a continuous evolutionary process, thereby verifying previous beliefs (Zhang and Hewitt 1996 ; Wallace et al. 1997 ; Herrnstadt et al. 1999 ).

Through the web service provided by NCBI (http://www.ncbi.nlm.nih.gov/), we compared the complete human mitochondrial DNA and the working draft of the human nuclear genome (as of mid-April 2001) using BLAST. This procedure was followed by alignment using the DBA algorithm (Jareborg, Birney, and Durbin 1999 ), which found collinear blocks of conserved sequence allowing for indels between blocks. The rationale for this twofold alignment procedure stems from the assumption that two mechanisms may obscure the BLAST alignment. First, the extant mtDNA will have diverged from the ancestral sequence. Second, as the numts are presumably released from selection, larger deletions and insertions may take place.

Hits from the BLAST search (default settings) in the same sense and within the vicinity (4–6,128 bp) of each other were assessed to potentially stem from a single insertion event. If such a group of hits involved more than 100 identical positions, the genomic sequence covering all the hits and their intervening sequences were retrieved. This sequence was aligned to the corresponding mtDNA sequence using the DBA algorithm. The sequences were considered a result of a single insertion event if the DBA algorithm was able to align more than 80% of the mtDNA sequence in a collinear way.

Following the above criteria, we found 296 numts ranging between 106 and 14,654 bp in size (table 1 ). Fifteen of these were found to be longer than 5,842 bp, previously reported by Herrnstadt et al. (1999) as the length of the longest human numt.

Furthermore, we found that all positions of the mitochondrial genome are represented in the nuclear DNA, with the domain comprising the control region being relatively underrepresented (fig. 1 ). As this could be an artifact caused by the distal position of the control region in the linear mtDNA sequence, we constructed an alternative representation in which the control region was central. Neither this nor the removal of the low-complexity filter of BLAST produced additional hits to this region (not shown). The deficiency of numts from the control region probably results from the significantly higher evolutionary rate of extant mtDNA in this region (Saccone, Pesole, and Sbisá 1991 ). This hypothesis is further supported by the increased number of numts in the region comprising the central conserved domain (fig. 1 ).

Interestingly, we found 4 numts covering the complete control region (table 1 ), signifying that these are at least the result of a DNA-based transfer (for a discussion see Shay and Werbin [1992] and references therein).

To estimate the time of insertion of the numts, we collected all numt-mitochondria alignments longer than 2,000 bp (i.e., either complete numts, if they were completely alignable, or subsets of numts of which DBA blocks exceeded 2,000 bp) and aligned these with the corresponding mtDNA sequences from a variety of mammals. The phylogenetic analysis supported the general conviction that numt DNAs are continually integrated into the nuclear genome as a result of several independent evolutionary events (fig. 2 ).

Since we used the working draft of the human nuclear genome for analysis, we cannot exclude that some of the recent integration events are simply due to erroneous sequencing of mitochondrial contamination. However, this will not change the above conclusions. On the contrary, the above findings may be an underestimate, since recently transferred numts may not have reached fixation (e.g., Zischler et al. 1995 ) and therefore may not be present in the available human genome draft.

This study presents the first extensive large-scale survey of human numts based on the human genome project—an initial step on the way to a complete catalog of human numts.

As previously stated (Perna and Kocher 1996 ), human numts may serve as both obstacles and tools in understanding the evolution of the human mitochondria. For example, the large number of long numts can confound studies on mitochondrial heteroplasmy as well as phylogenetic and population studies using mtDNA markers. For these studies, decisive knowledge of human numts may be crucial in detecting erroneous results due to false amplification of nuclear homologs.

On the contrary, since numts may be regarded as “molecular fossils” of mtDNA (Zischler, Geisert, and Castresana 1998 ), they may provide fruitful insight into the evolution of modern human mitochondria and help to uncover the evolutionary basis of contemporary human diseases related to the genetics of the mitochondria.

Supplementary Materials

A table of all 296 human numts is provided on the Molecular Biology and Evolution web site.

Pekka Pamilo, Reviewing Editor

1

Keywords: mitochondrial DNA nuclear insertions human genome

2

Address for correspondence and reprints: Tobias Mourier, Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen Ø, Denmark. tmourier@zi.ku.dk .

Table 1 The 60 Longest Human Numts

Table 1 The 60 Longest Human Numts

Table 1 The 60 Longest Human Numts

Table 1 The 60 Longest Human Numts

Table 1 Continued

Table 1 Continued

Fig. 1.—Circular diagram of the number of numts descending from a given position in the mitochondria (thick line). The inner hatched circle depicts the mitochondria, with the two hypervariable segments of the control region (encompassing the central conserved domain) highlighted (black)

Fig. 1.—Circular diagram of the number of numts descending from a given position in the mitochondria (thick line). The inner hatched circle depicts the mitochondria, with the two hypervariable segments of the control region (encompassing the central conserved domain) highlighted (black)

Fig. 2.—Consensus tree of the phylogenetic positions of human numts, based on 35 individual bootstrap analyses of all blocks from the DBA alignment longer than 2,000 bp. The trees were constructed in PAUP*, version 4.0b4a (Swofford 1998 ), using the neighbor-joining algorithm based on maximum-likelihood (ML) distance measures. The shape parameters of the gamma distributions, ∝ (0.28–0.43), and the transition-transversion rates (1.6–2.9) were estimated using ML. Six branching points are depicted on the tree (A–F). On the basis of 100% support with 100 bootstrap replicates and Platypus as the outgroup, numts could be confined to one or more of the branching points, as shown below the tree. For example, numts listed in the gray box (A) have 100% bootstrap support positioned at branching point A, whereas numts listed in the box (A–B) with the same support only can be confined to either branching point A or branching point B. Needless to say, numts in the box covering all positions (A–F) are restricted to the primate clade, but their exact position is undetermined. If two or more alignment blocks come from the same numt, these have letter suffixes (see table 1 for details). The following mtDNA sequences were used (GenBank accession numbers in parentheses): human (Homo sapiens; NC_001807), chimpanzee (Pan troglodytes; NC_001643), gorilla (Gorilla gorilla; NC_001645), Orangutan (Pongo pygmaeus; NC_001646), gibbon (Hylobates lar; NC_002082), baboon (Papio hamadryas; NC_001992), wallaroo (Macropus robustus; NC_001794), opossum (Didelphis virginiana; NC_001610), and platypus (Ornithorhynchus anatinus; NC_000891). Nonprimate placentals: alpaca (Lama pacos; NC_002504), armadillo (Dasypus novemcinctus; NC_001821), bat (Chalinolobus tuberculatus; NC_002626), cat (Felis catus; NC_001700), cow (Bos taurus; NC_001567), European hedgehog (Erinaceus europaeus; NC_002080), flying fox (Pteropus scapulatus; NC_002619), guinea pig (Cavia porcellus; NC_000884), Madagascar hedgehog (Echinops telfairi; NC_002631), rabbit (Oryctolagus cuniculus; NC_001913), squirrel (Sciurus vulgaris; NC_002369), and tree shrew (Tupaia belangeri; NC_002521)

Fig. 2.—Consensus tree of the phylogenetic positions of human numts, based on 35 individual bootstrap analyses of all blocks from the DBA alignment longer than 2,000 bp. The trees were constructed in PAUP*, version 4.0b4a (Swofford 1998 ), using the neighbor-joining algorithm based on maximum-likelihood (ML) distance measures. The shape parameters of the gamma distributions, ∝ (0.28–0.43), and the transition-transversion rates (1.6–2.9) were estimated using ML. Six branching points are depicted on the tree (A–F). On the basis of 100% support with 100 bootstrap replicates and Platypus as the outgroup, numts could be confined to one or more of the branching points, as shown below the tree. For example, numts listed in the gray box (A) have 100% bootstrap support positioned at branching point A, whereas numts listed in the box (A–B) with the same support only can be confined to either branching point A or branching point B. Needless to say, numts in the box covering all positions (A–F) are restricted to the primate clade, but their exact position is undetermined. If two or more alignment blocks come from the same numt, these have letter suffixes (see table 1 for details). The following mtDNA sequences were used (GenBank accession numbers in parentheses): human (Homo sapiens; NC_001807), chimpanzee (Pan troglodytes; NC_001643), gorilla (Gorilla gorilla; NC_001645), Orangutan (Pongo pygmaeus; NC_001646), gibbon (Hylobates lar; NC_002082), baboon (Papio hamadryas; NC_001992), wallaroo (Macropus robustus; NC_001794), opossum (Didelphis virginiana; NC_001610), and platypus (Ornithorhynchus anatinus; NC_000891). Nonprimate placentals: alpaca (Lama pacos; NC_002504), armadillo (Dasypus novemcinctus; NC_001821), bat (Chalinolobus tuberculatus; NC_002626), cat (Felis catus; NC_001700), cow (Bos taurus; NC_001567), European hedgehog (Erinaceus europaeus; NC_002080), flying fox (Pteropus scapulatus; NC_002619), guinea pig (Cavia porcellus; NC_000884), Madagascar hedgehog (Echinops telfairi; NC_002631), rabbit (Oryctolagus cuniculus; NC_001913), squirrel (Sciurus vulgaris; NC_002369), and tree shrew (Tupaia belangeri; NC_002521)

We thank Douda Bensasson, Kasi B. Desfor, Sylvia Mathiasen, and Seirian Sumner for help and discussions. A.J.H. and E.W. were supported by the VELUX foundation of 1981, Denmark. A.J.H. and E.W. contributed equally to this work and should be regarded as joint authors.

References

Altschul S. F., T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, D. J. Lipman,

1997

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

Nucleic Acids Res

25

:

3389

-3402

Arctander P.,

1995

Comparison of a mitochondrial gene and a corresponding nuclear pseudogene

Proc. R. Soc. Lond. B Biol. Sci

262

:

13

-19

Bensasson D., D.-X. Zhang, D. Hartl, G. Hewitt,

2001

Mitochondrial pseudogenes: evolution's misplaced witnesses

Trends Ecol. Evol

16

:

314

-321

Blanchard J. L., G. W. Schmidt,

1996

Mitochondrial DNA migration events in yeast and humans: integration by a common end-joining mechanism and alternative perspectives on nucleotide substitution patterns

Mol. Biol. Evol

13

:

537

-548

Herrnstadt C., W. Clevenger, S. S. Ghosh, C. Anderson, E. Fahy, S. Miller, N. Howell, R. E. Davis,

1999

A novel mitochondrial DNA-like sequence in the human nuclear genome

Genomics

60

:

67

-77

Jareborg N., E. Birney, R. Durbin,

1999

Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs

Genome Res

9

:

815

-824

Lopez J. V., N. Yuhki, R. Masuda, W. Modi, S. J. O'Brien,

1994

Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat

J. Mol. Evol

39

:

174

-190

Perna N. T., T. D. Kocher,

1996

Mitochondrial DNA: molecular fossils in the nucleus

Curr. Biol

6

:

128

-129

Saccone C., G. Pesole, E. Sbis,

1991

The main regulatory region of mammalian mitochondrial DNA: structure-function model and evolutionary pattern

J. Mol. Evol

33

:

83

-91

Shay J. W., H. Werbin,

1992

New evidence for the insertion of mitochondrial DNA into the human genome: significance for cancer and aging

Mutat. Res

275

:

227

-235

Swofford D. L.,

1998

PAUP*: phylogenetic analysis using parsimony (*and other methods). Version 4 Sinauer, Sunderland, Mass

Tsuzuki T., H. Nomiyama, C. Setoyama, S. Maeda, K. Shimada,

1983

Presence of mitochondrial-DNA-like sequences in the human nuclear DNA

Gene

25

:

223

-229

Wallace D. C., C. Stugard, D. Murdock, T. Schurr, M. D. Brown,

1997

Ancient mtDNA sequences in the human nuclear genome: a potential source of errors in identifying pathogenic mutations

Proc. Natl. Acad. Sci. USA

94

:

14900

-14905

Zhang D.-X., G. M. Hewitt,

1996

Nuclear integrations: challenges for mitochondrial DNA markers

Trends Ecol. Evol

11

:

247

-251

Zischler H., H. Geisert, A. von Haeseler, S. Pääbo,

1995

A nuclear ‘fossil’ of the mitochondrial D-loop and the origin of modern humans

Nature

378

:

489

-492

Zischler H., H. Geisert, J. Castresana,

1998

A hominoid-specific nuclear insertion of the mitochondrial d-loop: implications for reconstructing ancestral mitochondrial sequences

Mol. Biol. Evol

15

:

463

-469

Citations

Views

Altmetric

Metrics

Total Views 2,274

1,710 Pageviews

564 PDF Downloads

Since 11/1/2016

Month: Total Views:
November 2016 1
February 2017 11
March 2017 5
April 2017 11
May 2017 8
June 2017 14
July 2017 6
August 2017 11
September 2017 3
November 2017 10
December 2017 21
January 2018 15
February 2018 15
March 2018 29
April 2018 19
May 2018 48
June 2018 43
July 2018 19
August 2018 21
September 2018 18
October 2018 27
November 2018 30
December 2018 19
January 2019 14
February 2019 17
March 2019 28
April 2019 34
May 2019 23
June 2019 21
July 2019 20
August 2019 17
September 2019 46
October 2019 29
November 2019 29
December 2019 30
January 2020 24
February 2020 31
March 2020 29
April 2020 36
May 2020 38
June 2020 10
July 2020 43
August 2020 15
September 2020 33
October 2020 27
November 2020 29
December 2020 16
January 2021 31
February 2021 19
March 2021 15
April 2021 42
May 2021 26
June 2021 32
July 2021 11
August 2021 32
September 2021 19
October 2021 35
November 2021 34
December 2021 21
January 2022 27
February 2022 22
March 2022 36
April 2022 28
May 2022 41
June 2022 34
July 2022 40
August 2022 22
September 2022 25
October 2022 40
November 2022 31
December 2022 25
January 2023 20
February 2023 21
March 2023 16
April 2023 22
May 2023 22
June 2023 13
July 2023 11
August 2023 26
September 2023 13
October 2023 17
November 2023 27
December 2023 24
January 2024 45
February 2024 26
March 2024 38
April 2024 30
May 2024 39
June 2024 18
July 2024 15
August 2024 23
September 2024 35
October 2024 42

Citations

144 Web of Science

×

Email alerts

Email alerts

Citing articles via

More from Oxford Academic