Review
RNA-based gene duplication: mechanistic and evolutionary insights
Henrik Kaessmann et al. Nat Rev Genet. 2009 Jan.
Abstract
Gene copies that stem from the mRNAs of parental source genes have long been viewed as evolutionary dead-ends with little biological relevance. Here we review a range of recent studies that have unveiled a significant number of functional retroposed gene copies in both mammalian and some non-mammalian genomes. These studies have not only revealed previously unknown mechanisms for the emergence of new genes and their functions but have also provided fascinating general insights into molecular and evolutionary processes that have shaped genomes. For example, analyses of chromosomal gene movement patterns via RNA-based gene duplication have shed fresh light on the evolutionary origin and biology of our sex chromosomes.
Figures
Figure 1. Mechanism of gene retroposition
(A) Gene retroposition begins with the transcription of a parental gene by RNA polymerase II and (B) further processing of its RNA (splicing and polyadenylation), which produces a mature mRNA. (C) Gene retroposition is mediated by the L1 endonuclease domain (pink hourglass), which creates a first nick (yellow star) at the genomic site of insertion at the TTAAAA target sequence. (D) This nick enables the priming of reverse transcription (by the L1 reverse transcriptase domain; pink oval shape), which uses the parental mRNA as template. (E) Second strand nick generation (precise mechanism not known). (F) Second DNA strand synthesis (precise mechanism not known). (G) Complementary DNA synthesis in the overhang regions created by the two nicks duplicates the sequence flanking the target sequence. These direct repeats are one of the molecular signatures of gene retroposition, in addition to the lack of introns and the presence of a poly-A tail (the direct repeats and the poly-A tail degenerate over time and are therefore usually detectable only in recent retrocopies). The illustration is based on findings described in references -.
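Two of the molecular signatures named in panel (G), the poly-A tail and the flanking direct repeats, can be illustrated with a minimal Python sketch. The function names, the exact-match requirement, and the length thresholds below are assumptions chosen for illustration and do not come from the review; intron loss is not checked here, since that would require an alignment against the parental gene.

```python
def polya_tail_length(insert: str) -> int:
    """Count the uninterrupted run of A's at the 3' end of the inserted sequence."""
    insert = insert.upper()
    return len(insert) - len(insert.rstrip("A"))


def target_site_duplication(upstream: str, downstream: str,
                            min_len: int = 5, max_len: int = 20) -> str:
    """Return the longest exact direct repeat flanking the insertion.

    The repeat is taken as the longest suffix of the upstream flank that
    is identical to a prefix of the downstream flank. min_len and max_len
    are illustrative thresholds (assumptions), and exact matching is a
    simplification: repeats in older retrocopies have usually degenerated.
    """
    upstream, downstream = upstream.upper(), downstream.upper()
    limit = min(max_len, len(upstream), len(downstream))
    for k in range(limit, min_len - 1, -1):
        if upstream[-k:] == downstream[:k]:
            return upstream[-k:]
    return ""


def retrocopy_signatures(upstream: str, insert: str, downstream: str,
                         min_polya: int = 10) -> dict:
    """Summarize the two sequence signatures for one candidate insertion."""
    tail = polya_tail_length(insert)
    return {
        "polya_tail_length": tail,
        "has_polya_tail": tail >= min_polya,  # min_polya is an assumed cutoff
        "direct_repeat": target_site_duplication(upstream, downstream),
    }


if __name__ == "__main__":
    # Toy sequences invented for this example, not real genomic data.
    upstream = "GGCCTTAAGAAATGCA"
    insert = "ATGGCCGATTAG" + "A" * 15
    downstream = "AAGAAATGCATTGGCC"
    print(retrocopy_signatures(upstream, insert, downstream))
```

Because both signatures erode over time, a strict check of this kind would be expected to flag mainly recent retrocopies, in line with the caveat in the caption.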
Figure 2. Source of retrogene promoters
The figure illustrates various scenarios that lead to the transcription of retroposed gene copies. (A) Retrocopies may insert into intronic sequences of host genes. The evolution and/or presence of splicing signals enable these copies to be integrated into new splice variants of their host gene. Depending on the localization of these new splice sites, these variants result in either non-coding fusion transcripts (where the entire open reading frame derives from the retrocopy) or coding sequence fusions (the coding region of the retrocopy is fused to that of the host gene). (B) The insertion of retrocopies into actively transcribed regions with an open chromatin structure facilitates their transcription, due to the increased accessibility for the transcriptional machinery. The presence of enhancer elements from neighboring genes and weak transcription promoting sequences (not previously associated with genes) can further strengthen their transcriptional activity. (C) Recruitment of distant promoters in the genomic neighborhood via the acquisition of a new untranslated exon/intron structure. (D) Recruitment of promoters from retrotransposons or CpG proto-promoters. (E) Inheritance of parental promoters through alternative transcriptional start site usage of the parental gene. (F) De novo promoter evolution in the 5′ flanking region of the insertion site by single nucleotide substitutions.
Figure 3. Subcellular adaptation of proteins encoded by new duplicate genes
(A) Illustration of two scenarios for the evolution of duplicated genes (red and green) and their products. Each gene and its encoded protein are represented with one color. Distinct protein shapes indicate distinct functions. Three different protein localizations (cytosolic, endoplasmic reticulum, or secreted proteins) are indicated in a schematic cell. Positively selected substitutions responsible for subcellular changes or changes in protein function are indicated (arrows). See main text for references and further details. (B) Adaptive evolution of two primate-specific retrogenes (GLUD2 left, CDC14Bretro right). Phylogenetic trees indicate retroduplication events. Periods of adaptive evolution and reconstructed subcellular localizations are indicated. Microscopy images display representative subcellular phenotypes for the indicated branches. Markers on the left: protein localization (green), nuclear DNA (blue), and microtubules (red). Yellow signals indicate an overlap of the protein with microtubules. Markers on the right: protein localization (green) and mitochondria (red).
Figure 4. Origin of TRIM5-CypA gene fusions in macaques and owl monkeys
(A) Retroposition of CypA into an intron of the TRIM5 gene in macaques and the resulting fusion gene are shown (similar to the process displayed in Fig. 2A). (B) An independent retroposition of CypA into the UTR of TRIM5 in owl monkeys is shown, also resulting in a new TRIM5-CypA fusion gene. Please refer to Fig. 2 for the colour code and to the main text for details.
Figure 5. Retrogenes, MSCI, and the emergence of mammalian sex chromosomes
(A, upper part) Illustration of the retroposition of an X-linked parental gene to an autosome. (A, lower part) Illustration of the expression of X-linked parental genes and their autosomal retrogene copies before (in spermatogonial cells), during (in spermatocytes), and after (in spermatids) the process of meiotic sex chromosome inactivation (MSCI). (B) The evolutionary onset of the selectively driven out-of-X retroduplication process and of MSCI, as well as the inferred origin of therian (eutherian/placental mammal and metatherian/marsupial) sex chromosomes. See main text for further explanations.
References
- Long M, Betran E, Thornton K, Wang W. The origin of new genes: Glimpses from the young and old. Nat Rev Genet. 2003;4:865–875.
- Ohno S. Evolution by Gene Duplication. Springer Verlag; Berlin: 1970.
- Wolfe KH, Li WH. Molecular evolution meets the genomics revolution. Nat Genet. 2003;33(Suppl):255–65.
- Prince VE, Pickett FB. Splitting pairs: the diverging fates of duplicated genes. Nat Rev Genet. 2002;3:827–37.
- Lynch M. The origins of genome architecture. Sinauer Associates; Sunderland, USA: 2007.