Natural mutagenesis of human genomes by endogenous retrotransposons - PubMed (original) (raw)
Natural mutagenesis of human genomes by endogenous retrotransposons
Rebecca C Iskow et al. Cell. 2010.
Abstract
Two abundant classes of mobile elements, namely Alu and L1 elements, continue to generate new retrotransposon insertions in human genomes. Estimates suggest that these elements have generated millions of new germline insertions in individual human genomes worldwide. Unfortunately, current technologies are not capable of detecting most of these young insertions, and the true extent of germline mutagenesis by endogenous human retrotransposons has been difficult to examine. Here, we describe technologies for detecting these young retrotransposon insertions and demonstrate that such insertions indeed are abundant in human populations. We also found that new somatic L1 insertions occur at high frequencies in human lung cancer genomes. Genome-wide analysis suggests that altered DNA methylation may be responsible for the high levels of L1 mobilization observed in these tumors. Our data indicate that transposon-mediated mutagenesis is extensive in human genomes and is likely to have a major impact on human biology and diseases.
Figures
Figure 1. Strategy to sequence retrotransposon insertion junctions
The approaches that were used to sequence retrotransposon insertion junctions are depicted. ABI (left side) and pyrosequencing (right side). The “transposon” is either L1-Ta or Alu. Human genomic DNA is digested with a restriction endonuclease and ligated to a linker. The linker is partially double-stranded with a 3′ amine group on the short strand. This prevents amplification of random genomic DNA from the linker primers. Amplification only occurs if there is extension from the transposon-specific primer. This completes the double-stranded linker and creates the sequence for the linker-specific primer to anneal to, thus allowing the PCR reaction to proceed. Left side-ABI sequencing. After an initial PCR amplification, a second round of PCR is performed with nested primers. Second round PCR products are cloned into pBLUESCRIPT using a restriction site in the nested retrotransposon primer and another restriction site in the linker. Resulting colonies are sent for ABI Sanger capillary sequencing. Right side-pyrosequencing. After an initial PCR amplification, a second PCR is performed with nested primers. The retrotransposon nested primer contains the “A” adapter sequence for 454 sequencing whereas the linker-specific primer contains an 8 bp unique barcode for each sample and the “B” adapter sequence. Samples are pooled in equal molar ratios for emulsion PCR with beads binding only the “A” end. Thus sequencing occurs from the “B” end only, avoiding possible problems with sequencing through the poly (A) tail of L1. A similar approach was used with Alu except the 5’ junctions were amplified and sequenced. Please see Experimental Procedures and Extended Experimental Procedures for primers and additional details.
Figure 2. Characterization of novel L1 elements
A) For both ABI sequencing and pyrosequencing experiments, L1 elements that were not present in the human reference sequence (hg18) were compared with L1 polymorphisms that have been deposited to dbRIP (Wang et al., 2006). The Venn diagram depicts the relationships between our candidates and L1 insertions that have been deposited to dbRIP but are absent from hg18. Note the minimal overlap between our data sets and the L1 polymorphisms in dbRIP, indicating that our L1-seq method has recovered many novel insertions. Our assays predominantly detected L1-Ta and pre-Ta elements (Table S5). B) Correlation of estimated and previously determined allelic frequencies. For the pyrosequencing experiment, allelic frequencies were estimated using sequence data and were compared to known allelic frequencies of polymorphic L1 elements (n = 63). C) A subset of L1 elements that appeared to be "singletons" based on their presence in a single individual (from sequence data) and absence from hg18 and dbRIP were verified by PCR in pools of diverse human DNA. The Individual lane is the individual from whom the L1 was initially sequenced. Pool lanes contain DNA from 15 diverse humans. Chimp is Coriell #NA03448A and Negative is a control PCR with no template. D) Histogram of allelic frequencies for polymorphic L1s in dbRIP (Wang et al., 2006). Allelic frequency data was extracted from dbRIP (n = 153). E) Allelic frequencies of L1’s identified by ABI Sanger dideoxy sequencing. Allelic frequencies were determined by PCR in a panel of 46 diverse individuals (n = 46). F) Allelic frequencies of L1’s identified by pyrosequencing. Allelic frequencies were determined as depicted in B (n = 650).
Figure 3. PCR validation of somatic insertions and identification of a hypomethylation signature in tumors with new L1 insertions
A) Nine somatic L1 insertions were identified by screening low frequency L1 insertions that were identified initially from our pyrosequencing data (Table S2) with PCR assays. Shown here are those verified as somatic insertions by their presence in the tumor tissue and absence from adjacent normal tissue. Negative lanes are control PCRs with no template. Primers flanking the putative insertion sites were used to amplify the pre-insertion alleles. Primers within the 3′ end of an L1 consensus sequence and downstream of the putative insertion sites were used to verify the presence of each L1. Anonymous patient identifiers are on the left. NCI-H1395 is a tumor-derived cell line matched with its normal B-lymphocyte-control (both from ATCC). The three somatic L1 insertion candidates that have been reported previously in the literature are shown in Figure S1. B) The methylation status of the 20 lung tumor and normal adjacent tissues used in the pyrosequencing experiment were analyzed by Illumina Infinium analysis. Fifty-nine probes were identified whose changes in methylation status in the tumor specimens relative to matched normal tissues were tightly correlated with somatic L1 retrotransposition (Table S6). The six tumors that were positive for somatic L1 insertions (blue) clustered together along with one of the remaining tumor samples (i.e., sample 119), which did not have a somatic L1 insertion (orange). Tumor ANCO119 (seventh from the left) clustered with the L1-positive tumors but lacked an L1 insertion, suggesting that it might also have an L1-permissive state. This signature expands to 1,928 correlated probes if tumor 119 is included in the L1 permissive class and the least correlated tumor of the six L1-positive tumors (ANCO106) is shifted to the L1-negative class (see Experimental Procedures).
Comment in
- Retrotransposition and structural variation in the human genome.
Lupski JR. Lupski JR. Cell. 2010 Jun 25;141(7):1110-2. doi: 10.1016/j.cell.2010.06.014. Cell. 2010. PMID: 20602993 - Genomic variation: Human retrotransposons keep it active.
Muers M. Muers M. Nat Rev Genet. 2010 Aug;11(8):527. doi: 10.1038/nrg2832. Epub 2010 Jul 13. Nat Rev Genet. 2010. PMID: 20628350 No abstract available. - Jumping genes.
de Souza N. de Souza N. Nat Methods. 2010 Aug;7(8):579. doi: 10.1038/nmeth0810-579. Nat Methods. 2010. PMID: 20704015 No abstract available.
Similar articles
- The NF1 gene contains hotspots for L1 endonuclease-dependent de novo insertion.
Wimmer K, Callens T, Wernstedt A, Messiaen L. Wimmer K, et al. PLoS Genet. 2011 Nov;7(11):e1002371. doi: 10.1371/journal.pgen.1002371. Epub 2011 Nov 17. PLoS Genet. 2011. PMID: 22125493 Free PMC article. - Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining.
Zingler N, Willhoeft U, Brose HP, Schoder V, Jahns T, Hanschmann KM, Morrish TA, Löwer J, Schumann GG. Zingler N, et al. Genome Res. 2005 Jun;15(6):780-9. doi: 10.1101/gr.3421505. Genome Res. 2005. PMID: 15930490 Free PMC article. - Functional impact of the human mobilome.
Babatz TD, Burns KH. Babatz TD, et al. Curr Opin Genet Dev. 2013 Jun;23(3):264-70. doi: 10.1016/j.gde.2013.02.007. Epub 2013 Mar 22. Curr Opin Genet Dev. 2013. PMID: 23523050 Free PMC article. Review. - L1 hybridization enrichment: a method for directly accessing de novo L1 insertions in the human germline.
Freeman P, Macfarlane C, Collier P, Jeffreys AJ, Badge RM. Freeman P, et al. Hum Mutat. 2011 Aug;32(8):978-88. doi: 10.1002/humu.21533. Epub 2011 Jul 12. Hum Mutat. 2011. PMID: 21560187 Free PMC article. - Emerging Opportunities to Study Mobile Element Insertions and Their Source Elements in an Expanding Universe of Sequenced Human Genomes.
Devine SE. Devine SE. Genes (Basel). 2023 Oct 10;14(10):1923. doi: 10.3390/genes14101923. Genes (Basel). 2023. PMID: 37895272 Free PMC article. Review.
Cited by
- Enhancer activation from transposable elements in extrachromosomal DNA.
Kraft K, Murphy SE, Jones MG, Shi Q, Bhargava-Shah A, Luong C, Hung KL, He BJ, Li R, Park SK, Weiser NE, Luebeck J, Bafna V, Boeke JD, Mischel PS, Boettiger AN, Chang HY. Kraft K, et al. bioRxiv [Preprint]. 2024 Sep 8:2024.09.04.611262. doi: 10.1101/2024.09.04.611262. bioRxiv. 2024. PMID: 39282372 Free PMC article. Preprint. - Comprehensive profiling of L1 retrotransposons in mouse.
Zhang X, Celic I, Mitchell H, Stuckert S, Vedula L, Han JS. Zhang X, et al. Nucleic Acids Res. 2024 May 22;52(9):5166-5178. doi: 10.1093/nar/gkae273. Nucleic Acids Res. 2024. PMID: 38647072 Free PMC article. - Regulation and function of transposable elements in cancer genomes.
Lee M Jr, Ahmad SF, Xu J. Lee M Jr, et al. Cell Mol Life Sci. 2024 Mar 31;81(1):157. doi: 10.1007/s00018-024-05195-2. Cell Mol Life Sci. 2024. PMID: 38556602 Free PMC article. Review. - Large Deletions, Cleavage of the Telomeric Repeat Sequence, and Reverse Transcriptase-Mediated DNA Damage Response Associated with Long Interspersed Element-1 ORF2p Enzymatic Activities.
Kines KJ, Sokolowski M, DeFreece C, Shareef A, deHaro DL, Belancio VP. Kines KJ, et al. Genes (Basel). 2024 Jan 23;15(2):143. doi: 10.3390/genes15020143. Genes (Basel). 2024. PMID: 38397133 Free PMC article. - Evolutionary insights from profiling LINE-1 activity at allelic resolution in a single human genome.
Yang L, Metzger GA, Padilla Del Valle R, Delgadillo Rubalcaba D, McLaughlin RN Jr. Yang L, et al. EMBO J. 2024 Jan;43(1):112-131. doi: 10.1038/s44318-023-00007-y. Epub 2023 Dec 18. EMBO J. 2024. PMID: 38177314 Free PMC article.
References
- Batzer MA, Deininger PL. Alu repeats and human genomic diversity. Nat Rev Genet. 2002;35:501–538. - PubMed
- Bestor TH. The host defense function of genomic methylation patterns. Novartis Found Symp. 1998;214:187–195. - PubMed
- Boissinot S, Chevret P, Furano AV. L1 (LINE-1) retrotransposon evolution and amplification in recent human history. Mol Biol Evol. 2000;17:915–928. - PubMed
- Boissinot S, Entezam A, Furano AV. Selection against deleterious LINE-1- containing loci in the human lineage. Mol Biol Evol. 2001;18:926–935. - PubMed
Publication types
MeSH terms
Grants and funding
- R01 HG002898/HG/NHGRI NIH HHS/United States
- R01 CA086335/CA/NCI NIH HHS/United States
- R01GM078541/GM/NIGMS NIH HHS/United States
- R01 CA132065/CA/NCI NIH HHS/United States
- R01 GM078541/GM/NIGMS NIH HHS/United States
- 1R01CA132065/CA/NCI NIH HHS/United States
- F32 HG004207/HG/NHGRI NIH HHS/United States
- P01 CA116676/CA/NCI NIH HHS/United States
- R01 CA116804/CA/NCI NIH HHS/United States
- R01CA116804/CA/NCI NIH HHS/United States
- 5P01CA116676/CA/NCI NIH HHS/United States
- R01 HG002898-06/HG/NHGRI NIH HHS/United States
- R01HG002898/HG/NHGRI NIH HHS/United States
- F32HG004207/HG/NHGRI NIH HHS/United States
- T32 GM008490/GM/NIGMS NIH HHS/United States
- R01CA086335/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources