The insertional history of an active family of L1 retrotransposons in humans - PubMed (original) (raw)
. 2004 Jul;14(7):1221-31.
doi: 10.1101/gr.2326704. Epub 2004 Jun 14.
Affiliations
- PMID: 15197167
- PMCID: PMC442137
- DOI: 10.1101/gr.2326704
The insertional history of an active family of L1 retrotransposons in humans
Stéphane Boissinot et al. Genome Res. 2004 Jul.
Abstract
As humans contain a currently active L1 (LINE-1) non-LTR retrotransposon family (Ta-1), the human genome database likely provides only a partial picture of Ta-1-generated diversity. We used a non-biased method to clone Ta-1 retrotransposon-containing loci from representatives of four ethnic populations. We obtained 277 distinct Ta-1 loci and identified an additional 67 loci in the human genome database. This collection represents approximately 90% of the Ta-1 population in the individuals examined and is thus more representative of the insertional history of Ta-1 than the human genome database, which lacked approximately 40% of our cloned Ta-1 elements. As both polymorphic and fixed Ta-1 elements are as abundant in the GC-poor genomic regions as in ancestral L1 elements, the enrichment of L1 elements in GC-poor areas is likely due to insertional bias rather than selection. Although the chromosomal distribution of Ta-1 inserts is generally a function of chromosomal length and gene density, chromosome 4 significantly deviates from this pattern and has been much more hospitable to Ta-1 insertions than any other chromosome. Also, the intra-chromosomal distribution of Ta-1 elements is not uniform. Ta-1 elements tend to cluster, and the maximal gaps between Ta-1 inserts are larger than would be expected from a model of uniform random insertion.
Copyright 2004 Cold Spring Harbor Laboratory Press ISSN
Figures
Figure 1
Structure of a typical full-length human L1 element. The 5′ untranslated region (UTR) has a regulatory function; open reading frame 1 (ORF I) encodes an RNA-binding protein; ORF II encodes the L1 replicase and contains an endonuclease (EN) and a reverse transcriptase domain (RT), and the 3′ UTR that contains a conserved G-rich polypurine motif. Genomic copies of L1 usually end in an A-rich stretch (open rectangle, see Moran and Gilbert 2002). The partial alignment of Ta-1, Ta-0, L1PA2, and L1PA5 consensus sequences shows the positions of oligonucleotide 1, which is specific for Ta-1, and of oligonucleotide 2, which includes the ACA trinucleotide, diagnostic of the Ta family (see Methods). The numbers indicate the position of the sequence on alignment ALIGN-000165 at the EMBL_ALIGN database (Boissinot and Furano 2001).
Figure 2
Frequency distribution of Ta-1-containing alleles. The extent of polymorphism of the indicated Ta-1-containing inserts in 141 individuals was determined as described in the Methods. All DNAs for these studies were obtained from the Coriell Institute for Medical Research and included individuals from the following populations: Chinese, Japanese, Druze, Biaka and Mbuti Pygmy, Melanesian, Atayal, Ami, Caucasian American, and African American.
Figure 3
Size distribution of autosomal L1 elements. Elements are grouped in 1000-bp intervals except for the smaller class that shows the frequency of elements 500–1000 bp long. The number of elements analyzed was 183 Ta-1 elements (those whose size was known from the human genome data base), 282 L1PA2 elements, and 305 L1PA5 elements.
Figure 4
Frequency of L1 elements in different GC fractions of the human genome. The % GC was calculated over 20-kb windows. The bins from left to right correspond to an increasing 2% GC fraction. The family-specific L1 curves were built using the DNA flanks of 299 Ta-1 (squares), 300 L1PA2 (triangles), and 324 L1PA5 (diamonds) inserts. The total genome curve (heavy line) was built using Tables “gcPercent” at
for all chromosomes.
Figure 5
Insertion sites of Ta-1 elements in the human genome. Ta-1 integration sites are shown as tick marks above each chromosome. Tick marks with thinner lines impinging directly on the chromosome diagrams are those identified in the human genome database. Tick marks with heavier lines offset from the chromosomes are the ones that we cloned. Solid circles indicate polymorphic inserts, open circles indicate fixed ones, and tick marks without either are indeterminate (not amenable to PCR, see Methods and Table 1). The number in parentheses indicates the number of Ta-1 insertion sites on each chromosome. Chromosome 15 only shows the positions of 4 of the 5 Ta-1 elements on this chromosome because one of them was located on an unassigned segment. The shaded boxes indicate the number of known genes per 5 Mb segments. The unadjusted _k_-specific P values for some apparent clusters are given.
Figure 6
Chromosomal distribution of Ta-1 elements. (A) The number of Ta-1 elements per chromosome is positively correlated to the chromosome physical length (P < 0.0001) in megabases (Mb, gaps removed). The solid line corresponds to the line expected if Ta-1 elements are distributed proportionally to the length of the chromosomes. The 95% confidence limits (dashed lines) are calculated as ±1.96 times the square root of the predicted number of sites to account for Poisson counting error. (B) The number of Ta-1 elements per chromosome (Ta-1/Mb) is negatively correlated to the gene density (Genes/Mb); R = –0.49, P < 0.004.
Figure 7
Distribution of P values. The uncorrected P values for the minimal _k_-span (k = 6) and maximal gap size for all chromosomes are shown. Similar results were found for k = 7 and 8 (not shown). The rectangles show the expected distribution of P values for a random uniform insertion of inserts.
Similar articles
- L1 (LINE-1) retrotransposon evolution and amplification in recent human history.
Boissinot S, Chevret P, Furano AV. Boissinot S, et al. Mol Biol Evol. 2000 Jun;17(6):915-28. doi: 10.1093/oxfordjournals.molbev.a026372. Mol Biol Evol. 2000. PMID: 10833198 - Following the LINEs: an analysis of primate genomic variation at human-specific LINE-1 insertion sites.
Vincent BJ, Myers JS, Ho HJ, Kilroy GE, Walker JA, Watkins WS, Jorde LB, Batzer MA. Vincent BJ, et al. Mol Biol Evol. 2003 Aug;20(8):1338-48. doi: 10.1093/molbev/msg146. Epub 2003 May 30. Mol Biol Evol. 2003. PMID: 12777507 - Retroelement distributions in the human genome: variations associated with age and proximity to genes.
Medstrand P, van de Lagemaat LN, Mager DL. Medstrand P, et al. Genome Res. 2002 Oct;12(10):1483-95. doi: 10.1101/gr.388902. Genome Res. 2002. PMID: 12368240 Free PMC article. - [Genomic instability associated with human LINE-1 rétrotransposition].
Gilbert N, Doucet AJ, Bucheton A. Gilbert N, et al. J Soc Biol. 2004;198(4):419-24. J Soc Biol. 2004. PMID: 15969349 Review. French. - Gene-containing regions of wheat and the other grass genomes.
Sandhu D, Gill KS. Sandhu D, et al. Plant Physiol. 2002 Mar;128(3):803-11. doi: 10.1104/pp.010745. Plant Physiol. 2002. PMID: 11891237 Free PMC article. Review.
Cited by
- Targeted Nanopore Resequencing and Methylation Analysis of LINE-1 Retrotransposons.
Sarkar A, Lanciano S, Cristofari G. Sarkar A, et al. Methods Mol Biol. 2023;2607:173-198. doi: 10.1007/978-1-0716-2883-6_10. Methods Mol Biol. 2023. PMID: 36449164 - The coevolution between APOBEC3 and retrotransposons in primates.
Modenini G, Abondio P, Boattini A. Modenini G, et al. Mob DNA. 2022 Nov 29;13(1):27. doi: 10.1186/s13100-022-00283-1. Mob DNA. 2022. PMID: 36443831 Free PMC article. Review. - Nef Suppresses LINE-1 Retrotransposition through Two Distinct Mechanisms.
Wang Y, Zhao K, Zhao Y, Zhao Z, Wang S, Du J. Wang Y, et al. J Virol. 2022 Oct 26;96(20):e0114822. doi: 10.1128/jvi.01148-22. Epub 2022 Oct 5. J Virol. 2022. PMID: 36197106 Free PMC article. - Long-read assembly of a Great Dane genome highlights the contribution of GC-rich sequence and mobile elements to canine genomes.
Halo JV, Pendleton AL, Shen F, Doucet AJ, Derrien T, Hitte C, Kirby LE, Myers B, Sliwerska E, Emery S, Moran JV, Boyko AR, Kidd JM. Halo JV, et al. Proc Natl Acad Sci U S A. 2021 Mar 16;118(11):e2016274118. doi: 10.1073/pnas.2016274118. Proc Natl Acad Sci U S A. 2021. PMID: 33836575 Free PMC article. - Striking heterogeneity of somatic L1 retrotransposition in single normal and cancerous gastrointestinal cells.
Yamaguchi K, Soares AO, Goff LA, Talasila A, Choi JA, Ivenitsky D, Karma S, Brophy B, Devine SE, Meltzer SJ, Kazazian HH Jr. Yamaguchi K, et al. Proc Natl Acad Sci U S A. 2020 Dec 22;117(51):32215-32222. doi: 10.1073/pnas.2019450117. Epub 2020 Dec 4. Proc Natl Acad Sci U S A. 2020. PMID: 33277430 Free PMC article.
References
- Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403–410. - PubMed
- Boissinot, S. and Furano, A.V. 2001. Adaptive evolution in LINE-1 retrotransposons. Mol. Biol. Evol 18: 2186–2194. - PubMed
- Boissinot, S., Chevret, P., and Furano, A.V. 2000. L1 (LINE-1) retrotransposon evolution and amplification in recent human history. Mol. Biol. Evol. 17: 915–928. - PubMed
- Boissinot, S., Entezam, A., and Furano, A.V. 2001. Selection against deleterious LINE-1-containing loci in the human lineage. Mol. Biol. Evol. 18: 926–935. - PubMed
WEB SITE REFERENCES
- http://www.repeatmasker.org; RepeatMasker.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous