Mechanisms for human genomic rearrangements - PubMed (original) (raw)

Mechanisms for human genomic rearrangements

Wenli Gu et al. Pathogenetics. 2008.

Abstract

Genomic rearrangements describe gross DNA changes of the size ranging from a couple of hundred base pairs, the size of an average exon, to megabases (Mb). When greater than 3 to 5 Mb, such changes are usually visible microscopically by chromosome studies. Human diseases that result from genomic rearrangements have been called genomic disorders. Three major mechanisms have been proposed for genomic rearrangements in the human genome. Non-allelic homologous recombination (NAHR) is mostly mediated by low-copy repeats (LCRs) with recombination hotspots, gene conversion and apparent minimal efficient processing segments. NAHR accounts for most of the recurrent rearrangements: those that share a common size, show clustering of breakpoints, and recur in multiple individuals. Non-recurrent rearrangements are of different sizes in each patient, but may share a smallest region of overlap whose change in copy number may result in shared clinical features among different patients. LCRs do not mediate, but may stimulate non-recurrent events. Some rare NAHRs can also be mediated by highly homologous repetitive sequences (for example, Alu, LINE); these NAHRs account for some of the non-recurrent rearrangements. Other non-recurrent rearrangements can be explained by non-homologous end-joining (NHEJ) and the Fork Stalling and Template Switching (FoSTeS) models. These mechanisms occur both in germ cells, where the rearrangements can be associated with genomic disorders, and in somatic cells in which such genomic rearrangements can cause disorders such as cancer. NAHR, NHEJ and FoSTeS probably account for the majority of genomic rearrangements in our genome and the frequency distribution of the three at a given locus may partially reflect the genomic architecture in proximity to that locus. We provide a review of the current understanding of these three models.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Experimental observations of recurrent and non-recurrent genomic rearrangements associated with genomic disorders. The long thin line signifies the genomic region undergoing genomic rearrangements. The black rectangle depicts a gene which is located in the rearranged region and can be affected by the rearrangements. The thick blue (in a) and red (in b and c) bars represent the rearrangements (duplications, deletions or inversions) with their breakpoints a. recurrent rearrangements with a common size and clustered breakpoints. Most of the recurrent rearrangements result from non-allelic homologous recombination (NAHR). The two hatched rectangles flanking the gene depict the low-copy repeats (LCRs) functioning as substrates for NAHR. The rearrangement breakpoints are clustered inside the LCRs. b. Non-recurrent rearrangements. The breakpoints of the non-recurrent rearrangements are scattered in the genomic region. Note, that all of the rearrangements share a common genomic region of overlap, the smallest region of overlap, that encompasses a gene necessary for the conveyed phenotypic trait, which enables these rearrangements to be ascertained. c. Non-recurrent rearrangements with grouping of one breakpoint. Some of the non-recurrent rearrangements have one of their breakpoints localized in one small genomic region. This grouping of breakpoints is distinct from breakpoint clustering, but like clustering, it may reflect underlying genomic architecture (for example, palindrome or cruciform) important to the rearrangement mechanism, depicted as the dotted rectangle in Figure 1c.

Figure 2

Figure 2

Genomic rearrangements (Adapted from [2] and [5]). a1 and a2 Genomic rearrangements resulting from recombination between low-copy repeats (LCRs). LCRs are depicted as black arrows with the orientation indicated by the direction of the arrowhead. Capital letters above the thin horizontal lines refer to the flanking unique sequences (for example, A). Homologues on the other strand (can be another chromatid or the homologous chromosome) are also shown (for example, a). Thin diagonal lines refer to a recombination event with the results shown by numbers 1, 2 and 3. a1 Recombination between direct repeats results in deletion and/or duplication. a2 Recombination between inverted repeats results in an inversion. b. Schematic representation of reciprocal duplications and deletions mediated by interchromosomal (left), interchromatid (middle) and intrachromatid (right) non-allelic homologous recombination (NAHR) using LCR pairs in direct orientation. Chromosomes are shown in black, with the centromere depicted by hashed lines. Yellow arrows depict LCRs. Letters adjacent to the chromatids refer to the flanking unique sequence (for example, A, a). Interchromosomal and interchromatid NAHR between LCRs in direct orientation result in reciprocal duplication and deletion, whereas intrachromatid NAHR only creates deletion. Signatures of homologous recombination include the sequence identity of the substrates (LCRs) used for NAHR, recombination hotspots within the LCRs, and evidence for gene conversion at the crossovers within the LCRs.

Figure 3

Figure 3

Genomic rearrangement mechanisms. a. (Adapted from [66]) Non-homologous end-joining (NHEJ) in vertebrates. A double-stranded DNA break (DSB) occurs and is repaired via NHEJ mechanism. The two thick lines depict two DNA strands with DSB, the thin segments in the middle represent the modifications which the ends have gone through before the final ligation. The enzyme machineries catalyzing each step are briefly summarized. They are described in details in references [65] and [70]. Note at step 3 that in order to repair ends, some addition or deletion of bases may be required, leaving behind a 'signature' of NHEJ. b. (Adapted from [82]) After the original stalling of the replication fork (dark blue and red, solid lines), the lagging strand (red, dotted line) disengages and anneals to a second fork (purple and green, solid lines) via microhomology (1), followed by (2) extension of the now 'primed' second fork and DNA synthesis (green, dotted line). After the fork disengages (3), the tethered original fork (dark blue and red, solid lines) with its lagging strand (red and green, dotted lines) could invade a third fork (gray and black, solid lines). Dotted lines represent newly synthesized DNA. Serial replication fork disengaging and lagging strand invasion could occur several times (e.g. FoSTeS x 2, FoSTeS x 3, ... etc.) before (4) resumption of replication on the original template.

Figure 4

Figure 4

One Pelizaeus-Merzbacher disease (PMD)-associated complex PLP1 rearrangement results from multiple FoSTeS events, FoSTeS × 3 (Adapted from [82]). a. Duplication junctions (vertical dotted lines) for one PMD patient are displayed relative to reference sequence, with the duplicated region boxed. Two or three base pairs of microhomology were found at the breakpoint junctions (i.e. "joint points") after amplification with outward-facing primers (F and R). b. Illustration of the order, origins, and relative orientations of junctional (pink and blue) and boundary reference sequences (orange and green) for the PMD patient. Arrowheads show direction of DNA relative to the positive strand; filled arrowheads with circled numbers below represent a FoSTeS event; open arrowhead marks resumption of replication on the original template. Proximal (centromeric) and distal (telomeric) are in relation to PLP1 (red circle).

Figure 5

Figure 5

Comparison of non-allelic homologous recombination, non-homologous end-joining and Fork Stalling and Template Switching mechanisms resulting in genomic duplication/deletion. The two thin lines in all three schemes represent the double strands of DNA. Left column: An intrachromatid non-allelic homologous recombination (NAHR) event. Rectangles in different shades of blue depict two directly orientated low-copy reapeats (LCRs) sharing high homology (97% to 98%), which align at non-allelic rather than allelic positions and the subsequent recombination causes deletion or duplication (reciprocal events but not with equivalent frequencies) of part of the two LCRs as well as the segment flanked by them. Middle column: a non-homologous end-joining (NHEJ) event. Double-strand breaks (DSBs) are created between the two sequences represented as a blue and a red rectangle with no homology between each other. The NHEJ system modifies and rejoins the two ends, resulting in the deletion of the segment between the two DSBs. Right column: a Fork Stalling and Template Switching (FoSTeS) × 2 event causing a complex deletion involving two fragments. No extensive homology is required between the substrate sequences depicted by a blue, a red and a green rectangle. However, the small open triangle heading downwards depicts a site bearing microhomology (2 to 5 base pairs) between the blue and the red sequences, and the small filled triangle heading downwards depicts another site bearing microhomology between the red and the green sequences. Different from NAHR and NHEJ, the FoSTeS event occurs during DNA replication. The replication forks from the two surrounding sequences are shown in the same color as the rectangles. The leading nascent strand at the left side (blue or red) fork invades the right side (red or green) fork via the demonstrated microhomology, and primes its own further synthesis using the right side fork as template. This event happens twice, causing deletion of the two fragments flanked by each pair of microhomology sites. Note the juxtaposition of genomic sequences from multiple distinct regions yielding complex rearrangements.

Figure 6

Figure 6

Genomic architecture is crucial for the genomic rearrangements. The low-copy repeats serve as substrates and thus are an indispensable requirement of non-allelic homologous recombination. Current data suggest that local genomic architecture, including palindromes or cruciforms, might be a stimulus for the Fork Stalling and Template Switching (FoSTeS) rearrangement as well, although these architectural elements are not necessarily directly involved in the FoSTeS rearrangement per se. This could account for the observation of breakpoint grouping with non-recurrent rearrangements at some loci.

Similar articles

Cited by

References

    1. Lupski JR, Stankiewicz P. Genomic disorders: molecular mechanisms for rearrangements and conveyed phenotypes. PLoS Genet. 2005;1:e49. doi: 10.1371/journal.pgen.0010049. - DOI - PMC - PubMed
    1. Lupski JR. Genomic disorders: structural features of the genome can lead to DNA rearrangements and human disease traits. Trends Genet. 1998;14:417–422. doi: 10.1016/S0168-9525(98)01555-8. - DOI - PubMed
    1. Lupski JR, Stankiewicz P. Genomic Disorders. Totowa, New Jersey: Humana Press; 2006.
    1. Shaw CJ, Lupski JR. Implications of human genome architecture for rearrangement-based disorders: the genomic basis of disease. Hum Mol Genet. 2004;13:R57–64. doi: 10.1093/hmg/ddh073. - DOI - PubMed
    1. Stankiewicz P, Lupski JR. Genome architecture, rearrangements and genomic disorders. Trends Genet. 2002;18:74–82. doi: 10.1016/S0168-9525(02)02592-1. - DOI - PubMed

LinkOut - more resources