Exonic splicing enhancer motif recognized by human SC35 under splicing conditions - PubMed (original) (raw)

Exonic splicing enhancer motif recognized by human SC35 under splicing conditions

H X Liu et al. Mol Cell Biol. 2000 Feb.

Abstract

Exonic splicing enhancers (ESEs) are important cis elements required for exon inclusion. Using an in vitro functional selection and amplification procedure, we have identified a novel ESE motif recognized by the human SR protein SC35 under splicing conditions. The selected sequences are functional and specific: they promote splicing in nuclear extract or in S100 extract complemented by SC35 but not by SF2/ASF. They can also function in a different exonic context from the one used for the selection procedure. The selected sequences share one or two close matches to a short and highly degenerate octamer consensus, GRYYcSYR. A score matrix was generated from the selected sequences according to the nucleotide frequency at each position of their best match to the consensus motif. The SC35 score matrix, along with our previously reported SF2/ASF score matrix, was used to search the sequences of two well-characterized splicing substrates derived from the mouse immunoglobulin M (IgM) and human immunodeficiency virus tat genes. Multiple SC35 high-score motifs, but only two widely separated SF2/ASF motifs, were found in the IgM C4 exon, which can be spliced in S100 extract complemented by SC35. In contrast, multiple high-score motifs for both SF2/ASF and SC35 were found in a variant of the Tat T3 exon (lacking an SC35-specific silencer) whose splicing can be complemented by either SF2/ASF or SC35. The motif score matrix can help locate SC35-specific enhancers in natural exon sequences.

PubMed Disclaimer

Figures

FIG. 1

FIG. 1

Experimental procedure for functional SELEX. The structure of the IgM M1-M2 minigene pre-mRNA is shown. The characterized natural ESE (73 nt) was replaced by 20 nt of randomized sequence by overlap-extension PCR (20). A T7 promoter (black box) was built into the PCR product. In vitro-transcribed RNA was then incubated under splicing conditions. Any spliced mRNA molecules must contain a functional ESE or winner sequence, designated by the white W in a black box. The spliced mRNA molecules were purified from a denaturing polyacrylamide gel, reamplified by RT-PCR, and cloned. Individual clones were sequenced and analyzed by a sampling algorithm to define a common motif, and a subset was rebuilt into minigene templates, transcribed, and assayed for splicing in vitro.

FIG. 2

FIG. 2

Splicing of the initial RNA pool. Twenty femtomoles of wild-type IgM minigene pre-mRNA (W; lane 1), pre-mRNA with a deletion of the ESE (ED; lane 2), or a pre-mRNA pool representing 1.2 × 1010 different molecules with a randomized 20-nt segment within exon M2 were spliced in 25-μl reaction mixtures in nuclear extract (lane 3), S100 extract alone (lane 4), or S100 extract complemented by 20 pmol of recombinant SC35 (lane 5). The structures of the precursor, intermediates, and products are indicated next to the autoradiogram. The expected size of the spliced mRNA product of the ESE deletion mutant is indicated by an arrow.

FIG. 3

FIG. 3

Analysis of the SC35-selected sequences. (A) Sequence alignment and identification of a consensus motif. The consensus motif and score matrix were derived as described previously (20). The sequences were aligned on the basis of the highest score motif for each sequence. Nucleotides matching the consensus are shown as white on a black background; mismatched nucleotides are not shaded. The scores of the aligned motifs are indicated on the right. Additional motifs present in some of the sequences with a score greater than 1.62 (the mean score of the random pool) are underlined. In two cases these include a trinucleotide contributed by the 3′-flanking sequence, which is indicated with lowercase letters (cua). The consensus shown is only an approximation that indicates the most frequent nucleotide(s) at each position. The lowercase c at position 5 denotes a slight preference for this nucleotide over the other three nucleotides, which occur at similar frequencies. Y, pyrimidine; S, G or C; R, purine. The nucleotide composition of the selected pool is shown at the bottom. The nucleotide composition of the initial RNA pool was as follows: A, 21%; G, 39%; C, 19%, and U, 21%. (B) Representation of the SC35 ESE score matrix and consensus motif. The diagram shows the frequency of each nucleotide at each position of the octamer consensus, adjusted for the compositional bias of the initial pool (20). The height of each letter is proportional to its frequency, and the nucleotides are shown from top to bottom in decreasing order of frequency. This method of displaying nucleotide frequencies is based on references (3) and (13).

FIG. 4

FIG. 4

Activity of the selected ESE motifs. SC35 SELEX winners were rebuilt into the IgM M1-M2 minigene by overlap-extension PCR (as described in the legend for Fig. 1), and transcripts corresponding to individual winners were spliced in HeLa nuclear extract (lanes 1, 4, 7, 10, and 13), in S100 extract alone (lanes 2, 5, 8, 11, and 14), or in S100 extract complemented by recombinant SC35 (lanes 3, 6, 9, 12, and 15).

FIG. 5

FIG. 5

Specificity of the selected ESE motifs. (A) Splicing of SC35-selected ESEs was analyzed in nuclear extract (lanes 1, 5, 9, 13, and 17), S100 extract alone (lanes 2, 6, 10, 14, and 17), or S100 extract plus recombinant SC35 (lanes 3, 7, 11, 15, and 19) or recombinant SF2/ASF (lanes 4, 8, 12, 16, and 20). (B) Splicing of SF2/ASF-selected ESEs was examined in nuclear extract (lanes 1, 5, 9, 13, and 17), S100 extract alone (lanes 2, 6, 10, 14, and 18), or in S100 extract plus SC35 (lanes 3, 7, 11, 15, and 19) or SF2/ASF (lanes 4, 8, 12, 16, and 20).

FIG. 6

FIG. 6

Comparison of SC35 winner sequences and SC35 ESE motif in two different exonic contexts. (A) The 20-nt D2 winner sequence (see Fig. 3), a variant of D2 with two nucleotide changes to introduce a maximum-score consensus (D2C), and a 19-nt SC35 SELEX winner sequence (6-24) described in a previous study (29) were inserted into the IgM M1-M2 minigene in place of the natural ESE in exon M2, and the corresponding transcripts were spliced in nuclear extract (lanes 2 to 4). The control pre-mRNA lacking an ESE (ED) is shown in lane 1. (B) The same D2, D2C, and 6-24 sequences were tested in the context of an IgM C3-C4 minigene (26). The Ca pre-mRNA includes the first 38 nt of the C4 exon (lane 1). In the remaining pre-mRNAs, this segment of the C4 exon is followed by the next 38 nt of the C4 exon, which comprise a natural SC35-dependent ESE (CaCb; lane 2) or it is followed by the D2, D2C, or 6-24 sequence (lanes 3 to 5). The sequences of the relevant portions of the 3′ exons are shown below each panel. The two nucleotide changes in D2C, compared to D2, are underlined. The mobilities of the pre-mRNAs, mRNAs, and 5′ exon intermediates are indicated next to each autoradiogram.

FIG. 7

FIG. 7

Correlation between predicted ESE motifs in natural genes and SR protein specificity of the pre-mRNAs. Score matrices derived for SC35 and SF2/ASF were used to search the sequences of natural genes. The resulting scores (y axis) were plotted against the nucleotide position along each exon (x axis). The vertical bars indicate the first nucleotide of each motif. SC35 high-score motifs are shown in green, and SF2/ASF ones are shown in blue. Since different score matrices were used for each protein, the numerical scores of the two different proteins cannot be compared. (A) High-score motifs in the IgM gene M2 exon. The characterized ESE is indicated by the horizontal magenta bar, and the yellow bar indicates the region comprising a recently described silencer (15, 39). (B) High-score motifs in the IgM gene C4 exon. (C) High-score motifs in the tat gene T3 exon. The horizontal yellow bar indicates the position of the SC35-specific silencer.

Similar articles

Cited by

References

    1. Berget S M. Exon recognition in vertebrate splicing. J Biol Chem. 1995;270:2411–2414. - PubMed
    1. Black D L. Finding splice sites within a wilderness of RNA. RNA. 1995;1:763–771. - PMC - PubMed
    1. Burge C B, Tuschl T, Sharp P A. Splicing of precursors to mRNAs by the spliceosomes. In: Gesteland R F, Cech T R, Atkins J F, editors. The RNA world. 2nd ed. Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory Press; 1999. pp. 525–559.
    1. Burset M, Guigo R. Evaluation of gene structure prediction programs. Genomics. 1996;34:353–367. - PubMed
    1. Cáceres J F, Stamm S, Helfman D M, Krainer A R. Regulation of alternative splicing in vivo by overexpression of antagonistic splicing factors. Science. 1994;265:1706–1709. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources