MEME: discovering and analyzing DNA and protein sequence motifs - PubMed (original) (raw)

MEME: discovering and analyzing DNA and protein sequence motifs

Timothy L Bailey et al. Nucleic Acids Res. 2006.

Abstract

MEME (Multiple EM for Motif Elicitation) is one of the most widely used tools for searching for novel 'signals' in sets of biological sequences. Applications include the discovery of new transcription factor binding sites and protein domains. MEME works by searching for repeated, ungapped sequence patterns that occur in the DNA or protein sequences provided by the user. Users can perform MEME searches via the web server hosted by the National Biomedical Computation Resource (http://meme.nbcr.net) and several mirror sites. Through the same web server, users can also access the Motif Alignment and Search Tool to search sequence databases for matches to motifs encoded in several popular formats. By clicking on buttons in the MEME output, users can compare the motifs discovered in their input sequences with databases of known motifs, search sequence databases for matches to the motifs and display the motifs in various formats. This article describes the freely accessible web server and its architecture, and discusses ways to use MEME effectively to find new sequence patterns in biological sequences and analyze their significance.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Sample MEME output.This portion of an MEME HTML output form shows a protein motif that MEME has discovered in the input sequences. The sites identified as belonging to the motif are indicated, and above them is the ‘consensus’ of the motif and a color-coded bar graph showing the conservation of each position in the motif. Some of the hyperlinked buttons that allow the motif to be viewed and analyzed in other ways can be seen at the bottom of the screen shot.

Figure 2

Figure 2

LOGO of protein motif. LOGOS are a visualization tool for motifs. The height of a letter indicates its relative frequency at the given position (_x_-axis) in the motif.

Figure 3

Figure 3

Usage of MEME at the NBCR web server. The plot shows the number of different users submitting jobs to the NBCR MEME web server each month since December 2000. Usage figures for March 2006 include up to March 20 only.

Similar articles

Cited by

References

    1. Bailey T.L., Elkan C. Unsupervised Learning of Multiple Motifs In Biopolymers Using EM. Mach. Learn. 1995;21:51–80.
    1. Bailey T.L., Elkan C. The value of prior knowledge in discovering motifs with MEME. In: Rawlings C., Clark D., Altman R., Hunter L., Lengauer T., Wodak S., editors. Proceedings of the Third International Conference on Intelligent Systems for Molecular biology, July; Menlo Park, CA: AAAI Press; 1995. pp. 21–29. - PubMed
    1. Lyons T.J., Gasch A.P., Alex Gaither L., Botstein D., Brown P.O., Eide D.J. Genome-wide characterization of the Zap1p zinc-responsive regulon in yeast. Proc. Natl Acad. Sci. USA. 2000;97:7957–7962. - PMC - PubMed
    1. Fang J., Haasl R.J., Dong Y., Lushington G.H. Discover protein sequence signatures from protein-protein interaction data. BMC Bioinformatics. 2005;6:1–8. - PMC - PubMed
    1. Bailey T.L., Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In: Altman R.B., Brutlag D.L., Karp P.D., Lathrop R.H., Searls D.B., editors. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, August; Menlo Park, CA: AAAI Press; 1994. pp. 28–36. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources