MEME: discovering and analyzing DNA and protein sequence motifs - PubMed (original) (raw)

MEME: discovering and analyzing DNA and protein sequence motifs

Timothy L Bailey et al. Nucleic Acids Res. 2006.

Abstract

MEME (Multiple EM for Motif Elicitation) is one of the most widely used tools for searching for novel 'signals' in sets of biological sequences. Applications include the discovery of new transcription factor binding sites and protein domains. MEME works by searching for repeated, ungapped sequence patterns that occur in the DNA or protein sequences provided by the user. Users can perform MEME searches via the web server hosted by the National Biomedical Computation Resource (http://meme.nbcr.net) and several mirror sites. Through the same web server, users can also access the Motif Alignment and Search Tool to search sequence databases for matches to motifs encoded in several popular formats. By clicking on buttons in the MEME output, users can compare the motifs discovered in their input sequences with databases of known motifs, search sequence databases for matches to the motifs and display the motifs in various formats. This article describes the freely accessible web server and its architecture, and discusses ways to use MEME effectively to find new sequence patterns in biological sequences and analyze their significance.

PubMed Disclaimer

Figures

Figure 1

Sample MEME output.This portion of an MEME HTML output form shows a protein motif that MEME has discovered in the input sequences. The sites identified as belonging to the motif are indicated, and above them is the ‘consensus’ of the motif and a color-coded bar graph showing the conservation of each position in the motif. Some of the hyperlinked buttons that allow the motif to be viewed and analyzed in other ways can be seen at the bottom of the screen shot.

Figure 2

LOGO of protein motif. LOGOS are a visualization tool for motifs. The height of a letter indicates its relative frequency at the given position (_x_-axis) in the motif.

Figure 3

Usage of MEME at the NBCR web server. The plot shows the number of different users submitting jobs to the NBCR MEME web server each month since December 2000. Usage figures for March 2006 include up to March 20 only.

Cited by

Genome-wide analysis and expression profile of the bZIP gene family in Neopyropia yezoensis.
Zhu X, Gao T, Bian K, Meng C, Tang X, Mao Y. Zhu X, et al. Front Plant Sci. 2024 Oct 21;15:1461922. doi: 10.3389/fpls.2024.1461922. eCollection 2024. Front Plant Sci. 2024. PMID: 39498397 Free PMC article.
Genome-Wide Identification of the bHLH Gene Family in Rhododendron delavayi and Its Expression Analysis in Different Floral Tissues.
Dong J, Wu YW, Dong Y, Pu R, Li XJ, Lyu YM, Bai T, Zhang JL. Dong J, et al. Genes (Basel). 2024 Sep 26;15(10):1256. doi: 10.3390/genes15101256. Genes (Basel). 2024. PMID: 39457380 Free PMC article.
Predicting the role of β-GAL genes in bean under abiotic stress and genome-wide characterization of β-GAL gene family members.
Buttanri A, Kasapoğlu AG, Öner BM, Aygören AS, Muslu S, İlhan E, Yildirim E, Aydin M. Buttanri A, et al. Protoplasma. 2024 Oct 23. doi: 10.1007/s00709-024-01998-z. Online ahead of print. Protoplasma. 2024. PMID: 39441340
Genome-wide exploration of the CONSTANS-like (COL) gene family and its potential role in regulating plant flowering time in foxtail millet (Setaria italica).
Jiang L, Li G, Shao C, Gao K, Ma N, Rao J, Miao X. Jiang L, et al. Sci Rep. 2024 Oct 18;14(1):24518. doi: 10.1038/s41598-024-74724-7. Sci Rep. 2024. PMID: 39424865 Free PMC article.
Genome-Wide Identification, Phylogenetic, and Expression Analysis of Jasmonate ZIM-Domain Gene Family in Medicago Sativa L.
Cui J, Jiang X, Li Y, Zhang L, Zhang Y, Wang X, He F, Li M, Zhang T, Kang J. Cui J, et al. Int J Mol Sci. 2024 Oct 1;25(19):10589. doi: 10.3390/ijms251910589. Int J Mol Sci. 2024. PMID: 39408917 Free PMC article.

References

1. Bailey T.L., Elkan C. Unsupervised Learning of Multiple Motifs In Biopolymers Using EM. Mach. Learn. 1995;21:51–80.
1. Bailey T.L., Elkan C. The value of prior knowledge in discovering motifs with MEME. In: Rawlings C., Clark D., Altman R., Hunter L., Lengauer T., Wodak S., editors. Proceedings of the Third International Conference on Intelligent Systems for Molecular biology, July; Menlo Park, CA: AAAI Press; 1995. pp. 21–29. - PubMed
1. Lyons T.J., Gasch A.P., Alex Gaither L., Botstein D., Brown P.O., Eide D.J. Genome-wide characterization of the Zap1p zinc-responsive regulon in yeast. Proc. Natl Acad. Sci. USA. 2000;97:7957–7962. - PMC - PubMed
1. Fang J., Haasl R.J., Dong Y., Lushington G.H. Discover protein sequence signatures from protein-protein interaction data. BMC Bioinformatics. 2005;6:1–8. - PMC - PubMed
1. Bailey T.L., Elkan C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In: Altman R.B., Brutlag D.L., Karp P.D., Lathrop R.H., Searls D.B., editors. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, August; Menlo Park, CA: AAAI Press; 1994. pp. 28–36. - PubMed

MEME: discovering and analyzing DNA and protein sequence motifs - PubMed (original) (raw)