An encyclopedia of mouse genes (original) (raw)
- Letter
- Published: February 1999
- LaDeana Hillier1,
- Tamara Kucaba1,
- Melissa Allen1,
- Robert Barstead2,
- Catherine Beck1,
- Angela Blistain1,
- Maria Bonaldo3,
- Yvette Bowers1,
- Louise Bowles1,
- Marco Cardenas1,
- Ann Chamberlain1,
- Julie Chappell1,
- Sandra Clifton1,
- Anthony Favello1,
- Steve Geisel1,
- Marilyn Gibbons1,
- Njata Harvey1,
- Francesca Hill4,
- Yolanda Jackson1,
- Sophie Kohn1,
- Greg Lennon4,5,
- Elaine Mardis1,
- John Martin1,
- LeeAnne Mila4,
- Rhonda McCann1,
- Richard Morales1,
- Deana Pape1,
- Barry Person1,
- Christa Prange4,
- Erika Ritter1,
- Marcelo Soares3,
- Rebecca Schurk1,
- Tanya Shin1,
- Michele Steptoe1,
- Timothy Swaller1,
- Brenda Theising1,
- Karen Underwood1,
- Todd Wylie1,
- Tamara Yount1,
- Richard Wilson1 &
- …
- Robert Waterston1
Nature Genetics volume 21, pages 191–194 (1999)Cite this article
- 202 Accesses
- 3 Altmetric
- Metrics details
Abstract
The laboratory mouse is the premier model system for studies of mammalian development due to the powerful classical genetic analysis1 possible (see also the Jackson Laboratory web site, http://www.jax.org/) and the ever–expanding collection of molecular tools2,3. To enhance the utility of the mouse system, we initiated a program to generate a large database of expressed sequence tags (ESTs) that can provide rapid access to genes4,5,6,7,8,9,10,11,12,13,14,15,16. Of particular significance was the possibility that cDNA libraries could be prepared from very early stages of development, a situation unrealized in human EST projects7,12. We report here the development of a comprehensive database of ESTs for the mouse. The project, initiated in March 1996, has focused on 5´ end sequences from directionally cloned, oligo–dT primed cDNA libraries. As of 23 October 1998, 352,040 sequences had been generated, annotated and deposited in dbEST, where they comprised 93% of the total ESTs available for mouse. EST data are versatile and have been applied to gene identification17, comparative sequence analysis18,19, comparative gene mapping and candidate disease gene identification20, genome sequence annotation21,22, microarray development23 and the development of gene–based map resources24.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Additional access options:
Similar content being viewed by others
References
- Brown, S.D.M. & Peters, J. Combining mutagenesis and genomics in the mouse—closing the phenotype gap. Trends Genet. 12, 433–435 (1996).
Article CAS Google Scholar - Zambrowicz, B.P. et al. Disruption and sequence identification of 2,000 genes in mouse embryonic stem cells. Nature 392, 608– 611 (1998).
Article CAS Google Scholar - Hicks, G.G. et al. Functional genomics in mice by tagged sequence mutagenesis. Nature Genet. 16, 338– 344 (1997).
Article CAS Google Scholar - Milner, R.J. & Sutcliffe, J.G. Gene expression in rat brain. Nucleic Acids Res. 11, 5497– 5520 (1983).
Article CAS Google Scholar - Putney, S.D., Herligh, W.D. & Schimmel, P. A new troponin T and cDNA clones for 13 different muscle proteins, found by shotgun sequencing. Nature 302, 718–721 (1983).
Article CAS Google Scholar - Adams, M.D. et al. Complementary DNA sequencing: expressed sequence tags and the human genome project. Science 252, 1651 –1656 (1991).
Article CAS Google Scholar - Adams, M.D. et al. Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence. Nature 377, 3–17 ( 1995).
CAS PubMed Google Scholar - McCombie, W.R. et al. Caenorhabditis elegans expressed sequence tags identify gene families and potential disease gene homologues. Nature Genet. 1, 124–131 ( 1992).
Article CAS Google Scholar - Waterston, R.H. et al. A survey of expressed genes in C. elegans. Nature Genet. 1, 114–123 (1992).
Article CAS Google Scholar - Sasaki, T. et al. Toward cataloguing all rice genes: large–scale sequencing of randomly chosen rice cDNAs from a callus cDNA library. Plant J. 6, 615–624 ( 1994).
Article CAS Google Scholar - Houlgatte, R. et al. The GenExpress index: a resource for gene discovery and the genic map of the human genome. Genome Res. 5, 272–304 (1995).
Article CAS Google Scholar - Hillier, L. et al. Generation and analysis of 280,000 human expressed sequence tags. Genome Res. 6, 807– 828 (1996).
Article CAS Google Scholar - Yamamoto, K. & Sasaki, T. Large–scale EST sequencing in rice. Plant Mol. Biol. 35, 135– 144 (1997).
Article CAS Google Scholar - Nelson, P.S. et al. An expressed–sequence–tag database of the human prostate: sequence analysis of 1168 clones. Genomics 47, 12–25 (1998).
Article CAS Google Scholar - Ajioka, J.W. et al. Gene discovery by EST sequencing in Toxoplasma gondii reveals sequences restricted to the Apicomplexa. Genome Res. 8, 18–28 (1998 ).
Article CAS Google Scholar - Sasaki, N. et al. Characterization of gene expression in mouse blastocyst using single–pass sequencing of 3995 clones. Genomics 49, 167–179 (1998).
Article CAS Google Scholar - Sutherland, H.F., Kim, U.J. & Scambler, P.J. Cloning and comparative mapping of the DiGeorge syndrome critical region in the mouse. Genomics 52, 37–43 (1998).
Article CAS Google Scholar - Makalowski, W. & Boguski, M.S. Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences. Proc. Natl Acad. Sci. USA 95, 9407–9412 (1998).
Article CAS Google Scholar - Makalowski, W., Zhang, J. & Boguski, M.S. Comparative analysis of 1,196 orthologous mouse and human full–length mRNA and protein sequences. Genome Res. 6, 846–857 ( 1996).
Article CAS Google Scholar - Scharf, J.M. et al. Identification of a candidate modifying gene for spinal muscular atrophy by comparative genomics. Nature Genet. 20, 83–86 (1998).
Article CAS Google Scholar - Bailey, L.C. Jr, Searls, D.B. & Overton, G.C. Analysis of EST–driven gene annotation in human genomic sequence. Genome Res. 8, 362– 376 (1998).
Article CAS Google Scholar - Jiang, J. & Jacob, H.J. EbEST: an automated tool using expressed sequence tags to delineate gene structure. Genome Res. 8, 268–275 (1998).
Article Google Scholar - Schena, M. et al. Microarrays: biotechnology's discovery platform for functional genomics. Trends Biotechnol. 16, 301– 306 (1998).
Article CAS Google Scholar - Schuler, G.D. et al. A gene map of the human genome. Science 274, 540–546 (1996).
Article CAS Google Scholar - Bonaldo, M.F., Lennon, G. & Soares, M.B. Normalization and subtraction: two approaches to facilitate gene discovery. Genome Res. 6, 791– 806 (1996).
Article CAS Google Scholar - Ewing, B., Hillier, L., Wendl, M. & Green, P. Basecalling of automated sequencer traces using PHRED I. Accuracy assessment. Genome Res. 8, 175–185 ( 1998).
Article CAS Google Scholar - Ewing, B. & Green, P. Basecalling of automated sequencer traces using PHRED II. Error probabilities. Genome Res. 8,186–194 (1998).
Article CAS Google Scholar - Suzuki, Y., Yoshitomo–Nakagawa, K., Maruyama, K., Suyama, A. & Sugano, S. Construction and characterization of a full length–enriched and a 5´–end enriched cDNA library. Gene 200, 149– 156 (1997).
Article CAS Google Scholar - Lennon, G., Auffray, C., Polymeropoulos, M. & Soares, M.B. The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression. Genomics 33, 151– 152 (1996).
Article CAS Google Scholar - Sonnhammer, E.L. & Durbin, R. Analysis of protein domain families in Caenorhabditis elegans. Genomics 46, 200–216 (1997).
Article CAS Google Scholar
Acknowledgements
We thank all investigators who have donated libraries for sequencing; S. Tilghman for scientific guidance; S. Chissoe and S. Gorski for comments on the manuscript and useful discussion; G. Schuler, C. Tolstoshev and others at NCBI for assistance with databases; and the staff at Washington University Genome Center for technical support. Work by C.P. and G.L. was supported by the U.S. DOE under contract W–7405–Eng–48 to LLNL. Work at Washington University was funded by a grant from Howard Hughes Medical Institute.
Author information
Authors and Affiliations
- Washington University Genome Sequencing Center, 4444 Forest Park Boulevard, St. Louis, 63108, Missouri, USA
Marco Marra, LaDeana Hillier, Tamara Kucaba, Melissa Allen, Catherine Beck, Angela Blistain, Yvette Bowers, Louise Bowles, Marco Cardenas, Ann Chamberlain, Julie Chappell, Sandra Clifton, Anthony Favello, Steve Geisel, Marilyn Gibbons, Njata Harvey, Yolanda Jackson, Sophie Kohn, Elaine Mardis, John Martin, Rhonda McCann, Richard Morales, Deana Pape, Barry Person, Erika Ritter, Rebecca Schurk, Tanya Shin, Michele Steptoe, Timothy Swaller, Brenda Theising, Karen Underwood, Todd Wylie, Tamara Yount, Richard Wilson & Robert Waterston - Oklahoma Medical Research Foundation, Program in Molecular & Cell Biology, 825 NE 13th Street, Oklahoma City, 73104, Oklahoma, USA
Robert Barstead - The University of Iowa, Unit 41, 451 Eckstein Medical Research Building, Iowa City, 52242, Iowa, USA
Maria Bonaldo & Marcelo Soares - The I.M.A.G.E. Consortium, Biology and Biotechnology Research Program, Lawrence Livermore National Laboratory, 7000 East Ave/L–452 Livermore, 94550, California, USA
Francesca Hill, Greg Lennon, LeeAnne Mila & Christa Prange - GeneLogic, Inc. Genomics, 708 Quince Orchard Road, Gaithersburg, 20878 , Maryland, USA
Greg Lennon
Authors
- Marco Marra
You can also search for this author inPubMed Google Scholar - LaDeana Hillier
You can also search for this author inPubMed Google Scholar - Tamara Kucaba
You can also search for this author inPubMed Google Scholar - Melissa Allen
You can also search for this author inPubMed Google Scholar - Robert Barstead
You can also search for this author inPubMed Google Scholar - Catherine Beck
You can also search for this author inPubMed Google Scholar - Angela Blistain
You can also search for this author inPubMed Google Scholar - Maria Bonaldo
You can also search for this author inPubMed Google Scholar - Yvette Bowers
You can also search for this author inPubMed Google Scholar - Louise Bowles
You can also search for this author inPubMed Google Scholar - Marco Cardenas
You can also search for this author inPubMed Google Scholar - Ann Chamberlain
You can also search for this author inPubMed Google Scholar - Julie Chappell
You can also search for this author inPubMed Google Scholar - Sandra Clifton
You can also search for this author inPubMed Google Scholar - Anthony Favello
You can also search for this author inPubMed Google Scholar - Steve Geisel
You can also search for this author inPubMed Google Scholar - Marilyn Gibbons
You can also search for this author inPubMed Google Scholar - Njata Harvey
You can also search for this author inPubMed Google Scholar - Francesca Hill
You can also search for this author inPubMed Google Scholar - Yolanda Jackson
You can also search for this author inPubMed Google Scholar - Sophie Kohn
You can also search for this author inPubMed Google Scholar - Greg Lennon
You can also search for this author inPubMed Google Scholar - Elaine Mardis
You can also search for this author inPubMed Google Scholar - John Martin
You can also search for this author inPubMed Google Scholar - LeeAnne Mila
You can also search for this author inPubMed Google Scholar - Rhonda McCann
You can also search for this author inPubMed Google Scholar - Richard Morales
You can also search for this author inPubMed Google Scholar - Deana Pape
You can also search for this author inPubMed Google Scholar - Barry Person
You can also search for this author inPubMed Google Scholar - Christa Prange
You can also search for this author inPubMed Google Scholar - Erika Ritter
You can also search for this author inPubMed Google Scholar - Marcelo Soares
You can also search for this author inPubMed Google Scholar - Rebecca Schurk
You can also search for this author inPubMed Google Scholar - Tanya Shin
You can also search for this author inPubMed Google Scholar - Michele Steptoe
You can also search for this author inPubMed Google Scholar - Timothy Swaller
You can also search for this author inPubMed Google Scholar - Brenda Theising
You can also search for this author inPubMed Google Scholar - Karen Underwood
You can also search for this author inPubMed Google Scholar - Todd Wylie
You can also search for this author inPubMed Google Scholar - Tamara Yount
You can also search for this author inPubMed Google Scholar - Richard Wilson
You can also search for this author inPubMed Google Scholar - Robert Waterston
You can also search for this author inPubMed Google Scholar
Corresponding author
Correspondence toMarco Marra.
Rights and permissions
About this article
Cite this article
Marra, M., Hillier, L., Kucaba, T. et al. An encyclopedia of mouse genes.Nat Genet 21, 191–194 (1999). https://doi.org/10.1038/5976
- Received: 17 November 1998
- Accepted: 21 December 1998
- Issue Date: February 1999
- DOI: https://doi.org/10.1038/5976