A map for sequence analysis of the Arabidopsis thaliana genome (original) (raw)

Nature Genetics volume 22, pages 265–270 (1999)Cite this article

Abstract

Arabidopsis thaliana has emerged as a model system for studies of plant genetics and development, and its genome has been targeted for sequencing1 by an international consortium (the Arabidopsis Genome Initiative; http://genome-www.stanford.edu/Arabidopsis/agi.html). To support the genome-sequencing effort, we fingerprinted more than 20,000 BACs (ref. 2) from two high-quality publicly available libraries3,4,5, generating an estimated 17-fold redundant coverage of the genome, and used the fingerprints to nucleate assembly of the data by computer. Subsequent manual revision of the assemblies resulted in the incorporation of 19,661 fingerprinted BACs into 169 ordered sets of overlapping clones ('contigs'), each containing at least 3 clones. These contigs are ideal for parallel selection of BACs for large-scale sequencing and have supported the generation of more than 5.8 Mb of finished genome sequence submitted to GenBank; analysis of the sequence has confirmed the integrity of contigs constructed using this fingerprint data. Placement of contigs onto chromosomes can now be performed, and is being pursued by groups involved in both sequencing and positional cloning studies. To our knowledge, these data provide the first example of whole-genome random BAC fingerprint analysis of a eucaryote, and have provided a model essential to efforts aimed at generating similar databases of fingerprint contigs to support sequencing of other complex genomes, including that of human.

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 12 print issues and online access

$209.00 per year

only $17.42 per issue

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Additional access options:

Similar content being viewed by others

References

  1. Bevan, M. et al. Objective: the complete sequence of a plant genome. Plant Cell 9, 476–478 ( 1997).
    Article CAS Google Scholar
  2. Shizuya, H. et al. Cloning and stable maintenance of 300-kilobase-pair fragments of human DNA in Escherichia coli using an F-factor-based vector. Proc. Natl Acad. Sci. USA 89, 8794– 8797 (1992).
    Article CAS Google Scholar
  3. Mozo, T., Fischer, S., Shizuya, H. & Altmann, T. Construction and characterization of the IGF Arabidopsis BAC library. Mol. Gen. Genet. 258, 562–570 (1998).
    Article CAS Google Scholar
  4. Mozo, T., Fischer, S., Meier-Ewert, S., Lehrach, H. & Altmann, T. Use of the IGF BAC library for physical mapping of the Arabidopsis thaliana genome. Plant J. 16, 377–384 (1998).
    Article CAS Google Scholar
  5. Choi, S.D., Creelman, R., Mullet, J. & Wing, R.A. Construction and characterization of a bacterial artificial chromosome library from Arabidopsis thaliana. Weeds World 2, 17– 20 (1995).
    CAS Google Scholar
  6. Marra, M.A. et al. High throughput fingerprint analysis of large-insert clones. Genome Res. 7, 1072–1084 (1997).
    Article CAS Google Scholar
  7. Coulson, A., Sulston, J., Brenner, S. & Karn, J. Toward a physical map of the genome of the nematode Caenorhabditis elegans. Proc. Natl Acad. Sci. USA 83, 7821– 7825 (1986).
    Article CAS Google Scholar
  8. Gregory, S.G., Howell, G.R. & Bentley, D.R. Genome mapping by fluorescent fingerprinting. Genome Res. 7, 1162–1168 (1997).
    Article CAS Google Scholar
  9. Wilson, R.K. & Mardis, E.R. in Genome Analysis: A Laboratory Manual (eds Birren, B., Green, E.D., Klapholz, S., Myers, R.M. & Roskams, J.) 397–454 (Cold Spring Harbor Laboratory Press, Plainview, 1997).
    Google Scholar
  10. Olson, M.V. et al. Random-clone strategy for genomic restriction mapping in yeast. Proc. Natl Acad. Sci. USA 83, 7826– 7830 (1986).
    Article CAS Google Scholar
  11. Goffeau, A. et al. Life with 6000 genes. Science 274, 563–567 (1996).
    Article Google Scholar
  12. Mewes, H.W. et al. Overview of the yeast genome. Nature 387 (suppl.), 7–65 (1997 ).
    Article Google Scholar
  13. The C. elegans Genome Sequencing Consortium. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282, 2012– 2018 (1998).
  14. Riles, L. et al. Physical maps of the six smallest chromosomes of Saccharomyces cerevisiae at a resolution of 2.6 kilobase pairs. Genetics 134, 81–150 ( 1993).
    CAS PubMed PubMed Central Google Scholar
  15. Sulston, J. et al. Software for genome mapping by fingerprinting techniques. Comput. Appl. Biosci. 4, 125– 132 (1988).
    CAS PubMed Google Scholar
  16. Sulston, J., Mallett, F., Durbin, R. & Horsnell, T. Image analysis of restriction enzyme fingerprint autoradiograms. Comput. Appl. Biosci. 5, 101–106 ( 1989).
    CAS PubMed Google Scholar
  17. Soderlund, C., Longden I. & Mott, R. FPC: a system for building contigs from restriction fingerprinted clones. Comput. Appl. Biosci. 13, 523–535 (1997).
    CAS PubMed Google Scholar
  18. Parsons, J.D. Miropeats: graphical DNA sequence comparisons. Comput. Appl. Biosci. 11, 615–619 ( 1995).
    CAS PubMed Google Scholar
  19. Staden, R. The Staden sequence analysis package. Mol. Biotechnol. 5, 233–241 (1996).
    Article CAS Google Scholar
  20. Wong, G.K., Yu, J., Thayer, E.C. & Olson, M.V. Multiple-complete-digest restriction fragment mapping: generating sequence-ready maps for large-scale DNA sequencing. Proc. Natl Acad. Sci. USA 13, 5225–5230 (1997).
    Article Google Scholar

Download references

Acknowledgements

We thank T. Altmann and R. Wing for providing the scientific community access to their high-quality A. thaliana BAC libraries; E. Mardis, S. Chissoe, W. Barbazuk and S. Gorski for comments and discussion; D. Panussis for design and engineering expertise; M. Holman for maintaining data in web-accessible formats; C. McCabe, N. Florence, D. Scheer, S. Sasso, L. Belaygorod and C. Franklin for expert assistance in fingerprinting; A. Favello for laboratory management; staff at Washington University Genome Sequencing Center for technical support; and D. Preuss, G. Copenhaver, T. Kuromori and many others for providing contig anchoring information. Funding for this work was provided by Monsanto Company.

Author information

Authors and Affiliations

  1. Washington University Genome Sequencing Center, 4444 Forest Park Boulevard, St Louis, 63108, Missouri, USA
    Marco Marra, Tamara Kucaba, Mandeep Sekhon, LaDeana Hillier, Asif Chinwalla, Jye'mon Crockett, Jacqueline Fedele, Heather Grover, Christopher Gund, Ken McDonald, John McPherson, Nancy Mudd, Jacqueline Schein, Ryan Seim, Proteon Shelby, Robert Waterston & Richard Wilson
  2. Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, 11724 , New York, USA
    Robert Martienssen, W. Richard McCombie & Larry Parnell

Authors

  1. Marco Marra
    You can also search for this author inPubMed Google Scholar
  2. Tamara Kucaba
    You can also search for this author inPubMed Google Scholar
  3. Mandeep Sekhon
    You can also search for this author inPubMed Google Scholar
  4. LaDeana Hillier
    You can also search for this author inPubMed Google Scholar
  5. Robert Martienssen
    You can also search for this author inPubMed Google Scholar
  6. Asif Chinwalla
    You can also search for this author inPubMed Google Scholar
  7. Jye'mon Crockett
    You can also search for this author inPubMed Google Scholar
  8. Jacqueline Fedele
    You can also search for this author inPubMed Google Scholar
  9. Heather Grover
    You can also search for this author inPubMed Google Scholar
  10. Christopher Gund
    You can also search for this author inPubMed Google Scholar
  11. W. Richard McCombie
    You can also search for this author inPubMed Google Scholar
  12. Ken McDonald
    You can also search for this author inPubMed Google Scholar
  13. John McPherson
    You can also search for this author inPubMed Google Scholar
  14. Nancy Mudd
    You can also search for this author inPubMed Google Scholar
  15. Larry Parnell
    You can also search for this author inPubMed Google Scholar
  16. Jacqueline Schein
    You can also search for this author inPubMed Google Scholar
  17. Ryan Seim
    You can also search for this author inPubMed Google Scholar
  18. Proteon Shelby
    You can also search for this author inPubMed Google Scholar
  19. Robert Waterston
    You can also search for this author inPubMed Google Scholar
  20. Richard Wilson
    You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toMarco Marra.

Rights and permissions

About this article

Cite this article

Marra, M., Kucaba, T., Sekhon, M. et al. A map for sequence analysis of the Arabidopsis thaliana genome .Nat Genet 22, 265–270 (1999). https://doi.org/10.1038/10327

Download citation

This article is cited by

Associated content