The NCBI Handbook (original) (raw)
The National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM) at the U.S. National Institutes of Health, is a leader in the field of bioinformatics; it studies computational approaches to fundamental questions in biology and provides online delivery of biomedical information and bioinformatics tools. NCBI hosts approximately 40 online literature and molecular biology databases—including PubMed, PubMed Central, and GenBank—that serve millions of users around the world. The second edition of the NCBI Handbook, released in November 2013 in conjunction with the 25th anniversary of NCBI, aims to provide a comprehensive overview of the breadth of informatics resources at NCBI, and an in-depth account of the scope, data, infrastructure, processing, and access for each major database or resource. The databases and resources are organized here into seven concept areas: literature, genomes, variation, health, genes and gene expression, nucleotide, proteins, and small molecules and biological assays. Three additional categories encompass tools, infrastructure, and metadata. Each concept area begins with an overview chapter that provides a contextual framework for the resources discussed under that concept; the overview is followed by separate chapters that cover individual databases or resources.
As with the first edition, The NCBI Handbook 2nd Edition is geared towards advanced users of NCBI resources to provide an understanding of how bioinformatics resources at NCBI work. It is not a step-by-step user manual but complements NCBI user guides, tutorials, help information, and other existing documentation. It is our intent that the handbook will reflect, to the extent possible, the current state of databases, resources, and tools at NCBI, with information updated periodically.
Contents
-
- NCBI Literature Resources
Ed Sequeira.
Created: November 14, 2013.
* Introduction - NLM Catalog
Sarah Weis.
Created: February 21, 2013; Last Update: July 19, 2013.
* History and Scope
* Data Sources
* Using the NLM Catalog - PubMed: The Bibliographic Database
Kathi Canese and Sarah Weis.
Created: October 9, 2002; Last Update: March 20, 2013.
* Summary
* Data Sources
* Electronic Data Submission
* Database Management and Hardware
* Indexing
* How PubMed Queries Are Processed
* Using PubMed
* Additional PubMed Features
* Results
* How to Create Hyperlinks to PubMed
* Customer Support - PubMed Central
Chris Maloney, Ed Sequeira, Christopher Kelly, Rebecca Orris, and Jeffrey Beck.
Created: November 14, 2013; Last Update: December 5, 2013.
* Overview of PMC
* Architecture Overview
* Ingest
* Retrieval / Data Processing
* Rendering
* PMCI
* Other Tools and Utilities
* References - Bookshelf
Marilu Hoeppner, Martin Latterner, and Karanjit Siyan.
Created: March 18, 2013; Last Update: November 4, 2013.
* Scope
* History
* The Collection and Content
* Data Model
* Dataflow
* Access
* References - NLM DTD to NISO JATS Z39.96-2012
Jeffrey Beck and Laura Randall.
Created: November 14, 2013.
* Scope
* History
* The NLM DTDs
* Involvement of NISO
* The Standard and the Supporting Information
* The Future of JATS
* References - The NIH Manuscript Submission System
Abigail Acland.
Created: March 15, 2013.
* Summary
* History
* Ingest
* Ingest QA
* Conversion
* XML in NIHMS
* QA of Converted Materials
* Additional Processing
* Reporting Systems And NIHMS
- NCBI Literature Resources
-
- What’s in a Genome at NCBI?
James Ostell.
Created: November 8, 2013.
* Scope
* What is a Genome?
* History
* Resources, Tools, and Access
* References - GenBank
Ilene Mizrachi.
Created: November 12, 2013.
* Scope - Protein Clusters
Tatiana Tatusova, Leonid Zaslavsky, Boris Fedorov, Diana Haddad, Anjana Vatsan, Danso Ako-adjei, Olga Blinkova, and Hassan Ghazal.
Created: September 14, 2014.
* Scope
* History
* Data Model
* Clustering Methods
* Dataflow
* Manual Curation
* Cluster Display
* Access
* References - Eukaryotes
* Clone
Valerie Schneider.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
* Genome Reference Consortium
Valerie Schneider and Deanna Church.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
* Eukaryotic Genome Annotation Pipeline
Françoise Thibaud-Nissen, Alexander Souvorov, Terence Murphy, Michael DiCuccio, and Paul Kitts.
Created: November 14, 2013.
* Scope
* History
* Dataflow
* Access
* References - Prokaryotes
* About Prokaryotic Genome Processing and Tools
Tatiana Tatusova, Stacy Ciufo, Boris Fedorov, Kathleen O’Neill, Igor Tolstoy, and Leonid Zaslavsky.
Created: January 23, 2014.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
* Prokaryotic Genome Annotation Pipeline
Tatiana Tatusova, Mike DiCuccio, Azat Badretdin, Vyacheslav Chetvernin, Stacy Ciufo, and Wenjun Li.
Created: December 10, 2013.
* Scope
* History
* Annotation Standards
* Dataflow
* GenBank Submission Service
* RefSeq Genome Annotations
* Autonomous Protein Records
* Data Access
* Re-annotation Consortium
* References - Viruses
* About Viral and Phage Genome Processing and Tools
Yiming Bao, J. Rodney Brister, Olga Blinkova, Danso Ako-adjei, and Chetvernin Vyacheslav.
Created: March 30, 2013; Last Update: May 10, 2013.
* Virus Variation
J. Rodney Brister and Yiming Bao.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
- What’s in a Genome at NCBI?
-
- Variation Overview
Deanna Church, Stephen Sherry, Lon Phan, Minghong Ward, Melissa Landrum, and Donna Maglott.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Access
* References - The Database of Genotypes and Phenotypes (dbGaP) and PheGenI
Kimberly A Tryka, Luning Hao, Anne Sturcke, Yumi Jin, Masato Kimura, Zhen Y Wang, Lora Ziyabari, Moira Lee, and Michael Feolo.
Created: August 15, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
* Appendix – Phenotype Quality Control - The Database of Short Genetic Variation (dbSNP)
Adrienne Kitts, Lon Phan, Minghong Ward, and John Bradley Holmes.
Created: June 30, 2013; Last Update: April 3, 2014.
* Scope
* History
* Data Model
* Dataflow
* Access
* Related Tools and Studies
* References
* Appendices - dbVar
Adrienne Kitts, Deanna Church, Tim Hefferon, and Lon Phan.
Created: October 26, 2014.
* Scope
* History
* Data Model
* Dataflow
* Access
* Related Tools and Studies
* References - ClinVar
Melissa Landrum, Jennifer Lee, George Riley, Wonhee Jang, Wendy Rubinstein, Deanna Church, and Donna Maglott.
Created: November 21, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* FTP
* E-Utilities
* References
- Variation Overview
-
- MedGen
Maryam Halavi, Donna Maglott, Viatcheslav Gorelenkov, and Wendy Rubinstein.
Created: May 28, 2013; Last Update: December 11, 2018.
* Scope
* History
* Data Model
* Dataflow
* Access
* References - ClinVar
Melissa Landrum, Jennifer Lee, George Riley, Wonhee Jang, Wendy Rubinstein, Deanna Church, and Donna Maglott.
Created: November 21, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* FTP
* E-Utilities
* References
- MedGen
-
- Genes and Gene Expression
Donna Maglott, Tanya Barrett, Terence Murphy, Michael Feolo, Lukas Wagner, and Richa Agarwala.
Created: November 7, 2013.
* Scope
* History
* Data Model
* Dataflow
* References - Gene Expression Omnibus (GEO)
Tanya Barrett.
Created: May 19, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References - Gene
Donna Maglott, PhD, Kim Pruitt, PhD, Tatiana Tatusova, PhD, and Terence Murphy, PhD.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References - UniGene
Lukas Wagner and Richa Agarwala.
Created: November 14, 2013.
* Scope
* History
* Data Model
* Dataflow
* Access
* References
- Genes and Gene Expression
-
- NCBI Protein Resources
Eric Sayers.
Created: November 12, 2013; Last Update: November 21, 2013.
* Introduction
* Protein
* Structure
* Conserved Domains (CDD)
* Protein Clusters - Protein Clusters
Tatiana Tatusova, Leonid Zaslavsky, Boris Fedorov, Diana Haddad, Anjana Vatsan, Danso Ako-adjei, Olga Blinkova, and Hassan Ghazal.
Created: September 14, 2014.
* Scope
* History
* Data Model
* Clustering Methods
* Dataflow
* Manual Curation
* Cluster Display
* Access
* References
- NCBI Protein Resources
Small Molecules and Biological Assays
- Small Molecules and Biological Activities
Rana Morris.
Created: December 9, 2013.
* Scope
* History
* Dataflow
* Access
* References - NCBI PubChem BioAssay Database
Yanli Wang and Stephen H Bryant.
Created: March 14, 2014.
* Scope
* PubChem BioAssay Standard & Data Model
* Tracking BioAssay Update
* PubChem BioAssay Data Specification
* Assay Neighboring and Related BioAssays
* Public Access, Search, and FTP site
* BioAssay Tools
* BioAssay Submissions and Updates
* Summary
* References
- Small Molecules and Biological Activities