PubChem Substance and Compound databases
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13.
doi: 10.1093/nar/gkv951. Epub 2015 Sep 22.
Sunghwan Kim, Paul A Thiessen, Evan E Bolton, Jie Chen, Gang Fu, Asta Gindulyte, Lianyi Han, Jane He, Siqian He, Benjamin A Shoemaker, Jiyao Wang, Bo Yu, Jian Zhang, Stephen H Bryant
- PMID: 26400175
- PMCID: PMC4702940
- DOI: 10.1093/nar/gkv951
Sunghwan Kim et al. Nucleic Acids Res. 2016.
Abstract
PubChem (https://pubchem.ncbi.nlm.nih.gov) is a public repository for information on chemical substances and their biological activities, launched in 2004 as a component of the Molecular Libraries Roadmap Initiatives of the US National Institutes of Health (NIH). Over the past 11 years, PubChem has grown into a sizable system that serves as a chemical information resource for the scientific research community. PubChem consists of three inter-linked databases: Substance, Compound and BioAssay. The Substance database contains chemical information deposited by individual data contributors to PubChem, and the Compound database stores unique chemical structures extracted from the Substance database. Biological activity data of chemical substances tested in assay experiments are contained in the BioAssay database. This paper provides an overview of the PubChem Substance and Compound databases, including data sources and contents, data organization, data submission using PubChem Upload, chemical structure standardization, web-based interfaces for textual and non-textual searches, and programmatic access. It also gives a brief description of PubChem3D, a resource derived from theoretical three-dimensional structures of compounds in PubChem, as well as PubChemRDF, which provides Resource Description Framework (RDF)-formatted PubChem data for data sharing, analysis and integration with information contained in other databases.
Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
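The abstract mentions programmatic access without showing what a request looks like. As a minimal sketch, the snippet below builds a PUG-REST style URL for the properties of a single compound. The helper name `compound_property_url` is hypothetical, the property names are examples, and the URL pattern is assumed from PubChem's public PUG-REST service rather than taken from this record. No network call is made.

```python
from urllib.parse import quote

# Assumed base path of the public PUG-REST service (not stated in this record).
PUG_REST_BASE = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def compound_property_url(cid, properties, fmt="JSON"):
    """Build a PUG-REST style URL asking for properties of one CID.

    Hypothetical helper for illustration only; it formats a string
    and does not contact PubChem.
    """
    if not properties:
        raise ValueError("at least one property name is required")
    # Property names are comma-separated within a single path segment.
    prop_list = ",".join(quote(p) for p in properties)
    return f"{PUG_REST_BASE}/compound/cid/{int(cid)}/property/{prop_list}/{fmt}"


# CID 1983 is the compound shown in Figure 4 of the paper.
url = compound_property_url(1983, ["MolecularFormula", "MolecularWeight"])
print(url)
```

The resulting URL could then be fetched with any HTTP client, for example `urllib.request.urlopen`, subject to PubChem's usage policies.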
Figures
Figure 1.
Data organization in PubChem. SID, CID and AID are the identifiers for the Substance, Compound and BioAssay databases, respectively.
Figure 2.
PubChem standardization process in which unique chemical structures are extracted from the Substance database and stored in the Compound database.
Figure 3.
A snapshot of the Document Summary (DocSum) page returned from an Entrez Search for ‘tylenol’ against the PubChem Compound database.
Figure 4.
A snapshot of the top portion of the Compound Summary page for CID 1983 (Tylenol).
Figure 5.
A snapshot of the Chemical Structure Search tool.
Figure 6.
Diagram showing the high-level overview of PubChemRDF semantic relationships.
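The relationship described in Figures 1 and 2, where many deposited substances (SIDs) collapse onto one unique compound (CID), can be illustrated with a toy sketch. Everything here is an assumption made for illustration: `toy_standardize` only trims whitespace and stands in for PubChem's real, chemistry-aware standardization, and the SIDs and CIDs are made-up counters, not real identifiers.

```python
from collections import defaultdict


def toy_standardize(structure):
    """Hypothetical stand-in for structure standardization.

    It only strips surrounding whitespace; the real process is not
    described in this record and is far more involved.
    """
    return structure.strip()


def build_compounds(substances):
    """Group (sid, structure) deposits by their standardized structure.

    Returns (cid_by_structure, sids_by_cid), where CIDs are toy
    sequential integers assigned in order of first appearance.
    """
    cid_by_structure = {}
    sids_by_cid = defaultdict(list)
    for sid, structure in substances:
        key = toy_standardize(structure)
        # Reuse the existing CID for a known structure, else assign the next one.
        cid = cid_by_structure.setdefault(key, len(cid_by_structure) + 1)
        sids_by_cid[cid].append(sid)
    return cid_by_structure, dict(sids_by_cid)


# Made-up deposits: two depositors submit the same structure string.
deposits = [
    (101, "CC(=O)Nc1ccc(O)cc1"),
    (102, " CC(=O)Nc1ccc(O)cc1 "),
    (103, "CCO"),
]
cids, sids = build_compounds(deposits)
print(sids)
```

In this sketch the first two deposits share one toy CID while the third gets its own, mirroring the one-to-many link between a compound and the substances it was extracted from.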