VFDB 2016: hierarchical and refined dataset for big data analysis--10 years on - PubMed (original) (raw)

VFDB 2016: hierarchical and refined dataset for big data analysis--10 years on

Lihong Chen et al. Nucleic Acids Res. 2016.

Abstract

The virulence factor database (VFDB, http://www.mgc.ac.cn/VFs/) is dedicated to providing up-to-date knowledge of virulence factors (VFs) of various bacterial pathogens. Since its inception the VFDB has served as a comprehensive repository of bacterial VFs for over a decade. The exponential growth in the amount of biological data is challenging to the current database in regard to big data analysis. We recently improved two aspects of the infrastructural dataset of VFDB: (i) removed the redundancy introduced by previous releases and generated two hierarchical datasets--one core dataset of experimentally verified VFs only and another full dataset including all known and predicted VFs and (ii) refined the gene annotation of the core dataset with controlled vocabularies. Our efforts enhanced the data quality of the VFDB and promoted the usability of the database in the big data era for the bioinformatic mining of the explosively growing data regarding bacterial VFs.

© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

PubMed Disclaimer

Figures

Figure 1.

Figure 1.

Schematic diagram of the relationship between previous releases of virulence factor database (VFDB). R1 forms the base and core of the database, whereas R2 and R3 provided a comparative genomics platform and expanded the data contents in terms of genomes and pathogens (illustrated by the horizontal bars on the top), respectively.

Similar articles

Cited by

References

    1. van Oosten M., Hahn M., Crane L.M., Pleijhuis R.G., Francis K.P., van Dijl J.M., van Dam G.M. Targeted imaging of bacterial infections: advances, hurdles and hopes. FEMS Microbiol. Rev. 2015;39:892–916. - PubMed
    1. Unala C.M., Steinert M. Microbial peptidyl-prolyl cis/trans isomerases (PPIases): virulence factors and potential alternative drug targets. Microbiol. Mol. Biol. Rev. 2014;78:544–571. - PMC - PubMed
    1. Chen L., Yang J., Yu J., Yao Z., Sun L., Shen Y., Jin Q. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 2005;33:D325–D328. - PMC - PubMed
    1. Yang J., Chen L., Sun L., Yu J., Jin Q. VFDB 2008 release: an enhanced web-based resource for comparative pathogenomics. Nucleic Acids Res. 2008;36:D539–D542. - PMC - PubMed
    1. Chen L., Xiong Z., Sun L., Yang J., Jin Q. VFDB 2012 update: toward the genetic diversity and molecular evolution of bacterial virulence factors. Nucleic Acids Res. 2012;40:D641–D645. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources