Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut - PubMed (original) (raw)
Discovery of an expansive bacteriophage family that includes the most abundant viruses from the human gut
Natalya Yutin et al. Nat Microbiol. 2018 Jan.
Abstract
Metagenomic sequence analysis is rapidly becoming the primary source of virus discovery 1-3 . A substantial majority of the currently available virus genomes come from metagenomics, and some of these represent extremely abundant viruses, even if never grown in the laboratory. A particularly striking case of a virus discovered via metagenomics is crAssphage, which is by far the most abundant human-associated virus known, comprising up to 90% of sequences in the gut virome 4 . Over 80% of the predicted proteins encoded in the approximately 100 kilobase crAssphage genome showed no significant similarity to available protein sequences, precluding classification of this virus and hampering further study. Here we combine a comprehensive search of genomic and metagenomic databases with sensitive methods for protein sequence analysis to identify an expansive, diverse group of bacteriophages related to crAssphage and predict the functions of the majority of phage proteins, in particular those that comprise the structural, replication and expression modules. Most, if not all, of the crAss-like phages appear to be associated with diverse bacteria from the phylum Bacteroidetes, which includes some of the most abundant bacteria in the human gut microbiome and that are also common in various other habitats. These findings provide for experimental characterization of the most abundant but poorly understood members of the human-associated virome.
Conflict of interest statement
Competing interests
The authors declare no competing financial interests.
Figures
Figure 1. Architecture and evolution of the capsid gene module of the crAss-like phage family
The phylogenetic tree was constructed from concatenated multiple alignments of the 5 proteins of the capsid module. The genomic maps of the capsid gene block are shown for each branch. The 5 genes of the capsid module are color-coded, and uncharacterized adjacent genes are shown by empty block arrows. The colors of labels and branches indicate host or metagenome source: Red, human gut or fecal metagenome; Green, termite gut metagenome; Purple, terrestrial/groundwater; Brown, Soda lake (hypersaline brine); Turquoise, Marine sediment; Orange, populus root microbiome; Black, _Flavobacterium psychrophilum_950106-1/1 (fish pathogen). Tree branch colors indicate the DNA polymerase family represented in the respective genome (contig): Purple, family B DNAP; Red, family A DNAP; Green, no DNAP; Black, unknown (incomplete genomes). Support values were obtained using 100 bootstrap replications; values greater than 50% are shown. The scale bar represents the number of amino acid substitutions per site. No outgroup was included due to the low (or absent) similarity between the crAss-like family protein to homologs from other phages.
Figure 2. Whole genome maps of crAssphage and IAS virus, the two members of the crAss-like family that are abundant in the human gut virome
Conserved crAss-like family genes are color-coded. Dashed boxes highlight capsid, tail, replication, and transcription gene blocks. Genome regions encoding proteins with strong similarity to Bacteroidetes are shaded in pale green. Gene numbers are according to the crAssphage and IAS virus MetaGeneMark translations. Abbreviations: ssb, single-stranded DNA-binding protein; SF1, SF1 helicase; UDG, uracyl-DNA glycosylase; PolB, DNA polymerase family B; SF2, SNF2-family helicase; RecT, phage RecT recombinase; primase, DnaG family primase; ligase, ATP-dependent DNA ligase; dNK, deoxynucleoside monophosphate kinase; ThyX, flavin-dependent thymidylate synthase; Gp157, Siphovirus Gp157; dUTP, dUTPase; N4_gp49, phage protein of N4_gp49/Sf6_gp66 family; RepL, plasmid replication initiation protein RepL; IHF, integration host factor IHF subunit; PD-(D/E)XK, PD-(D/E)XK family nuclease; Rep_Org, putative replisome organizer protein; DnaB, DnaB replicative DNA helicase; AAA, AAA domain ATPase; rIIA, rIIA-like protector protein; rIIB, rIIB-like protector protein; NRDD, anaerobic ribonucleoside-triphosphate reductase; RNR, anaerobic ribonucleoside-triphosphate reductase activating protein; PolA, DNA polymerase family A; DprA, DNA processing protein DprA. For further details on the annotation, see Supplementary Table 2.
Figure 3. The replicative gene module of the crAss-like phage family
A. The crAssphage group B. The rest of the crAss-like family Homologous genes are marked by the same colors and labels. Genes with no predicted function are numbered according to crAssphage translation, OBV-13 virus translation (suffix ‘b’), Cellulophaga phage phi14:2 translation (suffix ‘c’), or IAS virus translation (suffix ‘i’). Abbreviations:: RNRm, class II ribonucleotide reductase; RNRa, ribonucleoside reductase alpha chain; RNRb, ribonucleoside reductase beta chain; GGCT, Gamma-glutamyl cyclotransferase; Gn_AT, glucosamine-fructose-6-phosphate aminotransferase; NTP-PPase, nucleoside triphosphate pyrophosphohydrolase. The rest of the abbreviations are as in Figure 2.
Figure 3. The replicative gene module of the crAss-like phage family
A. The crAssphage group B. The rest of the crAss-like family Homologous genes are marked by the same colors and labels. Genes with no predicted function are numbered according to crAssphage translation, OBV-13 virus translation (suffix ‘b’), Cellulophaga phage phi14:2 translation (suffix ‘c’), or IAS virus translation (suffix ‘i’). Abbreviations:: RNRm, class II ribonucleotide reductase; RNRa, ribonucleoside reductase alpha chain; RNRb, ribonucleoside reductase beta chain; GGCT, Gamma-glutamyl cyclotransferase; Gn_AT, glucosamine-fructose-6-phosphate aminotransferase; NTP-PPase, nucleoside triphosphate pyrophosphohydrolase. The rest of the abbreviations are as in Figure 2.
Figure 4. The genome expression gene module of the crass-like phage family
The predicted RNAP subunits as well as the RNAP and protease motifs are color-coded as shown at the bottom of the figure. The PD-DxK nucleases are most likely encoded in Group I introns.
Similar articles
- The crAss-like Phage Group: How Metagenomics Reshaped the Human Virome.
Koonin EV, Yutin N. Koonin EV, et al. Trends Microbiol. 2020 May;28(5):349-359. doi: 10.1016/j.tim.2020.01.010. Epub 2020 Feb 28. Trends Microbiol. 2020. PMID: 32298613 Review. - Biology and Taxonomy of crAss-like Bacteriophages, the Most Abundant Virus in the Human Gut.
Guerin E, Shkoporov A, Stockdale SR, Clooney AG, Ryan FJ, Sutton TDS, Draper LA, Gonzalez-Tortuero E, Ross RP, Hill C. Guerin E, et al. Cell Host Microbe. 2018 Nov 14;24(5):653-664.e6. doi: 10.1016/j.chom.2018.10.002. Epub 2018 Oct 25. Cell Host Microbe. 2018. PMID: 30449316 - Evolution of BACON Domain Tandem Repeats in crAssphage and Novel Gut Bacteriophage Lineages.
Jonge PA, Meijenfeldt FABV, Rooijen LEV, Brouns SJJ, Dutilh BE. Jonge PA, et al. Viruses. 2019 Nov 21;11(12):1085. doi: 10.3390/v11121085. Viruses. 2019. PMID: 31766550 Free PMC article. - Discovery, diversity, and functional associations of crAss-like phages in human gut metagenomes from four Dutch cohorts.
Gulyaeva A, Garmaeva S, Ruigrok RAAA, Wang D, Riksen NP, Netea MG, Wijmenga C, Weersma RK, Fu J, Vila AV, Kurilshikov A, Zhernakova A. Gulyaeva A, et al. Cell Rep. 2022 Jan 11;38(2):110204. doi: 10.1016/j.celrep.2021.110204. Cell Rep. 2022. PMID: 35021085 - CrAss-Like Phages: From Discovery in Human Fecal Metagenome to Application as a Microbial Source Tracking Marker.
Remesh AT, Viswanathan R. Remesh AT, et al. Food Environ Virol. 2024 Jun;16(2):121-135. doi: 10.1007/s12560-024-09584-5. Epub 2024 Feb 27. Food Environ Virol. 2024. PMID: 38413544 Review.
Cited by
- Isolation and characterisation of ΦcrAss002, a crAss-like phage from the human gut that infects Bacteroides xylanisolvens.
Guerin E, Shkoporov AN, Stockdale SR, Comas JC, Khokhlova EV, Clooney AG, Daly KM, Draper LA, Stephens N, Scholz D, Ross RP, Hill C. Guerin E, et al. Microbiome. 2021 Apr 12;9(1):89. doi: 10.1186/s40168-021-01036-7. Microbiome. 2021. PMID: 33845877 Free PMC article. - The spinal cord-gut-immune axis as a master regulator of health and neurological function after spinal cord injury.
Kigerl KA, Zane K, Adams K, Sullivan MB, Popovich PG. Kigerl KA, et al. Exp Neurol. 2020 Jan;323:113085. doi: 10.1016/j.expneurol.2019.113085. Epub 2019 Oct 22. Exp Neurol. 2020. PMID: 31654639 Free PMC article. Review. - A simple, reproducible and cost-effective procedure to analyse gut phageome: from phage isolation to bioinformatic approach.
d'Humières C, Touchon M, Dion S, Cury J, Ghozlane A, Garcia-Garcera M, Bouchier C, Ma L, Denamur E, P C Rocha E. d'Humières C, et al. Sci Rep. 2019 Aug 5;9(1):11331. doi: 10.1038/s41598-019-47656-w. Sci Rep. 2019. PMID: 31383878 Free PMC article. - Characterization of crAss-like phage isolates highlights Crassvirales genetic heterogeneity and worldwide distribution.
Ramos-Barbero MD, Gómez-Gómez C, Sala-Comorera L, Rodríguez-Rubio L, Morales-Cortes S, Mendoza-Barberá E, Vique G, Toribio-Avedillo D, Blanch AR, Ballesté E, Garcia-Aljaro C, Muniesa M. Ramos-Barbero MD, et al. Nat Commun. 2023 Jul 18;14(1):4295. doi: 10.1038/s41467-023-40098-z. Nat Commun. 2023. PMID: 37463935 Free PMC article. - The stepwise assembly of the neonatal virome is modulated by breastfeeding.
Liang G, Zhao C, Zhang H, Mattei L, Sherrill-Mix S, Bittinger K, Kessler LR, Wu GD, Baldassano RN, DeRusso P, Ford E, Elovitz MA, Kelly MS, Patel MZ, Mazhani T, Gerber JS, Kelly A, Zemel BS, Bushman FD. Liang G, et al. Nature. 2020 May;581(7809):470-474. doi: 10.1038/s41586-020-2192-1. Epub 2020 Apr 15. Nature. 2020. PMID: 32461640 Free PMC article.
References
- Rohwer F. Global phage diversity. Cell. 2003;113:141. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources