A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae (original) (raw)

Arch Virol. 2003; 148(11): 2207–2235.

J. M. González

Centro Nacional de Biotecnología, CSIC, Department of Molecular and Cell Biology, Campus Universidad Autónoma, Cantoblanco, Madrid, Spain, , , , ES

P. Gomez-Puertas

Bioinformatics Lab. Centro de Astrobiología (CSIC-INTA), Torrejón de Ardoz, Madrid, Spain, , , , ES

D. Cavanagh

Institute for Animal Health, Compton Laboratory, Compton, Newbury, U.K., , , , GB

A. E. Gorbalenya

Center of Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands, , , , NL

Luis Enjuanes

Centro Nacional de Biotecnología, CSIC, Department of Molecular and Cell Biology, Campus Universidad Autónoma, Cantoblanco, Madrid, Spain, , , , ES

Centro Nacional de Biotecnología, CSIC, Department of Molecular and Cell Biology, Campus Universidad Autónoma, Cantoblanco, Madrid, Spain, , , , ES

Bioinformatics Lab. Centro de Astrobiología (CSIC-INTA), Torrejón de Ardoz, Madrid, Spain, , , , ES

Institute for Animal Health, Compton Laboratory, Compton, Newbury, U.K., , , , GB

Center of Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands, , , , NL

Copyright © Springer-Verlag/Wien 2003

This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

Summary.

The Coronaviridae family, comprising the Coronavirus and Torovirus genera, is part of the Nidovirales order that also includes two other families, Arteriviridae and Roniviridae. Based on genetic and serological relationships, groups 1, 2 and 3 were previously recognized in the Coronavirus genus. In this report we present results of comparative sequence analysis of the spike (S), envelope (E), membrane (M), and nucleoprotein (N) structural proteins, and the two most conserved replicase domains, putative RNA-dependent RNA polymerase (RdRp) and RNA helicase (HEL), aimed at a revision of the Coronaviridae taxonomy. The results of pairwise comparisons involving structural and replicase proteins of the Coronavirus genus were consistent and produced percentages of sequence identities that were distributed in discontinuous clusters. Inter-group pairwise scores formed a single cluster in the lowest percentile. No homologs of the N and E proteins have been found outside coronaviruses, and the only (very) distant homologs of S and M proteins were identified in toroviruses. Intragroup sequence conservation was higher, although for some pairs, especially those from the most diverse group 1, scores were close or even overlapped with those from the intergroup comparisons. Phylogenetic analysis of six proteins using a neighbor-joining algorithm confirmed three coronavirus groups. Comparative sequence analysis of RdRp and HEL domains were extended to include arterivirus and ronivirus homologs. The pairwise scores between sequences of the genera Coronavirus and Torovirus (22–25% and 21–25%) were found to be very close to or overlapped with the value ranges (12 to 22% and 17 to 25%) obtained for interfamily pairwise comparisons, but were much smaller than values derived from pairwise comparisons within the Coronavirus genus (63–71% and 59–67%). Phylogenetic analysis confirmed toroviruses and coronaviruses to be separated by a large distance that is comparable to those between established nidovirus families. Based on comparison of these scores with those derived from analysis of separate ranks of several multi-genera virus families, like the Picornaviridae, a revision of the Coronaviridae taxonomy is proposed. We suggest the Coronavirus and Torovirus genera to be re-defined as two subfamilies within the Coronavirdae or two families within Nidovirales, and the current three informal coronavirus groups to be converted into three genera within the Coronaviridae.

Keywords: Intergroup Comparison, Comparative Sequence Analysis, Virus Family, Distant Homolog, Lower Percentile

Footnotes

Received April 2, 2003; accepted May 20, 2003 Published online August 18, 2003


Articles from Archives of Virology are provided here courtesy of Nature Publishing Group