Viruses, the diseases they can trigger, and the possible associated societal disaster represent different entities. To engage with the complexities of viral pandemics, we need to recognize each entity by using a distinctive name.
The Zika virus (ZIKV) disease caused a public health emergency of international concern that started in February 2016. The overall number of ZIKV-related cases increased until November 2016, after which it declined sharply. While the evaluation of the potential risk and impact of future arbovirus epidemics remains challenging, intensified surveillance efforts along with a scale-up of ZIKV whole-genome sequencing provide an opportunity to understand the patterns of genetic diversity, evolution, and spread of ZIKV. However, a classification system that reflects the true extent of ZIKV genetic variation is lacking. Our objective was to characterize ZIKV genetic diversity and phylodynamics, identify genomic footprints of differentiation patterns, and propose a dynamic classification system that reflects its divergence levels. We analysed a curated dataset of 762 publicly available sequences spanning the full-length coding region of ZIKV from across its geographical span and collected between 1947 and 2021. The definition of genetic groups was based on comprehensive evolutionary dynamics analyses, which included recombination and phylogenetic analyses, within- and between-group pairwise genetic distances comparison, detection of selective pressure, and clustering analyses. Evidence for potential recombination events was detected in a few sequences. However, we argue that these events are likely due to sequencing errors as proposed in previous studies. There was evidence of strong purifying selection, widespread across the genome, as also detected for other arboviruses. A total of 50 sites showed evidence of positive selection, and for a few of these sites, there was amino acid (AA) differentiation between genetic clusters.
Two main genetic clusters were defined, ZA and ZB, which correspond to the already characterized 'African' and 'Asian' genotypes, respectively. Within ZB, two subgroups, ZB.1 and ZB.2, represent the Asiatic and the American (and Oceania) lineages, respectively. ZB.1 is further subdivided into ZB.1.0 (a basal Malaysia sequence sampled in the 1960s and a recent Indian sequence), ZB.1.1 (South-Eastern Asia, Southern Asia, and Micronesia sequences), and ZB.1.2 (very similar sequences from the outbreak in Singapore). ZB.2 is subdivided into ZB.2.0 (basal American sequences and the sequences from French Polynesia, the putative origin of South America introduction), ZB.2.1 (Central America), and ZB.2.2 (Caribbean and North America). This classification system does not use geographical references and is flexible to accommodate potential future lineages. It will be a helpful tool for studies that involve analyses of ZIKV genomic variation and its association with pathogenicity and serve as a starting point for the public health surveillance and response to on-going and future epidemics and to outbreaks that lead to the emergence of new variants.
The genome sequence is the only characteristic readily obtainable for all known viruses, underlying the growing role of comparative genomics in organizing knowledge about viruses in a systematic evolution-aware way, known as virus taxonomy. Overseen by the International Committee on Taxonomy of Viruses (ICTV), development of virus taxonomy involves taxa demarcation at 15 ranks of a hierarchical classification, often in host-specific manner. Outside the ICTV remit, researchers assess fitting numerous unclassified viruses into the established taxa. They employ different metrics of virus clustering, basing on conserved domain(s), separation of viruses in rooted phylogenetic trees and pair-wise distance space. Computational approaches differ further in respect to methodology, number of ranks considered, sensitivity to uneven virus sampling, and visualization of results. Advancing and using computational tools will be critical for improving taxa demarcation across the virosphere and resolving rank origins in research that may also inform experimental virology.
The Polyomaviridae is a family of ubiquitous dsDNA viruses that establish persistent infection early in life. Screening for human polyomaviruses (HPyVs), which comprise 14 diverse species, relies upon species-specific qPCRs whose validity may be challenged by accelerating genomic exploration of the virosphere. Using this reasoning, we tested 64 published HPyV qPCR assays in silico against the 1781 PyV genome sequences that were divided in targets and nontargets, based on anticipated species specificity of each qPCR. We identified several cases of problematic qPCR performance that were confirmed in vitro and corrected through using degenerate oligos. Furthermore, our study ranked 8 out of 52 tested BKPyV qPCRs as remaining of consistently high quality in the wake of recent PyV discoveries and showed how sensitivity of most other qPCRs could be rescued by annealing temperature adjustment. This study establishes an efficient framework for ensuring confidence in available HPyV qPCRs in the genomic era.
The ongoing coronavirus (CoV) disease 2019 (COVID-19) pandemic caused by infection with severe acute respiratory syndrome CoV 2 (SARS-CoV-2) is associated with substantial morbidity and mortality. Understanding the immunological and pathological processes of coronavirus diseases is crucial for the rational design of effective vaccines and therapies for COVID-19. Previous studies showed that 2'-O-methylation of the viral RNA cap structure is required to prevent the recognition of viral RNAs by intracellular innate sensors. Here, we demonstrate that the guanine N7-methylation of the 5' cap mediated by coronavirus nonstructural protein 14 (nsp14) contributes to viral evasion of the type I interferon (IFN-I)-mediated immune response and pathogenesis in mice. A Y414A substitution in nsp14 of the coronavirus mouse hepatitis virus (MHV) significantly decreased N7-methyltransferase activity and reduced guanine N7-methylation of the 5' cap in vitro. Infection of myeloid cells with recombinant MHV harboring the nsp14-Y414A mutation (rMHV(nsp14-Y414A)) resulted in upregulated expression of IFN-I and ISG15 mainly via MDA5 signaling and in reduced viral replication compared to that of wild-type rMHV. rMHV(nsp14-Y414A) replicated to lower titers in livers and brains and exhibited an attenuated phenotype in mice. This attenuated phenotype was IFN-I dependent because the virulence of the rMHV(nsp14-Y414A) mutant was restored in Ifnar(-/-) mice. We further found that the comparable mutation (Y420A) in SARS-CoV-2 nsp14 (rSARS-CoV-2(nsp14-Y420A)) also significantly decreased N7-methyltransferase activity in vitro, and the mutant virus was attenuated in K18-human ACE2 transgenic mice.
Moreover, infection with rSARS-CoV-2(nsp14-Y420A) conferred complete protection against subsequent and otherwise lethal SARS-CoV-2 infection in mice, indicating the vaccine potential of this mutant. IMPORTANCE Coronaviruses (CoVs), including SARS-CoV-2, the cause of COVID-19, use several strategies to evade the host innate immune responses. While the cap structure of RNA, including CoV RNA, is important for translation, previous studies indicate that the cap also contributes to viral evasion from the host immune response. In this study, we demonstrate that the N7-methylated cap structure of CoV RNA is pivotal for virus immunoevasion. Using recombinant MHV and SARS-CoV-2 encoding an inactive N7-methyltransferase, we demonstrate that these mutant viruses are highly attenuated in vivo and that attenuation is apparent at very early times after infection. Virulence is restored in mice lacking interferon signaling. Further, we show that infection with virus defective in N7-methylation protects mice from lethal SARS-CoV-2, suggesting that the N7-methylase might be a useful target in drug and vaccine development.
A group convened and led by the Virus Evolution Working Group of the World Health Organization reports on its deliberations and announces a naming scheme that will enable clear communication about SARS-CoV-2 variants of interest and concern.
At least six small alternative-frame open reading frames (ORFs) overlapping well-characterized SARS-CoV-2 genes have been hypothesized to encode accessory proteins. Researchers have used different names for the same ORF or the same name for different ORFs, resulting in erroneous homological and functional inferences. We propose standard names for these ORFs and their shorter isoforms, developed in consultation with the Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. We recommend calling the 39 codon Spike-overlapping ORF ORF2b; the 41, 57, and 22 codon ORF3a-overlapping ORFs ORF3c, ORF3d, and ORF3b; the 33 codon ORF3d isoform ORF3d-2; and the 97 and 73 codon Nucleocapsid-overlapping ORFs ORF9b and ORF9c. Finally, we document conflicting usage of the name ORF3b in 32 studies, and consequent erroneous inferences, stressing the importance of reserving identical names for homologs. We recommend that authors referring to these ORFs provide lengths and coordinates to minimize ambiguity caused by prior usage of alternative names.
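As an illustration only, the recommended names in the abstract above can be held in a small lookup keyed by the overlapped gene and the codon length. This is a minimal sketch, not part of the cited work: the names `RECOMMENDED_ORF_NAMES` and `recommended_name` are hypothetical, and filing the 33 codon ORF3d isoform under ORF3a is an assumption made because ORF3d itself overlaps ORF3a.

```python
from typing import Optional

# (overlapped gene, length in codons) -> recommended name, as listed in the abstract.
RECOMMENDED_ORF_NAMES = {
    ("Spike", 39): "ORF2b",
    ("ORF3a", 41): "ORF3c",
    ("ORF3a", 57): "ORF3d",
    ("ORF3a", 22): "ORF3b",
    # Assumption: the ORF3d isoform is keyed under ORF3a, the gene ORF3d overlaps.
    ("ORF3a", 33): "ORF3d-2",
    ("Nucleocapsid", 97): "ORF9b",
    ("Nucleocapsid", 73): "ORF9c",
}


def recommended_name(overlapped_gene: str, codons: int) -> Optional[str]:
    """Return the recommended ORF name, or None if the pair is not listed."""
    return RECOMMENDED_ORF_NAMES.get((overlapped_gene, codons))


if __name__ == "__main__":
    print(recommended_name("Spike", 39))
    print(recommended_name("Nucleocapsid", 73))
```

Keying on length as well as the overlapped gene mirrors the abstract's advice to report lengths and coordinates, since the name alone has been used ambiguously.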
Species taxa are the units of taxonomy most suited to measure virus diversity, and they account for more than 70% of all virus taxa. Yet, as evidenced by the content of GenBank entries and illustrated by the recent literature on SARS-CoV-2, they are the most neglected taxa of virus research. To correct this disparity, we propose to make species taxa a first choice for communicating virus taxonomy in publications concerning viruses. We see it as a key step toward promoting research on diverse viruses, including pathogens, at this fundamental level of biology.
RNA-dependent RNA polymerases (RdRps) of the Nidovirales (Coronaviridae, Arteriviridae, and 12 other families) are linked to an amino-terminal (N-terminal) domain, called NiRAN, in a non-structural protein (nsp) that is released from polyprotein 1ab by the viral main protease (Mpro). Previously, self-GMPylation/UMPylation activities were reported for an arterivirus NiRAN-RdRp nsp and suggested to generate a transient state primed for transferring nucleoside monophosphate (NMP) to (currently unknown) viral and/or cellular biopolymers. Here, we show that the coronavirus (human coronavirus [HCoV]-229E and severe acute respiratory syndrome coronavirus 2) nsp12 (NiRAN-RdRp) has Mn2+-dependent NMPylation activity that catalyzes the transfer of a single NMP to the cognate nsp9 by forming a phosphoramidate bond with the primary amine at the nsp9 N terminus (N3825) following Mpro-mediated proteolytic release of nsp9 from N-terminally flanking nsps. Uridine triphosphate was the preferred nucleotide in this reaction, but also adenosine triphosphate, guanosine triphosphate, and cytidine triphosphate were suitable cosubstrates. Mutational studies using recombinant coronavirus nsp9 and nsp12 proteins and genetically engineered HCoV-229E mutants identified residues essential for NiRAN-mediated nsp9 NMPylation and virus replication in cell culture. The data corroborate predictions on NiRAN active-site residues and establish an essential role for the nsp9 N3826 residue in both nsp9 NMPylation in vitro and virus replication. This residue is part of a conserved N-terminal NNE tripeptide sequence and shown to be the only invariant residue in nsp9 and its homologs in viruses of the family Coronaviridae.
The study provides a solid basis for functional studies of other nidovirus NMPylation activities and suggests a possible target for antiviral drug development.
Two pandemics of respiratory distress diseases associated with zoonotic introductions of the species Severe acute respiratory syndrome-related coronavirus in the human population during 21st century raised unprecedented interest in coronavirus research and assigned it unseen urgency. The two viruses responsible for the outbreaks, SARS-CoV and SARS-CoV-2, respectively, are in the spotlight, and SARS-CoV-2 is the focus of the current fast-paced research. Its foundation was laid down by studies of many corona- and related viruses that collectively form the vast order Nidovirales. Comparative genomics of nidoviruses played a key role in this advancement over more than 30 years. It facilitated the transfer of knowledge from characterized to newly identified viruses, including SARS-CoV and SARS-CoV-2, as well as contributed to the dissection of the nidovirus proteome and identification of patterns of variations between different taxonomic groups, from species to families. This review revisits selected cases of protein conservation and variation that define nidoviruses, illustrates the remarkable plasticity of the proteome during nidovirus adaptation, and asks questions at the interface of the proteome and processes that are vital for nidovirus reproduction and could inform the ongoing research of SARS-CoV-2. © 2020 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Motivation: To facilitate accurate estimation of statistical significance of sequence similarity in profile-profile searches, queries should ideally correspond to protein domains. For multidomain proteins, using domains as queries depends on delineation of domain borders, which may be unknown. Thus, proteins are commonly used as queries that complicate establishing homology for similarities close to cutoff levels of statistical significance. Results: In this article, we describe an iterative approach, called LAMPA, LArge Multidomain Protein Annotator, that resolves the above conundrum by gradual expansion of hit coverage of multidomain proteins through re-evaluating statistical significance of hit similarity using ever smaller queries defined at each iteration. LAMPA employs TMHMM and HHsearch for recognition of transmembrane regions and homology, respectively. We used Pfam database for annotating 2985 multidomain proteins (polyproteins) composed of >1000 amino acid residues, which dominate proteomes of RNA viruses. Under strict cutoffs, LAMPA outperformed HHsearch-mediated runs using intact polyproteins as queries by three measures: number of and coverage by identified homologous regions, and number of hit Pfam profiles. Compared to HHsearch, LAMPA identified 507 extra homologous regions in 14.4% of polyproteins. This Pfam-based annotation of RNA virus polyproteins by LAMPA was also superior to RefSeq expert annotation by two measures, region number and annotated length, for 69.3% of RNA virus polyprotein entries. We rationalized the obtained results based on dependencies of HHsearch hit statistical significance for local alignment similarity score from lengths and diversities of query-target pairs in computational experiments.
Gorbalenya, A.E.; Krupovic, M.; Mushegian, A.; Kropinski, A.M.; Siddell, S.G.; Varsani, A.; ... ; Viruses Executive Comm 2020
Virus taxonomy emerged as a discipline in the middle of the twentieth century. Traditionally, classification by virus taxonomists has been focussed on the grouping of relatively closely related viruses. However, during the past few years, the International Committee on Taxonomy of Viruses (ICTV) has recognized that the taxonomy it develops can be usefully extended to include the basal evolutionary relationships among distantly related viruses. Consequently, the ICTV has changed its Code to allow a 15-rank classification hierarchy that closely aligns with the Linnaean taxonomic system and may accommodate the entire spectrum of genetic divergence in the virosphere. The current taxonomies of three human pathogens, Ebola virus, severe acute respiratory syndrome coronavirus and herpes simplex virus 1 are used to illustrate the impact of the expanded rank structure. This new rank hierarchy of virus taxonomy will stimulate further research on virus origins and evolution, and vice versa, and could promote crosstalk with the taxonomies of cellular organisms. Here, the International Committee on Taxonomy of Viruses describe a new, expanded virus classification scheme with 15 ranks that closely aligns with the Linnaean taxonomic system and better encompasses viral diversity.
The present outbreak of a coronavirus-associated acute respiratory disease called coronavirus disease 19 (COVID-19) is the third documented spillover of an animal coronavirus to humans in only two decades that has resulted in a major epidemic. The Coronaviridae Study Group (CSG) of the International Committee on Taxonomy of Viruses, which is responsible for developing the classification of viruses and taxon nomenclature of the family Coronaviridae, has assessed the placement of the human pathogen, tentatively named 2019-nCoV, within the Coronaviridae. Based on phylogeny, taxonomy and established practice, the CSG recognizes this virus as forming a sister clade to the prototype human and bat severe acute respiratory syndrome coronaviruses (SARS-CoVs) of the species Severe acute respiratory syndrome-related coronavirus, and designates it as SARS-CoV-2. In order to facilitate communication, the CSG proposes to use the following naming convention for individual isolates: SARS-CoV-2/host/location/isolate/date. While the full spectrum of clinical manifestations associated with SARS-CoV-2 infections in humans remains to be determined, the independent zoonotic transmission of SARS-CoV and SARS-CoV-2 highlights the need for studying viruses at the species level to complement research focused on individual pathogenic viruses of immediate significance. This will improve our understanding of virus-host interactions in an ever-changing environment and enhance our preparedness for future outbreaks.
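The isolate naming convention stated in the abstract above (SARS-CoV-2/host/location/isolate/date) can be sketched as a formatter and parser. This is a minimal illustration under stated assumptions, not a tool from the cited work: the names `IsolateName`, `format_isolate`, and `parse_isolate` are hypothetical, the field values in the usage lines are placeholders, and the abstract does not specify the date format or how a "/" inside a field should be handled, so the sketch simply rejects such fields.

```python
from typing import NamedTuple

PREFIX = "SARS-CoV-2"  # fixed first component of the convention


class IsolateName(NamedTuple):
    host: str
    location: str
    isolate: str
    date: str  # kept as free text; the abstract does not fix a date format


def format_isolate(name: IsolateName) -> str:
    """Join the fields as SARS-CoV-2/host/location/isolate/date."""
    for field in name:
        if not field or "/" in field:
            raise ValueError(f"invalid field: {field!r}")
    return "/".join((PREFIX, *name))


def parse_isolate(text: str) -> IsolateName:
    """Split a name back into its four fields, checking the prefix."""
    parts = text.split("/")
    if len(parts) != 5 or parts[0] != PREFIX or not all(parts):
        raise ValueError(f"not a {PREFIX}/host/location/isolate/date name: {text!r}")
    return IsolateName(*parts[1:])


if __name__ == "__main__":
    # Placeholder values, chosen only to show the shape of a name.
    example = IsolateName("human", "ExampleCity", "isolate-1", "2020")
    print(format_isolate(example))
    print(parse_isolate(format_isolate(example)))
```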
Enteroviruses (EVs) and rhinoviruses (RVs) are significant pathogens of humans and are the subject of intensive clinical and epidemiological research and public health measures, notably in the eradication of poliovirus and in the investigation and control of emerging pathogenic EV types worldwide. EVs and RVs are highly diverse in their antigenic properties, tissue tropism, disease associations and evolutionary relationships, but the latter often conflict with previously developed biologically defined terms, such as "coxsackieviruses", "polioviruses" and "echoviruses", which were used before their genetic interrelationships were understood. This has created widespread formatting problems and inconsistencies in the nomenclature for EV and RV types and species in the literature and public databases. As members of the International Committee for Taxonomy of Viruses (ICTV) Picornaviridae Study Group, we describe the correct use of taxon names for these viruses and have produced a series of recommendations for the nomenclature of EV and RV types and their abbreviations. We believe their adoption will promote greater clarity and consistency in the terminology used in the scientific and medical literature. The recommendations will additionally provide a useful reference guide for journals, other publications and public databases seeking to use standardised terms for the growing multitude of enteroviruses and rhinoviruses described worldwide.