Crystal Structure of the SARS-CoV-2 Non-structural Protein 9, Nsp9

(1)

Dene R. Littler,

Benjamin S. Gully,

Rhys N. Colson,

Jamie Rossjohn

[email protected] (D.R.L.) [email protected] (J.R.) HIGHLIGHTS The SARS-CoV-2 Nsp9 protein is structurally similar to SARS-CoV Dimerization of the coronaviral Nsp9 proteins is known to be required for its function

Oligomerization is mediated by an unusual GxxxG protein-protein interaction interface

A cavity near this Nsp9 GxxxG interaction interface may be able to bind peptides

Littler et al., iScience23, 101258

July 24, 2020ª2020 The Author(s).

https://doi.org/10.1016/ j.isci.2020.101258

(2)

Article

Crystal Structure of the SARS-CoV-2

Non-structural Protein 9, Nsp9

Dene R. Littler,

1,2,

_*

_{Benjamin S. Gully,}

1,2

_{Rhys N. Colson,}

1

_{and Jamie Rossjohn}

1,2,3,4,

_*

SUMMARY

Many of the SARS-CoV-2 proteins have related counterparts across the Severe

Acute Respiratory Syndrome (SARS-CoV) family. One such protein is

non-struc-tural protein 9 (Nsp9), which is thought to mediate viral replication, overall

viru-lence, and viral genomic RNA reproduction. We sought to better characterize the

SARS-CoV-2 Nsp9 and subsequently solved its X-ray crystal structure, in an apo

form and, unexpectedly, in a peptide-bound form with a sequence originating

from a rhinoviral 3C protease sequence (LEVL). The SARS-CoV-2 Nsp9 structure

revealed the high level of structural conservation within the Nsp9 family. The

exogenous peptide binding site is close to the dimer interface and impacted

the relative juxtapositioning of the monomers within the homodimer. We have

es-tablished a protocol for the production of SARS-CoV-2 Nsp9, determined its

structure, and identified a peptide-binding site that warrants further study to

un-derstanding Nsp9 function.

INTRODUCTION

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) comprises a large single-stranded positive polarity RNA genome that acts as messenger RNA after entering the host. The 50two-thirds of the genome encodes a large polyprotein that is translated into ORF1a and ORF1ab through host ribosomal frameshift-ing, with the remainder of the viral RNA encoding structural and accessory proteins within smaller ORFs. The viral proteins necessary for host cell infection such as the RNA polymerase along with enzymes that facilitate RNA synthesis are largely contained within the SARS-CoV-2 polyproteins and are released by the action of two internally encoded proteases. The mature proteins thus released are known as non-struc-tural proteins (Nsps) as they are not incorporated within the virion particles. Owing to their degree of sequence conservation the enzymatic roles and essentiality of each of the Nsps within SARS-CoV-2 is likely to mimic the behavior of homologous proteins within previously studied coronaviruses such as SARS-CoV. The development of therapeutic interventions against SARS-CoV-2 infection has focused on a number of approaches: vaccination strategies that target the structural spike glycoprotein of the envelope (Wrapp et al., 2020) and may also include a larger selection of viral proteins (Thanh Le et al., 2020), whereas small-molecule compounds have predominantly targeted two conserved viral enzymes, the main protease (Zhenming et al., 2020) (Yang et al., 2005) and the RNA-polymerase (Yan et al., 2020). Nevertheless, some of the betacoronaviral non-structural proteins appear necessary for viral replication within SARS-CoV and in-fluence pathogenesis (Frieman et al., 2012). Despite their close homology between viruses, such non-struc-tural proteins remain of interest as they may have conserved roles within the viral life cycle of SARS-CoV-2 that could be susceptible to inhibition.

During infection of human cells, SARS-CoV Non-structural protein 9 (Nsp9SARS) was found to be essential

for replication (Frieman et al., 2012). Homologs of the Nsp9 protein have been identified in numerous coro-naviruses, including SARS-Cov-2 (Nsp9COV19), human coronavirus 229E (Nsp9HcoV), avian infectious bronchitis

virus (Nsp9IBV), porcine epidemic diarrhea virus (Nsp9PEDV), and porcine delta virus (Nsp9PDCoV). Nsp9SARS

has been shown to have modest affinity for long oligonucleotides with binding thought to be dependent on oligomerization state (Egloff et al., 2004) (Sutton et al., 2004). Nsp9SARSdimerizes in solution via a conserved

a-helical ‘‘GxxxG’’ motif. Disruption of key residues within this motif reduces both RNA binding (Sutton et al., 2004) and SARS-CoV viral replication (Frieman et al., 2012). The mechanism of RNA binding within the Nsp9 protein family is not understood as these proteins have an unusual structural fold not previously seen in RNA-binding proteins (Egloff et al., 2004) (Sutton et al., 2004). The fold’s Greek-key motif exhibits

1_{Infection and Immunity} Program, Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia

2_{Australian Research Council} Centre of Excellence for Advanced Molecular Imaging, Monash University, Clayton, VIC, Australia 3_{Institute of Infection and} Immunity, Cardiff University School of Medicine, Heath Park, Cardiff, UK 4_{Lead Contact} *Correspondence: [email protected] (D.R.L.), [email protected] (J.R.) https://doi.org/10.1016/j.isci. 2020.101258

(3)

topological similarities with Oligonucleotide/oligosaccharide binding proteins (OB-fold), but such vestiges have proven insufficient to provide clear insight into Nsp9 function (Egloff et al., 2004). As a consequence of the weak affinity of Nsp9SARSfor long oligonucleotide stretches it was suggested that the natural RNA

sub-strate may instead be conserved features at the 30end of the viral-genome (the stem-loop II RNA-motif) ( Pon-nusamy et al., 2008). Furthermore, potential direct interactions with the co-factors of the RNA polymerase have been reported (Chen et al., 2017). However, it remains to be determined how the oligonucleotide-binding ac-tivity of Nsp9 proteins promotes viral replication during infection.

The sequence of Nsp9 homologs is conserved among betacoronoaviruses, suggesting a degree of func-tional conservation. Nsp9COV19exhibits 97% sequence identity with Nsp9SARSbut only 44% sequence

iden-tity with Nsp9HCoV.The structure of the HCoV-229E Nsp9 protein suggested a potential oligomeric switch

induced upon the formation of an intersubunit disulfide bond. Here, disulfide bond formation shifts the relative orientation of the Nsp9 monomers, which was suggested to promote higher-order oligomerization (Ponnusamy et al., 2008). The resultant rod-like higher-order Nsp9HCoVassemblies had increased affinity for

the RNA oligonucleotides. Cysteine mutants of Nsp9HCoVthat are unable to produce the disulfide

dis-played reduced RNA-binding affinity (Ponnusamy et al., 2008). The observation of a redox-induced struc-tural switch of Nsp9HCoVled to the hypothesis that Nsp9HCoVmay have a functional role in sensing the

redox status of the host cell (Ponnusamy et al., 2008). Although the ‘‘redox-switch’’ cysteine responsible for oligomer formation in Nsp9HCoVis conserved among different viral Nsp9 homologs, the higher-order

oligomers were not observed for Nsp9SARS(Ponnusamy et al., 2008). Because of these potential differences

between Nsp9 proteins we sought to further characterize the nature of Nsp9COV19.

RESULTS

Expression and Purification of the SARS-CoV-2 Nsp9 Protein

The Nsp9 protein from SARS-CoV-2 (Nsp9COV19) was cloned and recombinantly expressed inE. coli. The

expression construct included an N-terminal Hexa-His tag attached via a rhinoviral 3C-protease site. Following Ni-affinity chromatography Nsp9COV19was further purified via size-exclusion chromatography

to yield >95% pure and homogeneous protein. Nsp9COV19eluted from gel filtration columns with the

apparent molecular weight of a dimer suggesting that, as with other Nsp9 proteins, Nsp9COV19is an

obli-gate homodimer. The N-terminal tag was removed prior to any biochemical experiments via overnight digestion with precision protease, as reported for Nsp9SARS(Sutton et al., 2004).

Nucleotide Binding of Nsp9COV19Protein

The affinity of viral Nsp9 homologs for oligonucleotides has a range of binding affinities reported, some of which are dependent on oligomerization state and nucleotide length, ranging from 20 to 400mM (Zeng et al., 2018). We therefore sought to assess the potential for Nsp9COV19to bind to fluorescently labeled

ol-igonucleotides using fluorescence anisotropy. Preliminary experiments were performed under conditions similar to those previously identified for Nsp9SARS(Sutton et al., 2004). Oligonucleotide affinity was very

limited under our assay conditions (Figure 1). Indeed, protein concentrations up to 200mM of Nsp9COV19

Figure 1. Nsp9COV19Nucleotide-Binding Assay

Fluorescence polarization anisotropy assays were used to examine the possibility that Nsp9COV19could bind to labeled

17-mer and 10-mer single-stranded oligonucleotides. The plot shows corrected anisotropy for each Nsp9COV19protein

(4)

did not result in saturated binding and thus indicated an incredibly low affinityKD, or no affinity for these

oligonucleotides at all under these assay conditions.

Crystal Structure of apo-Nsp9COV19

We next determined the structure of apo-Nsp9COV19(Table 1). The apo-Nsp9COV19structure aligned closely

to that of Nsp9SARS(root-mean-square deviation [RMSD] of 0.57 A˚ over 113 Ca,Figures 2A–2C) (Egloff et al.,

2004). Like other Nsp9 homologs it exhibits an unusual fold that is yet to be observed outside of coronavi-ruses (Sutton et al., 2004). The core of the fold is a small six-stranded enclosedbbarrel, from which a series of extended loops project outward (Figure 2A). The elongated loops link the individualbstrands of the barrel,

3c-Nsp9COV19 apo-Nsp9COV19

Data Collection Space group P4322 P6122 Cell dimensions a,b,c(A˚) 59.0, 59.0, 85.7 88.8, 88.8, 134.0 a,b,g() 90.0, 90.0, 90.0 90.0, 90.0, 120.0 Resolution (A˚) 48.5-2.0 (2.1-2.05) 44.7-2.00 (2.06-2.00) Rpima 2.4 (60.4) 1.2 (40.4) I/s1 13.7 (1.4) 26.4 (1.4) Completeness (%) 100 (100) 99.4 (93.1) Total number of. observations 86,945 (12,241) 423956 (28,236) Number of unique observations 10,055 (1,425) 21,482 (1,451) Multiplicity 8.6 (8.6) 19.7 (19.5) Refinement Statistics Rfactorb(%) 23.3 21.4 Rfreec(%) 24.7 25.2 Number of atoms Protein 936 1,716

Water (ligand) 5 (PO4) 18 (3 SO4)

Ramachandran plot (%) Most favored 95.0 95.9 Allowed region 5.0 4.1 Outlier 0.0 0.0 B factors (A˚2₎ Protein 72.1 73.8 RMSD bonds (A˚) 0.015 0.012 RMSD angles () 1.33 1.19

Table 1. Data Collection and Refinement Statistics

Values in parentheses refer to the highest resolution bin.

a_R

p.i.m=Shkl[1/(N-1)]1/2Si| Ihkl, i- <Ihkl> |/Shkl<Ihkl>. b_R

factor= (S| |Fo| - |Fc| |)/(S|Fo|)—for all data except as indicated in footnote c. c_{Five percent of data was used for the R}

(5)

along with a projecting N-terminalbstrand and C-terminala1 helix; the latter two elements make up the main components of the dimer interface (Figure 3A). Two loops project from the open face of the barrel: theb2-3- andb3-4-loops are both positively charged, glycine rich, and proposed to be involved in RNA bind-ing. The only protrusion on the enclosed barrel side is theb6-7-loop; the C-terminal half of theb7 strand is an integral part of the fold’s barrel-core, but its other half extended outward to pair with the externalb6 strand and create a twistedbhairpin, cupping thea1 helix and interacting with subsequent C-terminal residues. The arrangement of monomers within Nsp9 dimers is well conserved in different viruses and is maintained within Nsp9COV19(RMSD of 0.66 A˚ over 226Cacompared with the dimeric unit of Nsp9SARS). The main

component of the intersubunit interaction is the self-association of the conserved GxxxG protein-protein binding motif (Figure 3C) that allowed backbone van der Waals interactions between interfacing copies of the C-terminala1 helix (Hu et al., 2017). Here Gly-100 of the respective parallela1 helices formed comple-mentary backbone van der Waals interactions. These interactions were replicated after a full helical turn by Gly-104 of the respective chains, thereby forming the molecular basis of the Nsp9COVID19dimer interface

(Figure 3C). The 2-fold axis that created the dimer ran at a15angle through the GxxxG motif allowing the 14-residue helix to cross its counterpart (Figure 4A), the N-terminal turns of the helix were relatively iso-lated, only making contacts with counterpart protomer residues. In contrast, the C-terminal portions were encircled by hydrophobic residues, albeit at a distance that created funnel-like hydrophobic cavities either side of the interfacing helices (Figure 3A). Strandsb1 andb6 and the protein’s C terminus served to provide

Figure 2. apo-Nsp9COV19Is Structurally Similar to Nsp9SARS

(A–C) Cartoon representation of the monomeric units of (A) apo-Nsp9COV19, (B) apo-Nsp9SARS(Sutton et al., 2004), and (C)

a backbone alignment of the two structures. The COV19 structures are colored withbstrands inmarineand theahelix in

wheat;the SARS structures are intealandorange, respectively. (D) The bound peptide is highlighted inred.

(6)

a ring of residues that encircled the paired helices. The first 10 residues of Nsp9COV19exchanged across the

dimer-interface to form a strand-like extension ofb1 that ran alongsideb6 from the other protomer ( Fig-ure 3A). The interaction these strands made did not appear optimal; indeed, the remaining four C-terminal residues projected sideways across the dimer interface, inserting between the two strands while contrib-uting a hydrophobic backing to the main helix.

Extraneous Peptides Occupy the Hydrophobic Cavities of Nsp9

In a separate crystallization experiment we determined the structure of Nsp9COV19that included the

N-ter-minal tag together with a rhinoviral 3C protease sequence (termed 3C-Nsp9COV19). The 3C-Nsp9COV19

crys-tal form diffracted to 2.05 A˚ resolution in space group P4322 and had one molecule within the asymmetric

unit, with the dimer being created across the crystallographic 2-fold axis.

Unexpectedly, the high-resolution structure of 3C-Nsp9COV19diverged from that of the apo-Nsp9COV19

(RMSD 0.86 A˚ for the monomer and 2.23 A˚ when superimposing a dimer). The 3C sequence folded around either side of the paired intersubunit helices to fill two funnel-like hydrophobic cavities (Figures 2D,3B,4C, and 4D), namely, 3C residues LEVL, inserted into the opposing cavities either side of the dimer interface and ran parallel to the paired GxxxG motif. Moreover, the 3C sequence formed additionalbsheet interac-tions with the N terminus of the protein from the other protomer (Figure 3B). To accommodate the 3C res-idues the N-terminal strand resres-idues moved outward by1.6 A˚ (residues 6–10). This movement allowed the N terminus to increase the number ofbsheet interactions it formed withb6’. Thebbarrel core of the fold remained unchanged, but the increase in interactions betweenb1 andb60served to exclude the C termi-nus, prompting residues 106–111 to condense into a bent extension of theahelix (Figures 4A and 4B). The

Figure 3. Peptide Binding in Nsp9COV19Alters the Dimer Interface

(A and B) Top-down views of the dimer interface highlighting the interaction helices for (A) unbound Nsp9COV19in which

the surface of the hydrophobic interface cavity is displayed labeled; (B) an equivalent representation of peptide-occupied 3c-Nsp9COV19dimer.

(C) Stick representation of the GxxxG protein-protein interaction helices at the dimer interface for apo-Nsp9COV19.

(D) Cabackbone overlay of the Nsp9COV19interface in the apo and peptide-occupied states. The GxxxG motif residues

(7)

subtle structural changes near the interacting GxxxG motifs (Figure 3D) are amplified at the periphery of the dimer resulting in6 A˚ shift in thebbarrel core (Figure 4C).

Conserved Cavity Residues Accommodate a Peptide Backbone

When comparing apo-Nsp9COV19with 3C-Nsp9COV19the point where the N-terminal interface strand

di-verges is near Leu-9 (Figure 3D). Within the apo form it makes van der Waals interactions with the side chains of Met-101, Asn-33, and Ser-105; this latter serine is important as it immediately follows the conserved protein-binding motif (100GMVLGS105), while also specifically interacting with Gly1040 from the opposing protomer. Within the 3C-Nsp9COV19structure the extraneous LEVL residues insert at this

point (Figure 5B) and the hydrophobic side chains clasp either side of Ser-105 and allowed its hydroxyl group to form backbone hydrogen bonds to the glutamate within the extraneous sequence (Figure 5B).

Figure 4. Movements within the GxxxG Motif

(A and B) A side view of the N- and C- terminal structural elements at the dimer interface is shown for the (A) apo and (B) peptide-occupied forms.

(C) Overlay of the Nsp9COV19dimer in the apo and peptide-bound forms indicating respective shifts in subunit

orientation. The center of mass of the nonaligned subunit is depicted with alight-pinkanddark-pinkpoint, respectively. (D) Unbiased omit map contoured at 3.2snear the hydrophobic cavities into which the exogenous bound peptide was refined.

(8)

Meanwhile, the C-terminal leucine from the extraneous residues inserted behind theahelix of the other protomer (Figure 5B). Cumulatively these changes allow for an5 rotation of the protomer subunits about the 2-fold axis compared with apo-Nsp9COV19(Figure 4C).

Most residues involved in protein binding within the hydrophobic cavity and the structural changes needed to accommodate them appear broadly conserved among other Nsp9 viral homologs (redhighlights in Fig-ure 5A). The main exception to this is Ser-105, which is a tyrosine in the distantly related Nsp9HCoVand

Nsp9PEDVproteins (Ponnusamy et al., 2008) (Zeng et al., 2018). However, the N-terminal interfacebstrand

in these homologs is known to be involved in interface re-organization of the subunits (Ponnusamy et al., 2008) and thus denotes other structural differences at this site.

DISCUSSION

Here we describe the structure of the recombinantly expressed Nsp9COV19as part of a global effort to characterize

the virus causing a current global pandemic. Nsp9 is important for virulence in SARS-CoV (Miknis et al., 2009). It remains to be understood whether Nsp9COV19plays a similar role in SARS-CoV-2; however, the 97% sequence

identity suggests a high degree of functional conservation. The CoV Nsp9 proteins are seemingly obligate dimers comprising a unique fold that associates via an unusuala-helical GxxxG interaction motif. The integrity of this

Figure 5. Sequence Conservation within Nsp9 Homologs

(A) Sequence alignment for viral Nsp9 proteins encoded by SARS-CoV-2, SARS-CoV, Human coronavirus 229E, and Porcine epidemic diarrhea virus. The extent of secondary structural elements observed in the 3C-Nsp9COV19structure is

shown and labeled above. The GxxxG motif residues are highlighted withpurpleand those making up the extraneous peptide binding site inpink.

(B) Cartoon-and-stick representation of the peptide-binding site observed in 3C-Nsp9COV19. The side chains of the

extraneous 3C residues on one side of the paired helical-interface are displayed with carbon atoms coloredpink. Nearby residues that make up the binding site are displayed and labeled and listed in the accompanying contacts table.

(9)

bound on all sides within the site situating themselves proximal to the conserved GxxxG motif. Coordina-tion of the 3C sequence induced changes within interfacing residues, serving to both restructure key struc-tural elements and cause a modest shift in subunit orientation.

At this stage it is unclear whether the bound residues within our structure have any bearing on the physi-ological function of Nsp9COV19. In the first instance this would seem unlikely; however, our sequence is that

of a rhinoviral 3C protease site and the SARS-CoV main protease cleaves consensus sequences following an LQ sequence (Zhu et al., 2011). The bound 3C-LE residues have hallmarks of the LQ motif and are proximal to the highly conserved GxxxG motif. There are no obvious structural features to preclude, or select for, a Glu to Gln substitution within our bound sequence. Notably the Mpro_{cleavage sequence occurs at multiple}

points throughout the CoV genome as the majority of viral proteins are released by its activity; thus, it re-mains possible that Nsp9 may associate with unprocessed viral polyproteins retaining them near the viral RNA. Within our structure Met-101 provides contacts with the bound valine side chain but the presence of Asn-33 nearby may also accommodate residues such as lysine at this position. Peptide-binding assays will need to be developed to rigorously assess if other sequences are preferred by this putative binding site. Indeed, Nsp9 may be part of the viral replication-transcriptase complex, so we may have serendipitously identified a protein-protein interaction interface for another viral or host protein.

In summary, we have established a protocol for the production and purification of SARS-CoV-2 Nsp9 protein. We determined the structure of the Nsp9COV19 and described the conservation of the

unique fold and dimerization interface identified previously for members of this protein family. We also determined the structure of Nsp9COV19in complex with a 3C sequence, although the significance of this

is yet to be established. The structures we describe here could potentially be utilized in drug screening and targeting experiments to disrupt a dimer interface known to be important for coronavirus replication.

Limitations of the Study

The identity of the peptide bound to Nsp9COV19raises follow-up questions that were not addressed within

this study. Namely, it remains to be determined whether a physiological peptide, either of the same sequence or a 3C peptide variant is also able to occupy this putative site. Further questions remain on whether this is a retention mechanism for pre-processed or post-processed polyproteins or another pro-tein altogether, and if so, what its binding affinity might be.

Resource Availability

Lead Contact

Further information and requests for resources and reagents should be directed to: jamie.rossjohn@ monash.edu.

Materials Availability

Plasmids generated in this study are available upon request.

Data and Code Availability

The accession number for the atomic coordinates of the apo-Nsp9COV19with 3C-Nsp9COV19and associated

diffraction data have been deposited at the protein databank (www.rcsb.org) with accession codes PDB: 6W9Q and PDB: 6WXD, respectively.

(10)

METHODS

All methods can be found in the accompanyingTransparent Methods supplemental file.

SUPPLEMENTAL INFORMATION

Supplemental Information can be found online athttps://doi.org/10.1016/j.isci.2020.101258.

ACKNOWLEDGMENTS

Funding for the work originated from the Australian Research Council Centre of Excellence for Advanced Molecular Imaging. This research was undertaken in part using the MX2 beamline at the Australian Syn-chrotron, part of ANSTO, and made use of the Australian Cancer Research Foundation (ACRF) detector. Additionally, we thank Dr. Geoffrey Kong of the Monash Molecular Crystallisation Facility for his assistance with crystallographic screening and optimization. We thank A. Riboldi-Tunnicliffe for assistance with data collection and J. Whisstock and G. Watson for advice on the manuscript.

AUTHOR CONTRIBUTIONS

D.R.L, B.S.G., and J.R. designed the project and wrote the manuscript. D.R.L. cloned, purified, and crystal-lized Nsp9Cov19and refined the structures. R.N.C. performed the RNA binding assays.

DECLARATION OF INTERESTS

The authors declare no conflict of interest.

This article containsSupplemental Informationonline.

Received: March 31, 2020 Revised: May 14, 2020 Accepted: June 5, 2020 Published: July 24, 2020 REFERENCES

Chen, J., Xu, X., Tao, H., Li, Y., Nan, H., Wang, Y., Tian, M., and Chen, H. (2017). Structural analysis of porcine reproductive and respiratory syndrome virus non-structural protein 7alpha (NSP7alpha) and identification of its interaction with NSP9. Front. Microbiol.8, 853.

Egloff, M.P., Ferron, F., Campanacci, V., Longhi, S., Rancurel, C., Dutartre, H., Snijder, E.J., Gorbalenya, A.E., Cambillau, C., and Canard, B. (2004). The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl. Acad. Sci. U S A101, 3792–3796. Frieman, M., Yount, B., Agnihothram, S., Page, C., Donaldson, E., Roberts, A., Vogel, L., Woodruff, B., Scorpio, D., Subbarao, K., et al. (2012). Molecular determinants of severe acute respiratory syndrome coronavirus pathogenesis and virulence in young and aged mouse models of human disease. J. Virol.86, 884–897. Hu, T., Chen, C., Li, H., Dou, Y., Zhou, M., Lu, D., Zong, Q., Li, Y., Yang, C., Zhong, Z., et al. (2017). Structural basis for dimerization and RNA binding of avian infectious bronchitis virus nsp9. Protein Sci.26, 1037–1048.

Miknis, Z.J., Donaldson, E.F., Umland, T.C., Rimmer, R.A., Baric, R.S., and Schultz, L.W. (2009). Severe acute respiratory syndrome coronavirus nsp9 dimerization is essential for efficient viral growth. J. Virol.83, 3007–3018.

Ponnusamy, R., Moll, R., Weimar, T., Mesters, J.R., and Hilgenfeld, R. (2008). Variable

oligomerization modes in coronavirus non-structural protein 9. J. Mol. Biol.383, 1081–1096. Sutton, G., Fry, E., Carter, L., Sainsbury, S., Walter, T., Nettleship, J., Berrow, N., Owens, R., Gilbert, R., Davidson, A., et al. (2004). The nsp9 replicase protein of SARS-coronavirus, structure and functional insights. Structure12, 341–353. Thanh Le, T., Andreadakis, Z., Kumar, A., Go´mez Roma´n, R., Tollefsen, S., Saville, M., and Mayhew, S. (2020). The COVID-19 vaccine development landscape. Nat. Rev. Drug Discov.19, 305–306. Wrapp, D., Wang, N., Corbett, K.S., Goldsmith, J.A., Hsieh, C.L., Abiona, O., Graham, B.S., and McLellan, J.S. (2020). Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science367, 1260–1263.

Yan, G., Liming, Y., Yucen, H., Fengjiang, L., Yao, Z., Lin, C., Tao, W., Qianqian, S., Zhenhua, M.,

Lianqi, Z., et al. (2020). Structure of RNA-dependent RNA polymerase from 2019-nCoV, a major antiviral drug target. bioRxiv.https://doi. org/10.1101/2020.03.16.993386.

Yang, H., Xie, W., Xue, X., Yang, K., Ma, J., Liang, W., Zhao, Q., Zhou, Z., Pei, D., Ziebuhr, J., et al. (2005). Design of wide-spectrum inhibitors targeting coronavirus main proteases. PLoS Biol.

3, e324.

Zeng, Z., Deng, F., Shi, K., Ye, G., Wang, G., Fang, L., Xiao, S., Fu, Z., and Peng, G. (2018). Dimerization of coronavirus nsp9 with diverse modes enhances its nucleic acid binding affinity. J. Virol.92, e00692-18.

Zhenming, J., Xiaoyu, D., Yechun, X., Yongqiang, D., Meiqin, L., Yao, Z., Bing, Z., Xiaofeng, L., Leike, Z., Chao, P., et al. (2020). Structure of Mpro 1 from COVID-19 virus and discovery of its inhibitors. bioRxiv.https://doi.org/10.1101/2020.02.26.964882.

Zhu, L., George, S., Schmidt, M.F., Al-Gharabli, S.I., Rademann, J., and Hilgenfeld, R. (2011). Peptide aldehyde inhibitors challenge the substrate specificity of the SARS-coronavirus main protease. Antiviral Res.92, 204–212.

(11)

(12)

Synthetic cDNA for Nsp9

COV19

:

aat aat gaa ctg agt cct gtc N N E L S P V

gcg ctg cgt caa atg agt tgc gcc gcc ggt acg acc cag acc gca tgt act gac gac aac A L R Q M S C A A G T T Q T A C T D D N gct tta gcc tat tat aac acc aca aag ggg ggt cgt ttt gtt ctt gca ttg ctg tcg gat A L A Y Y N T T K G G R F V L A L L S D ttg cag gac ctg aaa tgg gct cgt ttt ccc aaa agc gac ggt act gga aca att tac acg L Q D L K W A R F P K S D G T G T I Y T gaa tta gag cca ccc tgt cgt ttc gtt aca gat acg ccc aag ggt ccc aag gtt aaa tac E L E P P C R F V T D T P K G P K V K Y tta tac ttc atc aag ggt ctt aac aat ctg aat cgc ggt atg gta ctg ggt tca ctg gcc L Y F I K G L N N L N R G M V L G S L A gcc aca gtt cgc ctt caa A T V R L Q

Cloning

DAY -5 - Primer design and ordering Determine the cDNA sequence of the constructs you want to make. Design primers based on this sequence aiming for a 68oC annealing temperature. Add the LIC-specific extensions “cagggacccggt” to the 5’ end of the fwd primer and “cgaggagaagcccggtta” to the 5’ end of the rev primer. For example:

forward cagggacccggtaataatgaactgagtcctgtc

reverse cgaggagaagcccggttattgaaggcgaactgtggcggc

Primers can be ordered from the Sigma Genosys website 0.025 ug quantity and desalting is sufficient. Delivery time can vary considerably allow plenty of time.

DAY0 - Growth induction

From glycerol stocks or a plate inoculate 250 -300 mL LB with 30ug/mL Kanamycin in a 1L flask with either DH5a or Nova Blue cells containing the pET-NKIb 3C/LIC empty plasmid. Grow to saturation overnight at 37oC 140rpm.

DAY1 – Vector cleavage

Perform a Qiagen-Midi or Maxi- prep of the plasmid according to instructions. 1ug of cleaved plasmid will be enough for approximately 20 LIC reactions. At this stage carefully check the size of the plasmid on a 0.8% Agarose gel, the supercoiled plasmid should run just below the MIII Marker 3.5kbp band.

Cleave the vector with the appropriate restriction enzyme (based on the LIC vector you want to use). For example the reaction for 10ug would be something like:

Example:

Reagent Add

pET-NKIb 3C/LIC 10ug (e.g. 100uL of 100ng/uL stock) restriction enzyme 10uL of 1U/uL stock

10x buffer 14uL

(13)

Rev primer (100pmol/uL) 0.05-1.0 uL 5-100 pmol Pfu Stratagene polymerase (2.5U/uL) 1.0 uL 2.5U

MQ H2O Upto 100uL

Run with a cycle such as: 94oC for 5 minutes 94oC for 1 minute }

55oC for 1 minute } Repeat steps 30x 72oC for 2.5 minutes }

72oC for 10 minutes 10oC for ever

Remove 5 uL of the PCR reaction and mix with 3 uL of 6x Loading dye. Run on a 0.8% Agarose gel and check the size of the bands. Ensure that they are correct (usually by looking for slight movements up/down compared to nearby samples).

Perform a Qiagen PCR-cleanup of the PCR reaction mixtures providing there is a single band only. If two or more bands occur a gelextraction may be necessary, but if they are clearly distinct these can be more easily separated by performing a larger number of PCRcolony screen

DAY2 – T4 Polymerase treatment

Treat the cut-vector and PCR’d insert with T4 polymerase in 1.5mL eppendorf tubes. The number of pg/pmol = (#of bp)x650. So for a 1000bp PCR-product we have 1000x650 = 650 000 pg/pmol = 650 ng/pmol which means for 0.2pmol of insert we require:

650ng/pmol x 0.2pmol = 130ng

normal yields of Qiagen miniprep are 20ng/uL – 100ng/uL so will require 8uL-1uL of PCR product.

Reagent Add PCR insert 0.2 pmol 10x buffer (NEBuffer #2) 2 uL 25 mM dATP 2 uL T4 polymerase (NEB 3U/uL) 0.8 uL MQ H2O Upto 20 uL

Reagent Add Cut, gel-extracted vector 0.2 pmol 10x buffer (NEBuffer #2) 2 uL 25 mM dTTP 2 uL T4 polymerase (NEB 3U/uL) 0.8 uL MQ H2O Upto 20 uL

1)Incubate at 22oC for 30 minutes.

2)Denature the T4 polymerase by then incubating the reaction at 75oC for 10-20 minutes. 3) Spin down the evaporation.

(14)

1) Add 10-15 uL of Nova Blue competent cells to 2ul of conjoined vector/insert mix.(Store the other 2ul at -20oC). Include a T4 treated vector-only sample to assess background. 2) Incubate on ice for 30 minutes.

3)Heat shock at 42oC for 30-40 seconds. 4)Incubate on ice for 2 minutes.

5)Add 250 uL of SOC media or LB without antibiotic. 6)Allow the cells to recover by incubating at 37oC, 1400rpm.

7)Plate out on the appropriate antibiotic for the vector, leave overnight at 37oC. DAY3 – Colony screening.

If colonies have grown (colonies take closer to 20 hours to grow) perform a PCR-screen using the sequencing primers appropriate for the vector (e.g. T7 promoter/ T7 terminator primers for the pET vector). Setup a Taq sequencing mix such as:

Reagent For 1 reaction For 25 reactions 10 x invitrogen Taq buffer ( - Mg2+) 2 uL 50 uL 50 mM invitrogen MgCl2 1 uL 20 uL 10 mM dNTPs 0.4 uL 10 uL Fwd primer (10 pmol/uL) 0.3 uL 7.5 uL Rev primer (10 pmol/uL) 0.3 uL 7.5 uL Invitrogen Taq polymerase (5U/uL) 0.15 uL 3.75 uL MQ H2O Upto 20uL 401.25 uL

1)Label 2 or 3 bacterial colonies from each samples plate.

2)Aliquot 20 uL of the master PCR mix into each well of a 96-well thermowell tray.

3)Take a sterile P10 tip and scrape half of each bacterial colony and put this into the 20 uL. Leave the tip in the tray while going onto the next sample.

4)After scraping all colonies use a pipette to mix briefly and seal the reactions. 5) Run using a program such as:

96oC for 10 minutes 96oC for 1 minute | 55oC for 1 minute | Repeat steps 25x 68oC for 1 minutes/kbp | 68oC for 10 minutes 10oC for ever 6) When finished add loading dye to each PCR reaction and run half on a 0.8%-1.0% Agarose gel. Check the size of each band (remember the sequencing primers will add approx 170 extra bp compared to the original reaction). If any samples are missing colonies go back and rescreen.

7) Inoculate overnights from the remaining half-spot of the positive colonies. DAY4 – Plasmid minipreps.

From the overnights purify plasmids by either using Qiagen mini-preps or in a 96-well block format. Transform into an expression cell-line like BL21 (DE3) or Rosetta (DE3).

Protein purification

The plasmid was transformed into

E. coli

BL21(DE3) cells which were grown in Luria Broth at 37°C until reaching an

Absorbance at 600nm of ~1.0 before being induced with 0.5mM

Isopropyl β- d-1-thiogalactopyranoside for 4 hours

.

Cells were harvested in 20mM HEPES pH 7.0, 150mM NaCl, 20mM Imidazole, 2mM MgCl

2

and 0.5mM TCEP and frozen

until required.

Lysis was achieved by sonicating the cells in the presence of 1mg of Lysozyme and 1mg of DNAase on ice.

The lysate was then cleared by centrifugation at 10,000xg for 20 minutes and loaded onto a nickel affinity column.

Bound protein was washed extensively with 20 column volumes of 20mM HEPES pH 7.0, 150mM NaCl, 0.5mM TCEP before

being eluted in the same buffer with the addition of 400mM Imidazole.

For His-tag removal samples were incubated with precision 3c protease overnight at 4°C.

All samples were subjected to gel filtration (S75 16/60; GE Healthcare) in 20mM HEPES pH 7.0, 150mM NaCl before being

concentrated to 50mg/mL for crystallization trials.

Crystallisation

(15)

(IDT, USA) in assay buffer (20mM HEPES pH 7.0, 150mM NaCl, 2mM MgCl

2

) at room temperature.

The assay was performed in 96-well non-binding black plates (Greiner Bio-One), with fluorescence anisotropy measured in

triplicate using the PHERAstar FS (BMG) with FP 488-520-520 nm filters.

The data was corrected using the anisotropy of RNA sample alone, then fitted by a one-site binding model using the Equation,

A = (Amax [L])/(KD+[L]), where A is the corrected fluorescence anisotropy; Amax is maximum binding fluorescence

anisotropy signal, [L] is the Nsp9

COV19

concentration, and

KD

is the dissociation equilibrium constant.

Amax and

KD

were used as fitting parameters and nonlinear regression was performed using GraphPad Prism. Measurements

were taken after 60 minutes incubation between protein and RNA.

Methods References

Aragao, D., Aishima, J., Cherukuvada, H., Clarken, R., Clift, M., Cowieson, N.P., Ericsson, D.J., Gee, C.L., Macedo, S., Mudie, N., et al. (2018). MX2: a high-flux undulator microfocus beamline serving both the chemical and macromolecular crystallography communities at the Australian Synchrotron. J Synchrotron Radiat 25, 885-891.

Egloff, M.P., Ferron, F., Campanacci, V., Longhi, S., Rancurel, C., Dutartre, H., Snijder, E.J., Gorbalenya, A.E., Cambillau, C., and Canard, B. (2004). The severe acute respiratory syndrome coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc Natl Acad Sci U S A 101, 3792-3796.

Kabsch W. (2010) XDS. Acta Cryst. D66, p133-144.

Casanal, A., Lohkamp, B., and Emsley, P. (2019). Current developments in Coot for macromolecular model building of Electron Cryo-microscopy and Crystallographic Data. Protein Sci.

Liebschner, D., Afonine, P.V., Baker, M.L., Bunkoczi, G., Chen, V.B., Croll, T.I., Hintze, B., Hung, L.W., Jain, S., McCoy, A.J., et al. (2019). Macromolecular structure determination using Xrays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol 75, 861-877.