Next Article in Journal
Alphaherpesvirus gB Homologs Are Targeted to Extracellular Vesicles, but They Differentially Affect MHC Class II Molecules
Next Article in Special Issue
Molecular Investigation of SARS–CoV-2 Proteins and Their Interactions with Antiviral Drugs
Previous Article in Journal
In Silico Analysis of Genetic Diversity of Human Hepatitis B Virus in Southeast Asia, Australia and New Zealand
Previous Article in Special Issue
Virology, Epidemiology, Pathogenesis, and Control of COVID-19
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Molecular Mechanism of Evolution and Human Infection with SARS-CoV-2

Institute of Biophysics, School of Physics, Huazhong University of Science and Technology, Wuhan 430074, China
*
Authors to whom correspondence should be addressed.
Viruses 2020, 12(4), 428; https://doi.org/10.3390/v12040428
Submission received: 5 March 2020 / Revised: 31 March 2020 / Accepted: 5 April 2020 / Published: 10 April 2020
(This article belongs to the Special Issue Pathogenesis of Human and Animal Coronaviruses)

Abstract

:
The outbreak of a novel coronavirus, which was later formally named the severe acute respiratory coronavirus 2 (SARS-CoV-2), has caused a worldwide public health crisis. Previous studies showed that SARS-CoV-2 is highly homologous to SARS-CoV and infects humans through the binding of the spike protein to ACE2. Here, we have systematically studied the molecular mechanisms of human infection with SARS-CoV-2 and SARS-CoV by protein-protein docking and MD simulations. It was found that SARS-CoV-2 binds ACE2 with a higher affinity than SARS-CoV, which may partly explain that SARS-CoV-2 is much more infectious than SARS-CoV. In addition, the spike protein of SARS-CoV-2 has a significantly lower free energy than that of SARS-CoV, suggesting that SARS-CoV-2 is more stable and may survive a higher temperature than SARS-CoV. This provides insights into the evolution of SARS-CoV-2 because SARS-like coronaviruses have originated in bats. Our computation also suggested that the RBD-ACE2 binding for SARS-CoV-2 is much more temperature-sensitive than that for SARS-CoV. Thus, it is expected that SARS-CoV-2 would decrease its infection ability much faster than SARS-CoV when the temperature rises. These findings would be beneficial for the disease prevention and drug/vaccine development of SARS-CoV-2.

1. Introduction

Coronaviruses (CoVs) are a group of enveloped positive-stranded RNA viruses that can cause respiratory, intestinal, and central nervous system infections in humans and animals [1]. Until 2019, six strains of coronaviruses that are able to infect humans had been identified [1,2]. Among them, four human coronaviruses, including HCoV-OC43, HCoV-229E, HCoV-NL63, and HCoVHKU1, are not highly pathogenic and only cause mild respiratory diseases [1]. However, two other coronaviruses, the severe acute respiratory syndrome coronavirus (SARS-CoV) [3,4,5,6] and the Middle East respiratory syndrome coronavirus (MERS-CoV) [7,8], have caused two large-scale pandemics and resulted in more than 8000 cases, including nearly 800 related deaths and about 2500 cases, including about 860 related deaths, respectively. The outbreaks of SARS-CoV and MERS-CoV indicated that some coronaviruses can be highly pathogenic when they transmit from animals to humans [9,10,11]. Therefore, it is urgent to develop antiviral treatments or vaccines targeting such high-risk coronaviruses like SARS-CoV and MERS-CoV.
Before efficient antiviral drugs or vaccines are developed for SARS-CoV or MERS-CoV, another outbreak of pneumonia (COVID-19) caused by a novel coronavirus (SARS-CoV-2) has emerged in Wuhan since December 2019 [12,13,14,15,16,17,18,19]. As of 30 March 2020, SARS-CoV-2 has caused 693,224 confirmed cases, including 33,106 related deaths globally [20]. The full-length genome sequence of SARS-CoV-2 was soon determined by the Zhang group [21]. It was revealed that SARS-CoV-2 has a probable bat origin and is 96% identical at the whole-genome level to a bat SARS-like coronavirus [22]. In addition, SARS-CoV-2 is also closely related to other SARS-like coronaviruses and shares a 79.5% sequence identity to SARS-CoV [22]. For some encoded proteins like coronavirus main proteinase (3CLpro), papain-like protease (PLpro), and RNA-dependent RNA polymerase (RdRp), the sequence identity is even higher and can be as high as 96% between SARS-CoV-2 and SARS-CoV [23]. Therefore, it was thought that SARS-CoV-2 would function in a similar way to SARS-CoV in the human-infection and pathogenic mechanism [22,23,24].
Coronaviruses use the surface spike (S) glycoprotein on the coronavirus envelope to attach host cells and mediate host cell membrane and viral membrane fusion during infection [25]. The spike protein includes two regions, S1 and S2, where S1 is for host cell receptor binding and S2 is for membrane fusion. The S1 region also includes an N-terminal domain (NTD) and three C-terminal domains (CTD1, CTD2, and CTD3) [26]. For SARS-CoV, the receptor binding domain (RBD) is located in the CTD1 of the S1 region. SARS-CoV attaches the human host cells through the binding of the RBD protein to the angiotensin-converting enzyme II (ACE2) [27,28]. Therefore, the interaction between RBD and ACE2 is a prerequisite for the human infection with SARS-CoV. Given the high homology between SARS-CoV and SARS-CoV-2, it was expected that SARS-CoV-2 would also use the ACE2 molecule as the receptor to enter human cells [24,29]. This hypothesis was further experimentally confirmed by the virus infectivity studies from the Shi group, in which SARS-CoV-2 is able to use the ACE2 proteins from humans, Chinese horseshoe bats, and civet as an entry receptor in the ACE2-expressing cells, but not cells without ACE2 [22]. Xu et al. have used MOE to calculate the binding free energies between the Spike-RBD protein and human ACE2, showing that the binding free energy between SARS-CoV-2 RBD and human ACE2 was 50.6 kcal/mol, whereas that between SARS-CoV RBD and human ACE2 was 78.6 kcal/mol [24]. Very recently, the Cryo-EM structure of the SARS-CoV-2 spike protein in the prefusion conformation has been determined [30]. The biophysical and structural evidence suggested that SARS-CoV-2 may bind ACE2 with a much higher affinity than SARS-CoV [30].
Although it has been clear that SARS-CoV-2 infects human cells through the binding of the RBD domain to the human ACE2 receptor [30,31,32,33,34], the molecular mechanism of the binding between the RBD protein and the ACE2 receptor is still unknown. Many questions remain to be answered. For example, previous simulations showed that the binding affinity between SARS-CoV-2 and human ACE2 is weaker than that between SARS-CoV and human ACE2 [24]. However, in reality, SARS-CoV-2 has resulted in many more cases and seems to be more infectious than SARS-CoV. In this study, we have extensively investigated the spike protein/human ACE2 protein systems of SARS-CoV-2 and SARS-CoV by using protein-protein docking and molecular dynamics (MD) simulations. Specifically, we have extensively studied the free energies and dynamics of RBD-ACE2 binding, spike protein, and free RBD protein systems. Given that the spike protein is not only a potential drug target but also the virus antigen [35], the present study will be beneficial for the drug design, vaccine development, and disease prevention of SARS-CoV-2.

2. Materials and Methods

2.1. Structure Preparation

In this study, we have investigated the RBD-ACE2 complex, spike protein, and free RBD systems of SARS-CoV (GenBank ID: NP_828851.1) and SARS-CoV-2 (GenBank ID: MN908947.3). For SARS-CoV, the RBD-ACE2 complex structure was directly downloaded from the Protein Data Bank (PDB entry: 3SCI) [36]. Then, all the water molecules were removed from the complex structure. The free RBD structure was obtained by removing the ACE2 protein from the RBD-ACE2 complex. The structure of the trimeric spike protein of SARS-CoV was also downloaded from the PDB (PDB entry: 6ACD) [26]. For SARS-CoV-2, the three dimensional (3D) RBD structure was modeled based on the RBD structure of SARS-CoV using the MODELLER program [37], where the sequence alignment was performed using the ClustalW program [38,39]. The complex structure between the SARS-CoV-2 RBD protein and human ACE2 was then predicted by our protein-protein docking approach [40,41,42]. The 3D structure of the trimeric spike protein for SARS-CoV-2 was constructed based on the spike protein structure of SARS-CoV (PDB entry: 6ACD) using MODELLER.

2.2. Protein-Protein Docking

The complex structure between the SARS-CoV-2 RBD protein and the human ACE2 molecule was predicted using our hybrid protein-protein docking algorithm, HDOCK [40,41,42,43]. Specifically, given the individual structures of the SARS-CoV-2 RBD protein and the human ACE2 molecule, HDOCK globally samples all possible binding modes between the two proteins through a fast Fourier transform (FFT)-based search strategy [42]. Then, all the sampled binding modes were evaluated by our iterative knowledge-based scoring function ITScorePP [44]. Last, the binding modes were ranked according to their binding energy scores, and the top ten binding modes were provided to users. During the docking calculation, all the default parameters were used. Namely, the grid spacing was set to 1.2 Å for 3D translational search, the angle interval was set to 15 for rotational sampling in 3D Euler space, and the binding interface information in the PDB was automatically applied during the template-based modeling of individual structures. A web server version of our HDOCK algorithm can be freely accessed from our web site at http://hdock.phys.hust.edu.cn/ [41].

2.3. MD Simulations

The AMBER suite was used for the MD simulations [45]. Before the simulations, the missing residues in the middle of a chain were added using the MODELLER program [37]. During the simulations, the ff14SB force field was selected [46], an explicit solvent model was used, the time step was set to 2 fs, Langevin dynamics were used for temperature control, and the program “pmemd.cuda” was used as the simulation engine, where all the simulations were performed on a GPU compute node [47]. Specifically, for each system, the following four stages of MD simulations were performed before the production simulation: (1) A 1000-step simulation was first run to minimize the solvated protein system with weakly restraints on the backbone atoms; (2) The system was then heated to 300 K by a 25,000-step (i.e., 50 ps) simulation with weakly restraints on the backbone atoms; (3) Next, another 25,000-step (i.e., 50 ps) constant pressure simulation was conducted to equilibrating the density of the system at 300 K; (4) the system was then equilibrated by a 250,000-step (i.e., 500 ps) of constant pressure simulation at 300 K. Finally, two 2,500,000-step production simulations were run to record the trajectories of the system at 300 K, where the coordinates were written out every 5000 steps (i.e., 10 ps), resulting in a total of 10 ns simulation with 1000 recorded trajectories. After the simulations, the “MMPBSA.py” was used to calculate the free energies of the systems through the MM-GBSA model [48], and the “cpptraj” was used to analyze the coordinate trajectories [49]. To check the statistical significance of the difference between two means, a t-test was performed to obtain the P value using two average free energies and their standard deviations.

3. Results

3.1. The RBD-ACE2 Docking

The RBD proteins of SARS-CoV-2 and SARS-CoV exhibit a high sequence similarity (89.2%) with a sequence identity of 73.7%. The high homology resulted in an accurate 3D RBD model of SARS-CoV-2 with a small RMSD of 0.55 Å from the experimental SARS-CoV RBD structure. With the SARS-CoV-2 RBD model and experimental human ACE2 structure, we then performed protein-protein docking to predict their binding mode using our HDOCK approach [40,41,42]. Figure 1A shows the predicted complex structure between the human ACE2 molecule and the SARS-CoV-2 RBD protein. It can be seen from the figure that the predicted SARS-CoV-2 RBD-ACE2 complex structure is very close to the experimentally determined SARS-CoV RBD-ACE2 complex structure. The interface root mean square deviation (RMSD) between the two complexes is only 0.473 Å, demonstrating that the RBDs of SARS-CoV-2 and SARS-CoV bind to the same site of the human ACE2 receptor (Figure 1A). These results can also be understood by comparing the residues at the RBD-ACE2 binding interface for SARS-CoV-2 and SARS-CoV. The binding sites on the RBD proteins of SARS-CoV-2 and SARS-CoV are very conserved. The corresponding residues show a high sequence similarity of 83.3% (Figure 1B,C). Among them, the hydrophobic residues, which are important for protein-protein interactions, are especially conserved. For the RBD of SARS-CoV-2, there are 13 hydrophobic residues at the binding site, which are comparable to 13 hydrophobic residues for that of SARS-CoV (Figure 1B).
Recently, the CryoEM structure of the SARS-CoV-2 RBD-ACE2 complex was determined by the Zhou group [31]. Comparing the experimental structure and our predicted model showed that the two complexes are close to each other and have a small interface RMSD of 1.108 Å (Figure 2A). The interface difference is mainly due to the conformational changes in the RBD, especially around the loop of residues 475–488 (Figure 2B), while the backbone RMSD for the two ACE2 monomers is only 0.965 Å. If the flexible loop of residues 475–488 in the RBD was excluded, the interface RMSD between the predicted and experimental complexes would be as small as 0.774 Å. These results suggested the accuracy of our predicted SARS-CoV-2 RBD-ACE2 complex structure.

3.2. The RBD-ACE2 Interaction: SARS-CoV-2 Binds ACE2 with Higher Affinity than SARS-CoV

We have run a long-time MD simulation to generate the trajectories of the RBD-ACE2 complex system for SARS-CoV-2 and SARS-CoV. The binding free energies were calculated using the MM-GBSA model by the “MMPBSA.py” script in the AMBER package. Table 1 shows a comparison of the RBD-ACE2 binding free energies for SARS-CoV-2 and SARS-CoV. It can be seen from the table that the binding free energy of the SARS-CoV-2 RBD-ACE2 interaction is 50.43 kcal/mol, which is significantly lower than the binding free energy of the SARS-CoV RBD-ACE2 interaction ( 36.75 kcal/mol). In other words, SARS-CoV-2 binds human ACE2 with a significantly higher affinity than SARS-CoV. Very recently, an experiment study also suggested that SARS-CoV-2 could bind human ACE2 with a higher affinity than SARS-CoV [30]. This may provide one of the possible reasons why SARS-CoV-2 is much more infectious than SARS-CoV, though other factors like human activities and pathogen persistence can also have a critical impact on the spread of SARS-CoV-2. Further examination of the binding free energy contributions reveals that the higher binding affinity of SARS-CoV-2 than SARS-CoV is mostly attributed to the solvation energy contribution Δ G solv (674.97 vs. 696.56 kcal/mol), whereas SARS-CoV-2 has a higher binding free energy in vacuum Δ G gas than SARS-CoV (−725.41 vs. −733.31). In other words, SARS-CoV-2 tends to bind human ACE2 better than SARS-CoV in the water, while SARS-CoV would bind to human ACE2 better than SARS-CoV-2 in the gas. Further investigation will be needed to reveal the impact of such binding preferences on human infection.

3.3. The Spike Protein: SARS-CoV-2 Is More Stable than SARS-CoV

The spike protein on the coronavirus envelope is a trimeric protein. This protein is critical for the vitality of coronaviruses because it is not only an important component for the virus particle but also plays a crucial role in attaching host cells and fusing the membranes [26]. In addition, the spike protein also determines the solubility of coronavirus particles and thus the viral infectivity because the spike protein is the largest protein located on the coronavirus envelope surface. Therefore, the spike protein is directly related to the stability and functionality of coronaviruses. Here, we have run a lengthy MD simulation to study the trimeric spike proteins of SARS-CoV-2 and SARS-CoV.
Table 2 gives a comparison between the free energies of the spike proteins for SARS-CoV-2 and SARS-CoV. It can be seen from the table that the spike protein of SARS-CoV-2 has a significantly lower total free energy ( G total = 67,303.28 kcal/mol) than the spike protein of SARS-CoV ( G total = 63,139.96 kcal/mol) (Table 2). These results suggest that SARS-CoV-2 is more stable and able to survive a significantly higher temperature than SARS-CoV. This may partly explain the higher infection ability of SARS-CoV-2 than SARS-CoV because SARS-CoV-2 would have a higher persistence than SARS-CoV at the same temperature.
The lower free energy of the SARS-CoV-2 spike protein may result from the virus evolution or adaption to hosts because SARS-like coronaviruses normally originate from bats that are known to have a higher body temperature than humans [50,51,52]. In other words, SARS-CoV-2 and other SARS-like coronaviruses would have evolved to achieve a lower free energy for their spike proteins by recombination or mutations so that they can survive in high-temperature animals like bats [1]. In addition, the free energy decomposition also shows that the lower free energy of SARS-CoV-2 spike protein than SARS-CoV spike protein is mainly attributed to the free energy in a vacuum G gas (−36,405.44 vs. −32,053.43 kcal/mol), whereas their solvation energies G solv are more comparable (−30,897.84 vs. −31,086.53 kcal/mol) (Table 2). This may reflect an evolution trend of SARS-like coronaviruses, i.e., favoring the internal interactions between residues instead of the solvation energy. This kind of evolution would also be beneficial for the persistence of coronaviruses because such kinds of coronaviruses with lower internal energy would be more robust and able to survive in both the air and solvent.

3.4. The Free RBD: SARS-CoV-2 Is More Temperature-Sensitive than SARS-CoV

Coronaviruses use the spike protein to attach host cells by binding the host cell receptor. Therefore, the receptor binding domain (RBD) of the spike protein is critical for coronaviruses to infect host cells. Here, we have run lengthy MD simulations to investigate the dynamic properties of the RBD proteins for SARS-CoV-2 and SARS-CoV. Table 3 shows a comparison between the free energies of the free RBD proteins for SARS-CoV-2 and SARS-CoV. Similar to the findings in the spike protein as detailed in the last section (Table 2), the RBD protein of SARS-CoV-2 also shows a significantly lower free energy than that of SARS-CoV ( 4090.04 vs. 3617.73 kcal/mol) (Table 3), which may also be understood by the evolution pressure on coronaviruses due to their high-temperature host environment. However, unlike in the spike protein where the free energy difference between SARS-CoV-2 and SARS-CoV is mostly attributed the inter-residue interactions in a vacuum ( G gas ), here in the RBD protein, the free energy difference between SARS-CoV-2 and SARS-CoV comes from both the free energies in vacuum G gas ( 2104.37 vs. 1703.66 kcal/mol) and solvation energy G solv ( 1985.68 vs. 1914.07 kcal/mol) (Table 3). The lower solvation energy of SARS-CoV-2 RBD than SARS-CoV RBD may be understood because the RBD must move up away from the spike protein and into the water in order to bind human ACE2 [27]. In other words, SARS-CoV-2 would have evolved to be more soluble so that it can move up and bind the ACE2 more easily. The better solubility of SARS-CoV-2 RBD than SARS-CoV RBD may also contribute to part of the higher infection ability of SARS-CoV-2 than SARS-CoV.
Protein flexibility is a critical factor in binding as it may not only change the binding interface between two proteins but also be an important contribution to the entropy penalty upon binding. Therefore, we have investigated the protein flexibility of the RBD domains for SARS-CoV-2 and SARS-CoV by analyzing their coordinate trajectories. Figure 3 shows two ensembles of selected trajectories over a period of 10 ns simulations for the RBD proteins of SARS-CoV-2 and SARS-CoV. The figure also gives a comparison of the root mean square fluctuations (RMSF), a rough measurement of protein flexibility, for the RBDs of SARS-CoV-2 and SARS-CoV. It can be seen from the figure that the RBD of SARS-CoV-2 shows a significantly higher RMSF than that of SARS-CoV. In other words, the SARS-CoV-2 RBD is much more flexible than the SARS-CoV RBD. The flexibility is especially higher in the loop of residues 470–490 near the binding site than in other regions (Figure 3A). That means that SARS-CoV-2 must overcome much more entropy penalty than SARS-CoV when binding to human ACE2. As we know, the binding free energy between two proteins can be expressed as, Δ G = Δ E T Δ S , where Δ E is the interaction energy, Δ S is the entropy loss upon binding, and T is the temperature of the system. As Δ S is negative here, the binding free energy will become higher and the binding will become weaker with the increasing temperature. Therefore, the RBD-ACE2 binding affinity for SARS-CoV-2 is expected to decrease much faster than that for SARS-CoV when the temperature increases. In other words, SARS-CoV-2 is much more temperature-sensitive than SARS-CoV in terms of RBD-ACE2 binding. Namely, SARS-CoV-2 will decrease its infection ability much faster than SARS-CoV when the temperature rises. Therefore, it is expected that SARS-CoV-2 would become less infectious compared to SARS-CoV, and the disease prevention and control for SARS-CoV-2 may get easier when the weather gets warmer/hotter, although the drug and vaccine development targeting the RBD protein may be more challenging because of the high protein flexibility near the binding site.

4. Conclusions

Previous studies showed that SARS-CoV-2 is highly homologous to human SARS-CoV and attaches host cells through the binding of the spike protein to the angiotensin-converting enzyme II (ACE2). However, the molecular mechanisms of SARS-CoV-2 binding to human ACE2 and evolution of SARS-CoV-2 remain unclear. In this study, we have extensively studied the RBD-ACE2 complex, spike protein, and free RBD protein systems of SARS-CoV-2 and SARS-CoV through protein-protein docking and MD simulations. It was found that SARS-CoV-2 can bind human ACE2 with a higher binding affinity than SARS-CoV, which may partly explain that SARS-CoV-2 is much more infectious than SARS-CoV. The spike protein of SARS-CoV-2 shows a lower free energy than that of SARS-CoV, suggesting that SARS-CoV-2 may be more stable and able to survive a higher temperature than SARS-CoV. This may also provide insights into the bat origin of SARS-CoV-2, as bats have a higher body-temperature than humans. In addition, the SARS-CoV-2 RBD exhibits a significantly higher flexibility than SARS-CoV RBD, especially near the binding site. That means that SARS-CoV-2 must overcome a higher entropy penalty in order to bind ACE2 and is thus more temperature-sensitive than SARS-CoV in human infection. Therefore, with the rising temperature, SARS-CoV-2 is expected to decrease its infection ability much faster and become much less infectious than SARS-CoV, which would make the disease prevention and control of SARS-CoV-2 easier. From the above results together, one may infer that unlike SARS-CoV, which was gone after 2003, SARS-CoV-2 might survive a high-temperature environment like Summer in which the virus is not active/infectious due to the high flexibility in the RBD, and then become infectious again when the temperature is low in the Winter. These results may have a far-reaching implication for the disease prevention and control as well as drug and vaccine development for SARS-CoV-2.

Author Contributions

S.-Y.H. and Y.X. designed the research; J.H., H.T., Y.Y., and S.-Y.H. performed the research. S.-Y.H. and Y.X. analyzed the data; J.H., H.T., Y.Y., S.-Y.H., and Y.X. wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant No. 31670724) and the startup grant of Huazhong University of Science and Technology.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (grant No. 31670724) and the startup grant of Huazhong University of Science and Technology.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cui, J.; Li, F.; Shi, Z.L. Origin and evolution of pathogenic coronaviruses. Nat. Rev. Microbiol. 2019, 17, 181–192. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Chen, Y.; Liu, Q.; Guo, D. Emerging coronaviruses: Genome structure, replication, and pathogenesis. J. Med. Virol. 2020, 92, 418–423. [Google Scholar] [CrossRef]
  3. Zhong, N.S.; Zheng, B.J.; Li, Y.M.; Poon, L.L.; Xie, Z.H.; Chan, K.H.; Li, P.H.; Tan, S.Y.; Chang, Q.; Xie, J.P.; et al. Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People’s Republic of China, in February, 2003. Lancet 2003, 362, 1353–1358. [Google Scholar] [CrossRef] [Green Version]
  4. Ksiazek, T.G.; Erdman, D.; Goldsmith, C.S.; Zaki, S.R.; Peret, T.; Emery, S.; Tong, S.; Urbani, C.; Comer, J.A.; Lim, W.; et al. A novel coronavirus associated with severe acute respiratory syndrome. N. Engl. J. Med. 2003, 348, 1953–1966. [Google Scholar] [CrossRef] [PubMed]
  5. Drosten, C.; Günther, S.; Preiser, W.; Van Der Werf, S.; Brodt, H.R.; Becker, S.; Rabenau, H.; Panning, M.; Kolesnikova, L.; Fouchier, R.A.; et al. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 2003, 348, 1967–1976. [Google Scholar] [CrossRef]
  6. Peiris, J.S.; Lai, S.T.; Poon, L.L.; Guan, Y.; Yam, L.Y.; Lim, W.; Nicholls, J.; Yee, W.K.; Yan, W.W.; Cheung, M.T.; et al. Coronavirus as a possible cause of severe acute respiratory syndrome. Lancet 2003, 361, 1319–1325. [Google Scholar] [CrossRef] [Green Version]
  7. Zaki, A.M.; van Boheemen, S.; Bestebroer, T.M.; Osterhaus, A.D.; Fouchier, R.A. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 2012, 367, 1814–1820. [Google Scholar] [CrossRef]
  8. De Groot, R.J.; Baker, S.C.; Baric, R.S.; Brown, C.S.; Drosten, C.; Enjuanes, L.; Fouchier, R.A.; Galiano, M.; Gorbalenya, A.E.; Memish, Z.A.; et al. Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group. J. Virol. 2013, 87, 7790–7792. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Lu, G.; Wang, Q.; Gao, G.F. Bat-to-human: spike features determining ‘host jump’ of coronaviruses SARSCoV, MERS-CoV, and beyond. Trends Microbiol. 2015, 23, 468–478. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Song, Z.; Xu, Y.; Bao, L.; Zhang, L.; Yu, P.; Qu, Y.; Zhu, H.; Zhao, W.; Han, Y.; Qin, C. From SARS to MERS, Thrusting Coronaviruses into the Spotlight. Viruses 2019, 11, 59. [Google Scholar] [CrossRef] [Green Version]
  11. Wong, A.C.P.; Li, X.; Lau, S.K.P.; Woo, P.C.Y. Global Epidemiology of Bat Coronaviruses. Viruses 2019, 11, 174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Gralinski, L.E.; Menachery, V.D. Return of the Coronavirus: 2019-nCoV. Viruses 2020, 12, 135. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Liu, S.-L.; Saif, L. Emerging Viruses without Borders: The Wuhan Coronavirus. Viruses 2020, 12, 130. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Jin, Y.; Yang, H.; Ji, W.; Wu, W.; Chen, S.; Zhang, W.; Duan, G. Virology, Epidemiology, Pathogenesis, and Control of COVID-19. Viruses 2020, 12, 372. [Google Scholar] [CrossRef] [Green Version]
  15. Chan, J.F.; Yuan, S.; Kok, K.H.; To, K.K.; Chu, H.; Yang, J.; Xing, F.; Liu, J.; Yip, C.C.; Poon, R.W.; et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: A study of a family cluster. Lancet 2020, 395, 514–523. [Google Scholar] [CrossRef] [Green Version]
  16. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef] [Green Version]
  17. Lu, R.; Zhao, X.; Li, J.; Niu, P.; Yang, B.; Wu, H.; Wang, W.; Song, H.; Huang, B.; Zhu, N.; et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet 2020, 395, 565–574. [Google Scholar] [CrossRef] [Green Version]
  18. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef] [Green Version]
  19. Li, Q.; Guan, X.; Wu, P.; Wang, X.; Zhou, L.; Tong, Y.; Ren, R.; Leung, K.S.; Lau, E.H.; Wong, J.Y.; et al. Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia. N. Engl. J. Med. 2020, 382, 1199–1207. [Google Scholar] [CrossRef]
  20. WHO. Coronavirus Disease (COVID-2019) Situation Reports, Report-70. Available online: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports (accessed on 30 March 2020).
  21. Wu, F.; Zhao, S.; Yu, B.; Chen, Y.M.; Wang, W.; Song, Z.G.; Hu, Y.; Tao, Z.W.; Tian, J.H.; Pei, Y.Y.; et al. A new coronavirus associated with human respiratory disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef] [Green Version]
  22. Zhou, P.; Yang, X.L.; Wang, X.G.; Hu, B.; Zhang, L.; Zhang, W.; Si, H.R.; Zhu, Y.; Li, B.; Huang, C.L.; et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 2020, 579, 270–273. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Liu, W.; Morse, J.S.; Lalonde, T.; Xu, S. Learning from the Past: Possible Urgent Prevention and Treatment Options for Severe Acute Respiratory Infections Caused by 2019-nCoV. Chembiochem 2020, 21, 730–738. [Google Scholar]
  24. Xu, X.; Chen, P.; Wang, J.; Feng, J.; Zhou, H.; Li, X.; Zhong, W.; Hao, P. Evolution of the novel coronavirus from the ongoing Wuhan outbreak and modeling of its spike protein for risk of human transmission. Sci. China Life Sci. 2020, 63, 457–460. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Li, F. Structure, Function, and Evolution of Coronavirus Spike Proteins. Annu. Rev. Virol. 2016, 3, 237–261. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Gui, M.; Song, W.; Zhou, H.; Xu, J.; Chen, S.; Xiang, Y.; Wang, X. Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding. Cell Res. 2017, 27, 119–129. [Google Scholar] [CrossRef] [PubMed]
  27. Song, W.; Gui, M.; Wang, X.; Xiang, Y. Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLoS Pathog. 2018, 14, e1007236. [Google Scholar] [CrossRef]
  28. Li, W.; Moore, M.J.; Vasilieva, N.; Sui, J.; Wong, S.K.; Berne, M.A.; Somasundaran, M.; Sullivan, J.L.; Luzuriaga, K.; Greenough, T.C.; et al. Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus. Nature 2003, 426, 450–454. [Google Scholar] [CrossRef] [Green Version]
  29. Wan, Y.; Shang, J.; Graham, R.; Baric, R.S.; Li, F. Receptor recognition by novel coronavirus from Wuhan: An analysis based on decade-long structural studies of SARS. J. Virol. 2020, 94, e00127-20. [Google Scholar] [CrossRef] [Green Version]
  30. Wrapp, D.; Wang, N.; Corbett, K.S.; Goldsmith, J.A.; Hsieh, C.-L.; Abiona, O.; Graham, B.S.; McLellan, J.S. Cryo-EM Structure of the 2019-nCoV Spike in the Prefusion Conformation. Science 2020, 367, 1260–1263. [Google Scholar] [CrossRef] [Green Version]
  31. Yan, R.; Zhang, Y.; Li, Y.; Xia, L.; Guo, Y.; Zhou, Q. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science 2020, 367, 1444–1448. [Google Scholar] [CrossRef] [Green Version]
  32. Lan, J.; Ge, J.; Yu, J.; Shan, S.; Zhou, H.; Fan, S.; Zhang, Q.; Shi, X.; Wang, Q.; Zhang, L.; et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature 2020. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Shang, J.; Ye, G.; Shi, K.; Wan, Y.; Luo, C.; Aihara, H.; Geng, Q.; Auerbach, A.; Li, F. Structural basis of receptor recognition by SARS-CoV-2. Nature 2020. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Srinivasan, S.; Cui, H.; Gao, Z.; Liu, M.; Lu, S.; Mkandawire, W.; Narykov, O.; Sun, M.; Korkin, D. Structural Genomics of SARS-CoV-2 Indicates Evolutionary Conserved Functional Regions of Viral Proteins. Viruses 2020, 12, 360. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Ahmed, S.F.; Quadeer, A.A.; McKay, M.R. Preliminary Identification of Potential Vaccine Targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies. Viruses 2020, 12, 254. [Google Scholar] [CrossRef] [Green Version]
  36. Burley, S.K.; Berman, H.M.; Bhikadiya, C.; Bi, C.; Chen, L.; Di Costanzo, L.; Christie, C.; Dalenberg, K.; Duarte, J.M.; Dutta, S.; et al. RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy. Nucleic Acids Res. 2019, 47, D464–D474. [Google Scholar] [CrossRef] [Green Version]
  37. Webb, B.; Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr. Protoc. Protein Sci. 2016, 86, 2.9.1–2.9.37. [Google Scholar] [CrossRef] [Green Version]
  38. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Soding, J.; et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 2011, 7, 539. [Google Scholar] [CrossRef]
  39. Larkin, M.A.; Blackshields, G.; Brown, N.P.; Chenna, R.; McGettigan, P.A.; McWilliam, H.; Valentin, F.; Wallace, I.M.; Wilm, A.; Lopez, R.; et al. Clustal W and Clustal X version 2.0. Bioinformatics 2007, 23, 2947–2948. [Google Scholar] [CrossRef] [Green Version]
  40. Yan, Y.; Wen, Z.; Wang, X.; Huang, S.-Y. Addressing recent docking challenges: A hybrid strategy to integrate template-based and free protein-protein docking. Proteins 2017, 85, 497–512. [Google Scholar] [CrossRef]
  41. Yan, Y.; Zhang, D.; Zhou, P.; Li, B.; Huang, S.-Y. HDOCK: A Web Server for Protein-Protein and Protein-DNA/RNA Docking Based on a Hybrid Strategy. Nucleic Acids Res. 2017, 45, W365–W373. [Google Scholar] [CrossRef]
  42. Yan, Y.; Huang, S.-Y. Pushing the accuracy limit of shape complementarity for protein-protein docking. BMC Bioinform. 2019, 20 (Suppl. 25), 696. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Yan, Y.; Tao, H.; He, J.; Huang, S.-Y. The HDOCK server for integrated protein-protein docking. Nat. Proc. 2020. [Google Scholar] [CrossRef]
  44. Huang, S.-Y.; Zou, X. An iterative knowledge-based scoring function for protein-protein recognition. Proteins 2008, 72, 557–579. [Google Scholar] [CrossRef] [PubMed]
  45. Case, D.A.; Betz, R.M.; Cerutti, D.S.; Cheatham, T.E., III; Darden, T.A.; Duke, R.E.; Giese, T.J.; Gohlke, H.; Goetz, A.W.; Homeyer, N.; et al. AMBER 2016. University of California: San Francisco, CA, USA. Available online: https://ambermd.org/ (accessed on 30 April 2016).
  46. Maier, J.A.; Martinez, C.; Kasavajhala, K.; Wickstrom, L.; Hauser, K.E.; Simmerling, C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput. 2015, 11, 3696–3713. [Google Scholar] [CrossRef] [Green Version]
  47. Salomon-Ferrer, R.; Gotz, A.W.; Poole, D.; Le Grand, S.; Walker, R.C. Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. J. Chem. Theory Comput. 2013, 9, 3878–3888. [Google Scholar] [CrossRef]
  48. Miller, B.R., 3rd; McGee, T.D., Jr.; Swails, J.M.; Homeyer, N.; Gohlke, H.; Roitberg, A.E. MMPBSA.py: An Efficient Program for End-State Free Energy Calculations. J. Chem. Theory Comput. 2012, 8, 3314–3321. [Google Scholar] [CrossRef]
  49. Roe, D.R.; Cheatham, T.E., 3rd. PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput. 2013, 9, 3084–3095. [Google Scholar] [CrossRef]
  50. Banerjee, A.; Kulcsar, K.; Misra, V.; Frieman, M.; Mossman, K. Bats and Coronaviruses. Viruses 2019, 11, 41. [Google Scholar] [CrossRef] [Green Version]
  51. Fan, Y.; Zhao, K.; Shi, Z.-L.; Zhou, P. Bat Coronaviruses in China. Viruses 2019, 11, 210. [Google Scholar] [CrossRef] [Green Version]
  52. Fagre, A.C.; Kading, R.C. Can Bats Serve as Reservoirs for Arboviruses? Viruses 2019, 11, 215. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Protein-protein docking with SARS-CoV-2 receptor binding domain (RBD) and human angiotensin-converting enzyme II (ACE2). (A) Structural comparison between the predicted SARS-CoV-2 RBD-ACE2 complex and the experimental SARS-CoV RBD-ACE2 structure (PDB code: 3SCI). SARS-CoV-2 RBD is colored magenta and its interacting ACE2 is colored blue. SARS-CoV RBD is colored green and its interacting ACE2 is colored red, respectively. (B) The binding site residues on the RBD that are within 5.0 Å form the ACE2, where the hydrophobic residues on the SARS-CoV-2 RBD are highlighted in red. (C) Part of the sequence alignment between the spike proteins of SARS-CoV-2 and SARS-CoV, where the RBD residues are highlighted in yellow and the residues at the binding site are highlighted in magenta, respectively.
Figure 1. Protein-protein docking with SARS-CoV-2 receptor binding domain (RBD) and human angiotensin-converting enzyme II (ACE2). (A) Structural comparison between the predicted SARS-CoV-2 RBD-ACE2 complex and the experimental SARS-CoV RBD-ACE2 structure (PDB code: 3SCI). SARS-CoV-2 RBD is colored magenta and its interacting ACE2 is colored blue. SARS-CoV RBD is colored green and its interacting ACE2 is colored red, respectively. (B) The binding site residues on the RBD that are within 5.0 Å form the ACE2, where the hydrophobic residues on the SARS-CoV-2 RBD are highlighted in red. (C) Part of the sequence alignment between the spike proteins of SARS-CoV-2 and SARS-CoV, where the RBD residues are highlighted in yellow and the residues at the binding site are highlighted in magenta, respectively.
Viruses 12 00428 g001
Figure 2. Structural comparison between the predicted and experimental SARS-CoV-2 RBD-ACE2 complexes. (A) Alignment of the predicted and experimental SARS-CoV-2 RBD-ACE2 complexes. (B) Alignment of the predicted and experimental RBD structures. The RBD and ACE2 of the predicted complex are colored pink and blue. The RBD and ACE2 of the experimental complex (PDB code: 6M17) are colored green and red.
Figure 2. Structural comparison between the predicted and experimental SARS-CoV-2 RBD-ACE2 complexes. (A) Alignment of the predicted and experimental SARS-CoV-2 RBD-ACE2 complexes. (B) Alignment of the predicted and experimental RBD structures. The RBD and ACE2 of the predicted complex are colored pink and blue. The RBD and ACE2 of the experimental complex (PDB code: 6M17) are colored green and red.
Viruses 12 00428 g002
Figure 3. The conformational flexibility of the RBD protein. (A) The 50 representative trajectories of the SARS-CoV-2 RBD protein over a 10 ns MD simulation, where the highly flexible region (residues 470–490) is indicated by a circle. (B) The 50 representative trajectories of the SARS-CoV RBD protein over a 10 ns MD simulation. (C) The RMSFs for the RBD proteins of SARS-CoV-2 and SARS-CoV, where the residue numbering is taken from the SARS-CoV-2 RBD and the data of SARS-CoV are then aligned to those of SARS-CoV-2 according to the sequence alignment.
Figure 3. The conformational flexibility of the RBD protein. (A) The 50 representative trajectories of the SARS-CoV-2 RBD protein over a 10 ns MD simulation, where the highly flexible region (residues 470–490) is indicated by a circle. (B) The 50 representative trajectories of the SARS-CoV RBD protein over a 10 ns MD simulation. (C) The RMSFs for the RBD proteins of SARS-CoV-2 and SARS-CoV, where the residue numbering is taken from the SARS-CoV-2 RBD and the data of SARS-CoV are then aligned to those of SARS-CoV-2 according to the sequence alignment.
Viruses 12 00428 g003
Table 1. The binding free energies calculated from molecular dynamics (MD) simulations for the RBD-ACE2 interactions of SARS-CoV-2 and SARS-CoV, where Δ G gas is the interaction energy change in the gas and Δ G solv is the solvation energy change upon binding in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of < 0.05 indicates that the difference is statistically significant (95% confidence interval).
Table 1. The binding free energies calculated from molecular dynamics (MD) simulations for the RBD-ACE2 interactions of SARS-CoV-2 and SARS-CoV, where Δ G gas is the interaction energy change in the gas and Δ G solv is the solvation energy change upon binding in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of < 0.05 indicates that the difference is statistically significant (95% confidence interval).
SARS-CoV-2SARS-CoVT-Test
AverageStd. Dev.AverageStd. Dev.Differencep-Value
Δ G gas −725.406631.2987−733.311333.81767.90470.0001
Δ G solv 674.972129.5988696.557229.3955−21.5851<0.0001
Δ G total −50.43455.5811−36.75417.4944−13.6804<0.0001
Table 2. The free energies calculated from MD simulations for the spike proteins of SARS-CoV-2 and SARS-CoV, where G gas is the interaction energy in the gas and G solv is the solvation energy in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of <0.05 indicates that the difference is statistically significant (95% confidence interval).
Table 2. The free energies calculated from MD simulations for the spike proteins of SARS-CoV-2 and SARS-CoV, where G gas is the interaction energy in the gas and G solv is the solvation energy in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of <0.05 indicates that the difference is statistically significant (95% confidence interval).
SARS-CoV-2SARS-CoVT-Test
AverageStd. Dev.AverageStd. Dev.Differencep-Value
G gas −36,405.4424294.4540−32,053.4298270.8693−4352.0126<0.0001
G solv −30,897.8370283.0603−31,086.5339236.9148188.6969<0.0001
G total −67,303.2794171.9868−63,139.9637199.4728−4163.3157<0.0001
Table 3. The free energies calculated from MD simulations for the RBD proteins of SARS-CoV-2 and SARS-CoV, where G gas is the interaction energy in the gas and G solv is the solvation energy in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of <0.05 indicates that the difference is statistically significant (95% confidence interval).
Table 3. The free energies calculated from MD simulations for the RBD proteins of SARS-CoV-2 and SARS-CoV, where G gas is the interaction energy in the gas and G solv is the solvation energy in the solvent, respectively. All units are reported in kcal/mol. The t-test was conducted to check the statistical significance of the difference between two sets of free energies. A p-value of <0.05 indicates that the difference is statistically significant (95% confidence interval).
SARS-CoV-2SARS-CoVT-Test
AverageStd. Dev.AverageStd. Dev.Differencep-Value
G gas −2104.368772.3143−1703.658675.9595−400.7101<0.0001
G solv −1985.675951.4409−1914.072656.7862−71.6033<0.0001
G total −4090.044640.6600−3617.731238.9307−472.3134<0.0001

Share and Cite

MDPI and ACS Style

He, J.; Tao, H.; Yan, Y.; Huang, S.-Y.; Xiao, Y. Molecular Mechanism of Evolution and Human Infection with SARS-CoV-2. Viruses 2020, 12, 428. https://doi.org/10.3390/v12040428

AMA Style

He J, Tao H, Yan Y, Huang S-Y, Xiao Y. Molecular Mechanism of Evolution and Human Infection with SARS-CoV-2. Viruses. 2020; 12(4):428. https://doi.org/10.3390/v12040428

Chicago/Turabian Style

He, Jiahua, Huanyu Tao, Yumeng Yan, Sheng-You Huang, and Yi Xiao. 2020. "Molecular Mechanism of Evolution and Human Infection with SARS-CoV-2" Viruses 12, no. 4: 428. https://doi.org/10.3390/v12040428

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop