EpiCurator: an immunoinformatic workflow to predict and prioritize SARS-CoV-2 epitopes

View article
Bioinformatics and Genomics

Main article text

 

Introduction

Materials & Methods

Genome retrieval and protein annotation

Spike protein comparison

Epitope prediction

Prediction of SARS-CoV-2 epitopes

Accurate selection of epitopes - EpiCurator

Prediction of epitope conservancy

Human homology

Epitope sequence overlap

Search for epitopes from published articles - EpiMiner

IEDB matching

Mutation screening

Epitope properties

Evaluation of antigenicity, toxicity, and immunogenic profile

Estimation of population coverage

Docking analysis of the HLA-epitope complex

Genomes for EpiCurator pairwise comparison validation

Multi-epitope construct and structural modelling for EpiCurator validation

Linear and secondary structure evaluation

Multi-epitope 3D structure modelling, refinement, and evaluation

Results

Linear epitopes prediction from SARS-CoV-2 genome

Quality assessment analysis of the EpiCurator

Properties of accurately selected epitopes

Epitope-specific RBD Spike as a baseline for validation of EpiCurator selection

Multi-epitope construct for in silico validation of RBD epitopes

Discussion

Conclusions

Supplemental Information

Schematic Presentation of the final multi-epitope construct

The multi-epitope structure is constructed by 11 subunits (blue subunits represents HLA class I epitopes, purple subunits represents HLA class II epitopes and yellow subunits represents B-cell epitopes), an adjuvant represented in orange, linked by EAAAK, AAY and GPGPG linkers (right legend). The linear sequence of the multi-epitopes is represented with the same colors previously listed, added with a histidine hexamer.

DOI: 10.7717/peerj.12548/supp-1

Efficiency of EpiCurator analysis for accurate selection of epitopes

The plot represents the percentage of epitopes removed and each different analysis identified in the caption for the HLA class I epitopes (A), HLA class II epitopes (B) and B cell epitopes (C).

DOI: 10.7717/peerj.12548/supp-2

Structure of the HLA-epitope complex for the main T-cell epitopes in Spike RBD domain

Structure complexes provided by docking simulation show the MHC binding grooves (blue ribbons), and the epitope (red structure) for HLA class I alleles (A) and HLA class II alleles (B). For each complex, the amino acids sequence of the epitope and HLA binding allele are available.

DOI: 10.7717/peerj.12548/supp-3

Graphical Representation of the Secondary Structure of multi-epitope construct

The alpha helix residues are in pink square, the beta strand residues are in yellow square, the coil residues are in grey square. The disordered residues are in Blue border square and purple border square and the epitopes are in black border square.

DOI: 10.7717/peerj.12548/supp-4

Cell activation under multi-epitope in silico immune Simulation

(A) Concentration of B cell population, including memory B cell after multi-epitope exposures. (B) Concentration of T-cell population, including memory cell after multi-epitope exposures. (C) Concentration of B cell population per state - Active, Internalized the Ag, Presentation on MHC II, Duplicating in the mitotic cycle and Anergic (D) Concentration of T cell population per state - Active, Duplicating in the mitotic cycle, Anergic and Resting - not active. For all panels the specific subclasses are indicated as colored peaks.

DOI: 10.7717/peerj.12548/supp-5

Structural properties predicted by RaptorX

(A) shows the solvent accessibility of each residue along the structure, and the proportion of buried, medium exposed and high exposed residues. (B) represents the disorder degree of each residue.

DOI: 10.7717/peerj.12548/supp-6

Final 3D structure of the multi-epitope

The structure refined using the GalaxyRefine server had all the RBD epitopes highlighted following their type: yellow color refers to HLA class I T-cell epitopes, pink color corresponds to the HLA class II T-cell epitopes, and the B-cell epitopes are represented by the blue color.

DOI: 10.7717/peerj.12548/supp-7

Structure validation provided by the ProSA web server

(A) shows the z-score comparing our multi-epitope structure to determined structures from PDB of the same size. (B) corresponds to the level of energy of each residue.

DOI: 10.7717/peerj.12548/supp-8

Molecular dynamic results for the multi-epitope-TLR3 complex

(A) refers to the eigenvalue and the needed energy for structure deformation; (B) shows the covariance matrix representing the unrelated (white), correlated (red) and anti-correlated (blue) residues; (C) demonstrates the results of the elastic network model and the darker gray regions point to more rigid springs; plot (D) shows the main-chain deformability simulation; and (E) represents the uncertainty quantification for each residue through the b-factor values.

DOI: 10.7717/peerj.12548/supp-9

Molecular dynamic results for the multi-epitope-TLR4 complex

(A) refers to the eigenvalue and the needed energy for structure deformation; (B) shows the covariance matrix representing the unrelated (white), correlated (red) and anti-correlated (blue) residues; (C) demonstrates the results of the elastic network model and the darker gray regions point to more rigid springs; plot (D) shows the main-chain deformability simulation; and (E) represents the uncertainty quantification for each residue through the b-factor values.

DOI: 10.7717/peerj.12548/supp-10

SARS-CoV-2 samples from GISAID

DOI: 10.7717/peerj.12548/supp-11

HLA alleles and haplotypes frequent in Brazilian population

DOI: 10.7717/peerj.12548/supp-12

Final selected epitopes: Summary of properties

DOI: 10.7717/peerj.12548/supp-13

Articles selected by EpiMiner analysis

DOI: 10.7717/peerj.12548/supp-14

Estimated data from EpiCurator pairwise comparison validation using sequences reported by four previously published papers

DOI: 10.7717/peerj.12548/supp-15

Estimated population coverages of each accurately selected T cell epitope

DOI: 10.7717/peerj.12548/supp-16

Estimated number of samples and lineage where RBD epitopes were identified

DOI: 10.7717/peerj.12548/supp-17

Homology analysis for the Spike protein among the Brazilian lineages and other non-Brazilian VOCs

DOI: 10.7717/peerj.12548/supp-18

Performance parameters of HLA-epitope docking simulation

DOI: 10.7717/peerj.12548/supp-19

Quality scores for the initial structure and the five candidate models after the refinement process

The quality features are QDT-HA, RMSD, MolProbity, Clash score, Poor Rotamers and the Ramachandran plot score (Rama favored).

DOI: 10.7717/peerj.12548/supp-20

Additional Information and Declarations

Competing Interests

Ana Tereza R. Vasconcelos is an Academic Editor for PeerJ.

Author Contributions

Cristina S. Ferreira conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.

Yasmmin C. Martins conceived and designed the experiments, performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Rangel Celso Souza analyzed the data, prepared figures and/or tables, and approved the final draft.

Ana Tereza R. Vasconcelos conceived and designed the experiments, authored or reviewed drafts of the paper, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The data is available at GitHub: https://github.com/YasCoMa/EpiCurator.

Funding

This work was developed in the frameworks of Corona-ômica-RJ (FAPERJ = E-26/210.179/2020) and Rede Corona-ômica BR MCTI/FINEP (FINEP = 01.20.0029.000462/20, CNPq = 404096/2020-4) and BRICS/CNPq - (440931/2020-7). Ana Tereza R. Vasconcelos is supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPq (303170/2017-4) and FAPERJ (E -26/202.826/2018). Yasmmin C Martins is currently supported by FAPERJ (E-26/202.168/2020). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

8 Citations 2,509 Views 199 Downloads