loading page

Docking with Rosetta and deep learning approaches in CAPRI rounds 47-55
  • +6
  • Ameya Harmalkar,
  • Lee-Shin Chu,
  • Samuel W. Canner,
  • Rituparna Samanta,
  • Rahel Frick,
  • Fatima A. Davila-Hernandez,
  • Sudeep Sarma,
  • Fatima Hitawala,
  • Jeffrey Gray
Ameya Harmalkar
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Lee-Shin Chu
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Samuel W. Canner
Johns Hopkins University Department of Biophysics and Biophysical Chemistry
Author Profile
Rituparna Samanta
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Rahel Frick
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Fatima A. Davila-Hernandez
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Sudeep Sarma
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Fatima Hitawala
Johns Hopkins University Department of Chemical and Biomolecular Engineering
Author Profile
Jeffrey Gray
Johns Hopkins University Department of Chemical and Biomolecular Engineering

Corresponding Author:jgray@jhu.edu

Author Profile

Abstract

Critical Assessment of PRediction of Interactions (CAPRI) rounds 47 through 55 introduced 49 targets comprising multistage assemblies, antibody-antigen complexes, and flexible interfaces. For these rounds, we combined various Rosetta docking approaches (RosettaDock, ReplicaDock, and SymDock) with deep learning approaches (AlphaFold2, IgFold, and AlphaRED). Since prior CAPRI rounds, we have developed methods to better capture conformational changes, updated our scoring function, and integrated structure prediction tools such as AlphaFold2 in our docking routines. Here, we highlight several notable CAPRI targets and address the major challenges in the blind prediction of protein-protein interactions, including binding-induced conformational changes, large multimeric proteins, and antibody-antigen interactions. Although predictors have achieved modest improvements in accuracy of simpler targets post-AlphaFold2, performance for more flexible complexes remains limited. We employed RosettaDock 4.0, ReplicaDock 2.0, and AlphaRED to enhance backbone conformational sampling for flexible complexes. Our docking routines improved the DockQ score (0.77 vs. 0.62 for AF2-multimer) for a GP2 bacteriophage protein (T194), effectively capturing binding-induced conformational changes. Additionally, we introduce a fold-and-dock approach for predicting the assembly of a surface-layer SAP protein derived from Bacillus anthracis (T160), a large hetero-multimer comprising six distinct sub-units. For large symmetric complexes, we used Rosetta-based SymDock 2.0, successfully predicting a human DNA repair protein complex with A10 stoichiometry (T230) with high CAPRI-quality ranking. We also address the challenges in modeling antibody/nanobody-antigen interactions, particularly through the integration of deep learning tools and docking methods. Despite advances with tools like IgFold and AlphaFold2, accurately predicting CDR H3 loops and antibody-antigen binding interfaces remains challenging. Combining ReplicaDock 2.0 with deep learning highlights these difficulties and underscores the need for extensive sampling and CDR-focused strategies to improve prediction accuracy.
16 Sep 2024Submitted to PROTEINS: Structure, Function, and Bioinformatics
16 Sep 2024Submission Checks Completed
16 Sep 2024Assigned to Editor
16 Sep 2024Review(s) Completed, Editorial Evaluation Pending
01 Oct 2024Reviewer(s) Assigned
01 Nov 2024Editorial Decision: Revise Major