Benchmarking the accuracy of structure-based binding affinity predictors on Spike–ACE2 deep mutational interaction set

Ozden, Burcu; Şamiloğlu, Eda; Özsan, Atakan; Erguven, Mehmet; Yükrük, Can; Koşaca, Mehdi; Oktayoğlu, Melis; Menteş, Muratcan; Arslan, Nazmiye; KARAKÜLAH, GÖKHAN; Barlas, Ayşe; Savaş, Büşra; Karaca, EZGİ

doi:10.1002/prot.26645

Benchmarking the accuracy of structure-based binding affinity predictors on Spike–ACE2 deep mutational interaction set

Ozden B., Şamiloğlu E., Özsan A., Erguven M., Yükrük C., Koşaca M., ...Daha Fazla

Proteins: Structure, Function and Bioinformatics, cilt.92, sa.4, ss.529-539, 2024 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 92 Sayı: 4
Basım Tarihi: 2024
Doi Numarası: 10.1002/prot.26645
Dergi Adı: Proteins: Structure, Function and Bioinformatics
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, BIOSIS, Biotechnology Research Abstracts, CAB Abstracts, Chemical Abstracts Core, Food Science & Technology Abstracts, INSPEC, MEDLINE, Veterinary Science Database
Sayfa Sayıları: ss.529-539
Anahtar Kelimeler: ACE2, binding affinity prediction, deep mutagenesis, RBD, SARS-CoV-2
Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Since the start of COVID-19 pandemic, a huge effort has been devoted to understanding the Spike (SARS-CoV-2)–ACE2 recognition mechanism. To this end, two deep mutational scanning studies traced the impact of all possible mutations across receptor binding domain (RBD) of Spike and catalytic domain of human ACE2. By concentrating on the interface mutations of these experimental data, we benchmarked six commonly used structure-based binding affinity predictors (FoldX, EvoEF1, MutaBind2, SSIPe, HADDOCK, and UEP). These predictors were selected based on their user-friendliness, accessibility, and speed. As a result of our benchmarking efforts, we observed that none of the methods could generate a meaningful correlation with the experimental binding data. The best correlation is achieved by FoldX (R = −0.51). When we simplified the prediction problem to a binary classification, that is, whether a mutation is enriching or depleting the binding, we showed that the highest accuracy is achieved by FoldX with a 64% success rate. Surprisingly, on this set, simple energetic scoring functions performed significantly better than the ones using extra evolutionary-based terms, as in Mutabind and SSIPe. Furthermore, we demonstrated that recent AI approaches, mmCSM-PPI and TopNetTree, yielded comparable performances to the force field-based techniques. These observations suggest plenty of room to improve the binding affinity predictors in guessing the variant-induced binding profile changes of a host–pathogen system, such as Spike–ACE2. To aid such improvements we provide our benchmarking data at https://github.com/CSB-KaracaLab/RBD-ACE2-MutBench with the option to visualize our mutant models at https://rbd-ace2-mutbench.github.io/.