Strengths and weaknesses of data-driven docking in critical assessment of prediction of interactions

de Vries S. J., Melquiond A. S. J., Kastritis P. L., KARACA EREK E., Bordogna A., van Dijk M., ...More

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, vol.78, no.15, pp.3242-3249, 2010 (SCI-Expanded) identifier identifier identifier

  • Publication Type: Article / Article
  • Volume: 78 Issue: 15
  • Publication Date: 2010
  • Doi Number: 10.1002/prot.22814
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.3242-3249
  • Keywords: Haddock, multi-body docking, web server, scoring, experimental restraints, interface prediction, RNA-BINDING SITES, PROTEIN-PROTEIN COMPLEXES, CRYSTAL-STRUCTURE, STRUCTURAL BASIS, DUAL RECOGNITION, INHIBITOR, SERVER, SPECIFICITY, HADDOCK, DOMAIN
  • Dokuz Eylül University Affiliated: No


The recent CAPRI rounds have introduced new docking challenges in the form of protein-RNA complexes, multiple alternative interfaces, and an unprecedented number of targets for which homology modeling was required. We present here the performance of HADDOCK and its web server in the CAPRI experiment and discuss the strengths and weaknesses of data-driven docking. HADDOCK was successful for 6 out of 9 complexes (6 out of 11 targets) and accurately predicted the individual interfaces for two more complexes. The HADDOCK server, which is the first allowing the simultaneous docking of generic multi-body complexes, was successful in 4 out of 7 complexes for which it participated. In the scoring experiment, we predicted the highest number of targets of any group. The main weakness of data-driven docking revealed from these last CAPRI results is its vulnerability for incorrect experimental data related to the interface or the stoichiometry of the complex. At the same time, the use of experimental and/or predicted information is also the strength of our approach as evidenced for those targets for which accurate experimental information was available (e.g., the 10 three-stars predictions for T40!). Even when the models show a wrong orientation, the individual interfaces are generally well predicted with an average coverage of 60% +/- 26% over all targets. This makes data-driven docking particularly valuable in a biological context to guide experimental studies like, for example, targeted mutagenesis. Proteins 2010; 78:3242-3249. (C) 2010 Wiley-Liss, Inc.