Clustering biomolecular complexes by residue contacts similarity


Rodrigues J. P. G. L. M., Trellet M., Schmitz C., Kastritis P., KARACA EREK E., Melquiond A. S. J., ...Daha Fazla

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, cilt.80, sa.7, ss.1810-1817, 2012 (SCI-Expanded) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 80 Sayı: 7
  • Basım Tarihi: 2012
  • Doi Numarası: 10.1002/prot.24078
  • Dergi Adı: PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.1810-1817
  • Anahtar Kelimeler: biomolecular clustering, protein docking, fraction of common contacts, PROTEIN-STRUCTURE, RESOLUTION, CONFORMATIONS, RECOGNITION, PREDICTION, ANGSTROM, MODELS
  • Dokuz Eylül Üniversitesi Adresli: Hayır

Özet

Inaccuracies in computational molecular modeling methods are often counterweighed by brute-force generation of a plethora of putative solutions. These are then typically sieved via structural clustering based on similarity measures such as the root mean square deviation (RMSD) of atomic positions. Albeit widely used, these measures suffer from several theoretical and technical limitations (e.g., choice of regions for fitting) that impair their application in multicomponent systems (N > 2), large-scale studies (e.g., interactomes), and other time-critical scenarios. We present here a simple similarity measure for structural clustering based on atomic contactsthe fraction of common contactsand compare it with the most used similarity measure of the protein docking communityinterface backbone RMSD. We show that this method produces very compact clusters in remarkably short time when applied to a collection of binary and multicomponent proteinprotein and proteinDNA complexes. Furthermore, it allows easy clustering of similar conformations of multicomponent symmetrical assemblies in which chain permutations can occur. Simple contact-based metrics should be applicable to other structural biology clustering problems, in particular for time-critical or large-scale endeavors.Proteins 2012; (c) 2012 Wiley Periodicals, Inc.