Interaction prediction of PDZ domains using a machine learning approach


KALYONCU UZUNLAR S., Keskin O., Gursoy A.

2010 5th International Symposium on Health Informatics and Bioinformatics, HIBIT 2010, Antalya, Türkiye, 20 - 22 Nisan 2010, ss.121-124, (Tam Metin Bildiri) identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/hibit.2010.5478896
  • Basıldığı Şehir: Antalya
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.121-124
  • Anahtar Kelimeler: Pdz domains, Protein-protein interactions, Random forest
  • Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Protein interaction domains play crucial roles in many complex cellular pathways. PDZ domains are one of the most common protein interaction domains. Prediction of binding specificity of PDZ domains by a computational manner could eliminate unnecessary, time-consuming experiments. In this study, interactions of PDZ domains are predicted by using a machine learning approach in which only primary sequences of PDZ domains and peptides are used. In order to encode feature vectors for each interaction, trigram frequencies of primary sequences of PDZ domains and corresponding peptides are calculated. After construction of numerical interaction dataset, we compared different classifiers and ended up with Random Forest (RF) algorithm which gave the top performance. We obtained very high prediction accuracy (91.4%) for binary interaction prediction which outperforms all previous similar methods. © 2009 IEEE.