Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm

Bi-Qing Li; Yu-Hang Zhang; Mei-Ling Jin; Tao Huang; Yu-Dong Cai

doi:10.2174/1574893611666160711162006

ISSN: 1574-8936
E-ISSN: 2212-392X

Source: Current Bioinformatics, Volume 13, Issue 1, Feb 2018, p. 14 - 24
DOI: https://doi.org/10.2174/1574893611666160711162006
- Available online: 01 Feb 2018

Abstract

Background: As a crucial component of the entire protein-protein interaction (PPI) network, protein-peptide interactions are ubiquitous in living cells. These interactions play important roles in signal transduction and regulation. Compared with laborious and time-consuming experimental approaches, predicting protein-peptide interactions with effective computational methods could be convenient and rapid.

Method: This study proposed a novel method for predicting interactions between proteins and peptides using features extracted from both the protein and the peptide. The traditional amino acid composition, the pseudo-amino acid composition, and features derived from 205 domains were used to represent a protein-peptide interaction. Predictors were constructed with four different machine learning algorithms: SMO (sequential minimal optimization), IB1 (nearest neighbor algorithm), dagging, and random forest (RF). All features were analyzed with feature selection techniques, such as the maximum relevance minimum redundancy method and the incremental feature selection method, to extract the optimal features. An optimal predictor based on IB1 was then constructed from the extracted optimal features.

Results: With the IB1 algorithm, MCC (Matthews correlation coefficient) values of 0.4436 for the cross-validation test on the training set and 0.4444 for the independent test set were obtained. Different encoding methods were compared, and the domain-based method outperformed the pseudo-amino acid composition method. An optimal feature set of 230 features was selected, which contributed most to the prediction of the protein-peptide pairs.

Conclusion: Several important domains associated with features in the optimal feature set were deemed to play key roles in determining protein-peptide interactions.
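The encoding and classification steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: it assumes only the plain 20-dimensional amino acid composition for each sequence, a 1-nearest-neighbor rule with unnormalized Euclidean distance standing in for IB1, and the standard MCC formula. The pseudo-amino acid composition, the domain features, and the feature selection steps are omitted. All function names and the toy sequences are hypothetical.

```python
import math

# The 20 standard amino acids, in a fixed order (illustrative choice).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(seq):
    """Return the fraction of each standard amino acid in seq (20 values)."""
    seq = seq.upper()
    counts = [seq.count(a) for a in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [c / total for c in counts]


def encode_pair(protein, peptide):
    """Concatenate protein and peptide compositions into one 40-value vector."""
    return aa_composition(protein) + aa_composition(peptide)


def nearest_neighbor_predict(train_X, train_y, x):
    """Return the label of the training vector closest to x (Euclidean)."""
    best_label, best_dist = None, math.inf
    for features, label in zip(train_X, train_y):
        dist = math.dist(features, x)
        if dist < best_dist:
            best_dist, best_label = dist, label
    return best_label


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: report 0.0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy data, invented purely for illustration (1 = interacting, 0 = not).
train_X = [encode_pair("AAAA", "CC"), encode_pair("GGGG", "WW")]
train_y = [1, 0]
query = encode_pair("AAAG", "CC")
prediction = nearest_neighbor_predict(train_X, train_y, query)
```

A nearest neighbor rule has no training phase, so its quality depends directly on which features enter the distance, which is one reason feature selection matters for this kind of predictor.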

Article Type: Research Article

Keyword(s): functional domain composition; incremental feature selection; maximum relevance minimum redundancy; Protein-peptide interactions; pseudo-amino acid composition
