A Sequence-segment Neighbor Encoding Schema for Protein Hotspot Residue Prediction

Peng Chen; Tong Shen; Youzhi Zhang; Bing Wang

doi:10.2174/1574893615666200106115421

ISSN: 1574-8936
E-ISSN: 2212-392X

A Sequence-segment Neighbor Encoding Schema for Protein Hotspot Residue Prediction
By Peng Chen, Tong Shen, Youzhi Zhang and Bing Wang
Source: Current Bioinformatics, Volume 15, Issue 5, Jun 2020, p. 445 - 454
DOI: https://doi.org/10.2174/1574893615666200106115421
- Available online: 01 Jun 2020

Abstract

Background: Hotspots are those residues that contribute major free energy of binding in protein-protein interactions. Protein functions are frequently dependent on hotspot residues. At present, hotspot residues are always identified by Alanine scanning mutagenesis technology, which is costly, time-consuming and laborious. Objective: Therefore, more accurate and efficient methods have to be developed to identify protein hotspot residues. Methods: This paper proposed a novel encoding schema of sequence-segment neighbors and constructed a random forest-based model to identify hotspots in protein interaction interfaces. Firstly, 10 amino acid physicochemical properties, 16 features related to the PI and DI, and 25 features related to ASA were extracted. Different from the previous residue encoding schemas, such as auto correlation descriptor or triplet combination information, this paper employed the influence of amino acids neighbors to hotspot residues and amino acids with a certain distance in sequence to the hotspot. Results: Moreover, the proposed model was compared with other hotspot prediction methods, including APIS, Robetta, FOLDEF, KFC, MINERVA models, etc. Conclusion: The experimental results showed that the proposed model can improve the prediction ability of protein hotspot residues on the same test set.

Article metrics loading...

/content/journals/cbio/10.2174/1574893615666200106115421

2020-06-01

2026-02-21

From This Site

/content/journals/cbio/10.2174/1574893615666200106115421

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/cbio/10.2174/1574893615666200106115421

Article Type: Research Article

Keyword(s): encoding of sequence-segment neighbors; hotspots; Protein interaction; random forest; schema; sliding window

Most Cited Most Cited RSS feed

- A Review of Ensemble Methods in Bioinformatics
  
  Authors: Pengyi Yang, Yee Hwa Yang, Bing B. Zhou and Albert Y. Zomaya
- Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis
  
  Authors: Masahiro Sugimoto, Masato Kawakami, Martin Robert, Tomoyoshi Soga and Masaru Tomita
- Distance-based Support Vector Machine to Predict DNA N6- methyladenine Modification
  
  Authors: Haoyu Zhang, Quan Zou, Ying Ju, Chenggang Song and Dong Chen
- A Review on the Recent Developments of Sequence-based Protein Feature Extraction Methods
  
  Authors: Jun Zhang and Bin Liu
- Molecular Genetic Markers: Discovery, Applications, Data Storage and Visualisation
  
  Authors: Chris Duran, Nikki Appleby, David Edwards and Jacqueline Batley
- A Brief Survey of Machine Learning Methods in Protein Sub-Golgi Localization
  
  Authors: Wuritu Yang, Xiao-Juan Zhu, Jian Huang, Hui Ding and Hao Lin
- Cancer Diagnosis Through IsomiR Expression with Machine Learning Method
  
  Authors: Zhijun Liao, Dapeng Li, Xinrui Wang, Lisheng Li and Quan Zou
- Relevance of Molecular Docking Studies in Drug Designing
  
  Authors: Ritu Jakhar, Mehak Dangi, Alka Khichi and Anil K. Chhillar
- The Advances and Challenges of Deep Learning Application in Biological Big Data Processing
  
  Authors: Li Peng, Manman Peng, Bo Liao, Guohua Huang, Weibiao Li and Dingfeng Xie
- Gene Expression Profile Classification: A Review
  
  Authors: Musa H. Asyali, Dilek Colak, Omer Demirkaya and Mehmet S. Inan
More Less

A Sequence-segment Neighbor Encoding Schema for Protein Hotspot Residue Prediction

Abstract

Most Read This Month

Most Cited Most Cited RSS feed