Volume 25, Issue 1
  • ISSN: 1386-2073
  • E-ISSN: 1875-5402

Abstract

Background and Objective: DNA-binding proteins play important roles in a variety of biological processes, such as gene transcription and regulation, DNA replication and repair, DNA recombination and packaging, and the formation of chromatin and ribosomes. A computational method that improves the recognition efficiency of DNA-binding proteins is therefore urgently needed. Methods: We propose a novel method, DBP-PSSM, which constructs features from the amino acid composition and evolutionary information of protein sequences. The maximum relevance, minimum redundancy (mRMR) method was employed to select the optimal features for an XGBoost classifier, and the resulting model for predicting DNA-binding proteins, DBP-PSSM, was established with 5-fold cross-validation on the training dataset. Results: DBP-PSSM achieved an accuracy of 81.18% and an MCC of 0.657 on a test dataset, outperforming many existing methods. These results demonstrate that our method can effectively predict DNA-binding proteins. Conclusion: The data and source code are provided at https://github.com/784221489/DNA-binding.
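
The abstract names two quantities that are easy to make concrete: the amino acid composition used as part of the feature set, and the accuracy and Matthews correlation coefficient (MCC) used to report results. The following is a minimal sketch under stated assumptions, not the authors' implementation (which is in the linked repository). The function names `amino_acid_composition` and `accuracy_and_mcc`, the residue ordering, and the convention of returning an MCC of 0 when the denominator vanishes are all illustrative choices.

```python
import math
from collections import Counter

# The 20 standard amino acids; this ordering is an arbitrary (assumed) choice.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return the relative frequency of each standard residue (20 values)."""
    # Ignore non-standard symbols such as X or gaps (an assumption of this sketch).
    residues = [r for r in sequence.upper() if r in AMINO_ACIDS]
    if not residues:
        return [0.0] * len(AMINO_ACIDS)
    counts = Counter(residues)
    return [counts[a] / len(residues) for a in AMINO_ACIDS]


def accuracy_and_mcc(y_true, y_pred):
    """Compute accuracy and MCC for binary labels (1 = DNA-binding, 0 = not)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Report 0.0 when a row or column of the confusion matrix is empty.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, mcc


if __name__ == "__main__":
    print(amino_acid_composition("AAC")[:2])
    print(accuracy_and_mcc([1, 1, 0, 0], [1, 0, 0, 0]))
```

Unlike accuracy, MCC uses all four confusion-matrix counts, so it stays informative when the two classes are imbalanced.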

/content/journals/cchts/10.2174/1386207323999201124203531
2022-01-01
2025-06-18
  • Article Type: Research Article
Keyword(s): DNA-binding proteins; Local_DPP; mRMR; PSSM400; sliding window and smoothing window; XGBoost