Skip to content
2000
Volume 13, Issue 10
  • ISSN: 0929-8665
  • E-ISSN: 1875-5305

Abstract

The identification of the thermostability from the amino acid sequence information would be helpful in computational screening for thermostable proteins. We have developed a method to discriminate thermophilic and mesophilic proteins based on support vector machines. Using self-consistency validation, 5-fold cross-validation and independent testing procedure with other datasets, this module achieved overall accuracy of 94.2%, 90.5% and 92.4%, respectively. The performance of this SVM-based module was better than the classifiers built using alternative machine learning and statistical algorithms including artificial neural networks, Bayesian statistics, and decision trees, when evaluated using these three validation methods. The influence of protein size on prediction accuracy was also addressed.

Loading

Article metrics loading...

/content/journals/ppl/10.2174/092986606778777560
2006-10-01
2025-06-12
Loading full text...

Full text loading...

/content/journals/ppl/10.2174/092986606778777560
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test