Skip to content
2000
Volume 8, Issue 2
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

In systems biology, it is a great challenge for researchers to identify whether the given set of organic compounds can combine together and form a meaningful pathway. Fortunately, it becomes more and more feasible to address and solve such a problem with the rapidly accumulated information on various organisms. Based on the attainable information, a novel computational approach is proposed to investigate this problem by adopting the metabolic pathway of yeast as the subject of the study. And we produced a benchmark dataset with 13,736 pathways consisting of both valid and invalid pathways and identified the valid pathways among them. Each of these pathways was encoded into a numeric vector, consisting of three parts: graph property, chemical functional group, and chemical structural set. Methods of Minimum Redundancy Maximum Relevance and Incremental Feature Selection were utilized to select an optimal feature set, and Nearest Neighbor Algorithm was adopted as the classification model, while Jackknife Test was used to evaluate the model. As a result, an optimal feature set consisting of 16 features, which were able to identify the valid pathways most successfully, was obtained.

Loading

Article metrics loading...

/content/journals/cbio/10.2174/1574893611308020008
2013-04-01
2025-05-02
Loading full text...

Full text loading...

/content/journals/cbio/10.2174/1574893611308020008
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test