Volume 17, Issue 1
  • ISSN: 0929-8665
  • E-ISSN: 1875-5305

Abstract

With the rapid increase in protein sequences in the post-genomic age, the need for an automated and accurate tool to predict protein subcellular localization is becoming increasingly important. Many approaches have been tried. Most aim to find the optimal classification scheme, and few consider reducing the complexity of the biological system. This work shows how to decrease that complexity with a linear DR (Dimensionality Reduction) method that transforms the original high-dimensional feature vectors into low-dimensional ones. A powerful sequence encoding scheme that fuses PSSM (Position-Specific Score Matrix) and Chou’s PseAA (Pseudo Amino Acid) composition is proposed to represent the protein samples. The K-NN (K-Nearest Neighbor) classifier is then employed to identify the subcellular localization of the proteins based on their reduced low-dimensional feature vectors. The experimental results are quite encouraging, indicating that the linear DR method is promising for complicated biological problems such as predicting the subcellular localization of Gram-negative bacterial proteins.
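
The pipeline described in the abstract (feature vectors, linear dimensionality reduction, then K-NN) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes PCA as the linear DR step (PCA is listed among the article's keywords), uses power iteration with deflation to find the principal components, and runs on small made-up 4-dimensional vectors that stand in for the fused PSSM and PseAA features. The function names, toy data, and class labels are all hypothetical.

```python
import math
import random
from collections import Counter

def fit_pca(X, k, iters=200, seed=0):
    """Return (mean, top-k principal directions) of the rows of X.
    Uses power iteration with deflation on the sample covariance matrix."""
    n, d = len(X), len(X[0])
    mu = [sum(r[j] for r in X) / n for j in range(d)]
    Xc = [[r[j] - mu[j] for j in range(d)] for r in X]
    C = [[sum(r[i] * r[j] for r in Xc) / (n - 1) for j in range(d)]
         for i in range(d)]
    rng = random.Random(seed)
    comps = []
    for _ in range(k):
        v = [rng.gauss(0, 1) for _ in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(C[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:  # no variance left in the deflated matrix
                break
            v = [x / norm for x in w]
        lam = sum(v[i] * C[i][j] * v[j] for i in range(d) for j in range(d))
        comps.append(v)
        # Deflate: remove the component just found before the next pass.
        C = [[C[i][j] - lam * v[i] * v[j] for j in range(d)] for i in range(d)]
    return mu, comps

def project(x, mu, comps):
    """Map a high-dimensional vector onto the low-dimensional PCA axes."""
    return [sum((x[j] - mu[j]) * c[j] for j in range(len(x))) for c in comps]

def knn_predict(train_Z, train_y, z, k=3):
    """Majority vote among the k training points nearest to z (Euclidean)."""
    order = sorted(range(len(train_Z)), key=lambda i: math.dist(train_Z[i], z))
    votes = Counter(train_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy data: 4-D stand-ins for fused feature vectors.
X = [
    [0.1, 0.2, 0.0, 0.1], [0.0, -0.1, 0.1, 0.0], [-0.2, 0.1, -0.1, 0.1],
    [5.1, 4.9, 0.0, 0.1], [4.8, 5.2, 0.1, -0.1], [5.0, 5.1, -0.1, 0.0],
]
y = ["cytoplasm"] * 3 + ["periplasm"] * 3  # illustrative labels only

mu, comps = fit_pca(X, k=2)                 # reduce 4 dimensions to 2
Z = [project(x, mu, comps) for x in X]

def classify(x):
    return knn_predict(Z, y, project(x, mu, comps), k=3)
```

Calling `classify` on a new vector projects it with the mean and components learned from the training set, then votes among its three nearest neighbors in the reduced space. A supervised linear method such as LDA, also named in the keywords, would replace `fit_pca` while leaving the K-NN step unchanged.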

/content/journals/ppl/10.2174/092986610789909494
2010-01-01
2025-06-23

  • Article Type:
    Research Article
Keyword(s): LDA; Linear dimensionality reduction; PCA; PseAAC; PSSM; Subcellular localization