Gene Ontology-Based Protein Function Prediction by Using Sequence Composition Information

Qiwen Dong; Shuigeng Zhou; Lei Deng; Jihong Guan

doi:10.2174/092986610791190336

ISSN: 0929-8665
E-ISSN: 1875-5305

Gene Ontology-Based Protein Function Prediction by Using Sequence Composition Information
By Qiwen Dong, Shuigeng Zhou, Lei Deng and Jihong Guan
Source: Protein and Peptide Letters, Volume 17, Issue 6, Jun 2010, p. 789 - 795
DOI: https://doi.org/10.2174/092986610791190336
- Available online: 01 Jun 2010

Abstract

The prediction of protein function is a difficult and important problem in computational biology. In this study, an efficient method is presented to predict protein function with sequence composition information. Four kinds of basic building blocks of protein sequences are investigated, including N-grams, binary profiles, PFAM domains and InterPro domains. The protein sequences are mapped into high-dimensional vectors by using the occurrence frequencies of each kind of building blocks. The resulting vectors are then taken as input to support vector machine to predict their function based on gene ontology. Experiments are conducted over the subset of GOA database. The experimental results show that the protein function can be predicted from primary sequence information. The method based on InterPro domains outperforms the other building blocks, and gets an overall accuracy of 0.87 and ROC score is 0.93. We also demonstrate that the use of feature extraction algorithms such as latent semantic analysis and nonnegative matrix factorization, can efficiently remove noise and improve the prediction efficiency without significantly degrading the performance. The results obtained here are helpful for the prediction of protein function by using only sequence information.

Article metrics loading...

/content/journals/ppl/10.2174/092986610791190336

2010-06-01

2026-02-09

From This Site

/content/journals/ppl/10.2174/092986610791190336

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/ppl/10.2174/092986610791190336

Article Type: Research Article

Keyword(s): basic building block; Protein function prediction; support vector machine

Gene Ontology-Based Protein Function Prediction by Using Sequence Composition Information

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Association between Higher Expression of Vav1 in Hepatocellular Carcinoma and Unfavourable Clinicopathological Features and Prognosis

The Role of TGFBR3 in the Development of Lung Cancer

Wogonin Restrains the Malignant Progression of Lung Cancer Through Modulating MMP1 and PI3K/AKT Signaling Pathway

miR-1204 Positioning in 8q24.21 Involved in the Tumorigenesis of Colorectal Cancer by Targeting MASPIN

The LL-37 Antimicrobial Peptide as a Treatment for Systematic Infection of Acinetobacter baumannii in a Mouse Model

ZNF165: A Pan-Cancer Biomarker with Prognostic and Therapeutic Potential

Anti-Cancer Bioactive Peptide Induces Apoptosis in Gastric Cancer Cells through TP53 Signaling Cascade

Circ_0002762 Regulates Oncoprotein YBX1 in Cervical Cancer via mir-375 to Regulate the Malignancy of Cancer Cells

Bioactive Peptides from Marine Organisms

MicroRNA-605-3p Inhibited the Growth and Chemoresistance of Osteosarcoma Cells via Negatively Modulating RAF1