Highly Accurate Gene Essentiality Prediction with W-Nucleotide Z Curve Features and Feature Selection Technique in Saccharomyces cerevisiae

Wen-Xin Zheng; Shu-Xuan Wang; Hong Liu

doi:10.2174/1574893616666210506150436

ISSN: 1574-8936
E-ISSN: 2212-392X

Highly Accurate Gene Essentiality Prediction with W-Nucleotide Z Curve Features and Feature Selection Technique in Saccharomyces cerevisiae
By Wen-Xin Zheng, Shu-Xuan Wang and Hong Liu
Source: Current Bioinformatics, Volume 16, Issue 8, Oct 2021, p. 1081 - 1088
DOI: https://doi.org/10.2174/1574893616666210506150436
- Available online: 01 Oct 2021

Abstract

Background: Many studies have been conducted on essentiality prediction in the Saccharomyces cerevisiae genome, but the accuracy is not as high as those in bacterial or human genomes. The most frequently used features are Protein-Protein Interaction (PPI) networks combined with some other features, such as evolutionary conservation, expression level, and protein domain information. Sequence composition features are the least used features. Objective: To improve the accuracy of essentiality prediction in the Saccharomyces cerevisiae genome, we proposed a highly accurate gene essentiality prediction algorithm. Methods: In this paper, we propose an algorithm based on a linear Support Vector Machine (SVM) using sequence features only. The variables in this paper are derived from sequence data based on the w-nucleotide Z curve format without any other information. Results: After feature selection, the best area under the receiver operating characteristic curve (AUC) was 0.944 for 5-fold cross-validation. From 1- to 6-nucleotide Z curve variables, feature extraction can increase the AUC in all cases. Conclusion: The prediction on sequence composition is only promising, particularly when a feature filtering method is used, and maybe a good complement for algorithms based on other features.

Article metrics loading...

/content/journals/cbio/10.2174/1574893616666210506150436

2021-10-01

2026-02-14

From This Site

/content/journals/cbio/10.2174/1574893616666210506150436

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/cbio/10.2174/1574893616666210506150436

Article Type: Research Article

Keyword(s): cell; Essential gene prediction; genome; Saccharomyces crevisiae; sequence composition; W-nucleotide Z curve

Most Cited Most Cited RSS feed

- A Review of Ensemble Methods in Bioinformatics
  
  Authors: Pengyi Yang, Yee Hwa Yang, Bing B. Zhou and Albert Y. Zomaya
- Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis
  
  Authors: Masahiro Sugimoto, Masato Kawakami, Martin Robert, Tomoyoshi Soga and Masaru Tomita
- Distance-based Support Vector Machine to Predict DNA N6- methyladenine Modification
  
  Authors: Haoyu Zhang, Quan Zou, Ying Ju, Chenggang Song and Dong Chen
- A Review on the Recent Developments of Sequence-based Protein Feature Extraction Methods
  
  Authors: Jun Zhang and Bin Liu
- Molecular Genetic Markers: Discovery, Applications, Data Storage and Visualisation
  
  Authors: Chris Duran, Nikki Appleby, David Edwards and Jacqueline Batley
- A Brief Survey of Machine Learning Methods in Protein Sub-Golgi Localization
  
  Authors: Wuritu Yang, Xiao-Juan Zhu, Jian Huang, Hui Ding and Hao Lin
- Cancer Diagnosis Through IsomiR Expression with Machine Learning Method
  
  Authors: Zhijun Liao, Dapeng Li, Xinrui Wang, Lisheng Li and Quan Zou
- Relevance of Molecular Docking Studies in Drug Designing
  
  Authors: Ritu Jakhar, Mehak Dangi, Alka Khichi and Anil K. Chhillar
- The Advances and Challenges of Deep Learning Application in Biological Big Data Processing
  
  Authors: Li Peng, Manman Peng, Bo Liao, Guohua Huang, Weibiao Li and Dingfeng Xie
- Gene Expression Profile Classification: A Review
  
  Authors: Musa H. Asyali, Dilek Colak, Omer Demirkaya and Mehmet S. Inan
More Less

Highly Accurate Gene Essentiality Prediction with W-Nucleotide Z Curve Features and Feature Selection Technique in Saccharomyces cerevisiae

Abstract

Most Read This Month

Most Cited Most Cited RSS feed