iPSI(2L)-EDL: a Two-layer Predictor for Identifying Promoters and their Types based on Ensemble Deep Learning

Xuan Xiao; Zaihao Hu; ZhenTao Luo; Zhaochun Xu

doi:10.2174/0115748936264316230926073231

ISSN: 1574-8936
E-ISSN: 2212-392X

Source: Current Bioinformatics, Volume 19, Issue 4, May 2024, p. 327 - 340
DOI: https://doi.org/10.2174/0115748936264316230926073231
Available online: 01 May 2024

Abstract

Background: DNA fragments located near the transcription initiation site are categorized
into two types, strong promoters and weak promoters, based on their transcriptional activation and
expression levels. Identifying promoters and determining their strength is crucial for understanding
gene expression regulation. There is a need to improve the predictive quality of promoter prediction
models for real-world applications.

Methods: The most recent training dataset was constructed from the RegulonDB website, where all
promoters had been experimentally validated, and their sequence similarity was below 85%. DNA
sequence samples were represented using one-hot encoding, along with nucleotide chemical properties
and density (NCPD). An integrated deep learning framework was developed, incorporating a
multi-head attention module, a long short-term memory (LSTM) module, and a convolutional neural
network (CNN) module.
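The sequence representation described in the Methods can be illustrated with a minimal sketch. It assumes the NCP convention commonly used in the literature (ring structure, functional group, hydrogen bonding) and defines density as the frequency of a nucleotide within the prefix ending at its position; the abstract does not give the paper's exact encoding table, and the function name `encode_sequence` is hypothetical.

```python
# Assumed NCP convention: (ring structure, functional group, hydrogen bonding).
# The abstract does not specify the exact table used by the authors.
NCP = {
    "A": (1, 1, 1),
    "C": (0, 1, 0),
    "G": (1, 0, 0),
    "T": (0, 0, 1),
}

# Standard one-hot encoding over the alphabet A, C, G, T.
ONE_HOT = {
    "A": (1, 0, 0, 0),
    "C": (0, 1, 0, 0),
    "G": (0, 0, 1, 0),
    "T": (0, 0, 0, 1),
}


def encode_sequence(seq):
    """Return one 8-value row per nucleotide: one-hot (4) + NCP (3) + density (1)."""
    seq = seq.upper()
    counts = {base: 0 for base in ONE_HOT}
    rows = []
    for position, base in enumerate(seq, start=1):
        if base not in ONE_HOT:
            raise ValueError(f"unsupported nucleotide {base!r} at position {position}")
        counts[base] += 1
        # Density: how often this base has occurred in seq[:position].
        density = counts[base] / position
        rows.append(ONE_HOT[base] + NCP[base] + (density,))
    return rows


if __name__ == "__main__":
    for row in encode_sequence("ACA"):
        print(row)
```

Under these assumptions, a sequence of length L yields L rows of 8 values, which is a shape that attention, LSTM, and CNN modules can all consume as a per-position feature matrix.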

Results: The AUC and MCC for iPSI(2L)-EDL in identifying promoters improved by 2.23% and
2.96%, respectively, compared to the PseDNC-DL method on independent testing data. The AUC
and MCC for iPSI(2L)-EDL increased by 3.74% and 5.86%, respectively, in predicting promoter
strength type.
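For readers unfamiliar with MCC (Matthews correlation coefficient), the following sketch shows how it is computed from a binary confusion matrix. This is the standard definition, not code from the paper; the function name and the counts in the example are hypothetical and are not results reported by the authors.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0.0 when any marginal total is zero.
    return numerator / denominator if denominator else 0.0


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(matthews_corrcoef(tp=90, tn=85, fp=15, fn=10))
```

MCC ranges from -1 to 1 and uses all four confusion-matrix cells, which makes it less sensitive than accuracy to class imbalance.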

Conclusion: Accounting for the importance of different input positions and for long-range dependencies
among features contributed to better promoter recognition. The CNN played a crucial role in recognizing
promoters. Furthermore, to facilitate access for most experimental scientists, a user-friendly
web server has been established, which can be accessed at http://47.94.248.117/IPSW(2L)-EDL.

Article Type: Research Article

Keyword(s): ensemble deep learning; multi-feature fusion; promoter recognition; strong promoter; weak promoter
