Skip to content
2000
Volume 19, Issue 10
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

Background

The chemical modification of RNA plays a crucial role in many biological processes. N7-methylguanosine (m7G), being one of the most important epigenetic modifications, plays an important role in gene expression, processing metabolism, and protein synthesis. Detecting the exact location of m7G sites in the transcriptome is key to understanding their relevant mechanism in gene expression. On the basis of experimentally validated data, several machine learning or deep learning tools have been designed to identify internal m7G sites and have shown advantages over traditional experimental methods in terms of speed, cost-effectiveness and robustness.

Aims

In this study, we aim to develop a computational model to help predict the exact location of m7G sites in humans.

Objective

Simple and advanced encoding methods and deep learning networks are designed to achieve excellent m7G prediction efficiently.

Methods

Three types of feature extractions and six classification algorithms were tested to identify m7G sites. Our final model, named Sia-m7G, adopts one-hot encoding and a delicate Siamese neural network with an attention mechanism. In addition, multiple 10-fold cross-validation tests were conducted to evaluate our predictor.

Results

Sia-m7G achieved the highest sensitivity, specificity and accuracy on 10-fold cross-validation tests compared with the other six m7G predictors. Nucleotide preference and model visualization analyses were conducted to strengthen the interpretability of Sia-m7G and provide a further understanding of m7G site fragments in genomic sequences.

Conclusion

Sia-m7G has significant advantages over other classifiers and predictors, which proves the superiority of the Siamese neural network algorithm in identifying m7G sites.

Loading

Article metrics loading...

/content/journals/cbio/10.2174/0115748936285540240116065719
2024-02-07
2024-11-22
Loading full text...

Full text loading...

References

  1. FryeM. HaradaB.T. BehmM. HeC. RNA modifications modulate gene expression during development.Science201836164091346134910.1126/science.aau1646 30262497
    [Google Scholar]
  2. KomalS. ZhangL.R. HanS.N. Potential regulatory role of epigenetic RNA methylation in cardiovascular diseases.Biomed. Pharmacother.202113711137610.1016/j.biopha.2021.111376 33588266
    [Google Scholar]
  3. FuruichiY. Discovery of m(7)G-cap in eukaryotic mRNAs.Proc. Jpn. Acad., Ser. B, Phys. Biol. Sci.201591839440910.2183/pjab.91.394 26460318
    [Google Scholar]
  4. TomikawaC. 7-Methylguanosine modifications in Transfer RNA (tRNA).Int. J. Mol. Sci.20181912408010.3390/ijms19124080 30562954
    [Google Scholar]
  5. LinS. LiuQ. LelyveldV.S. ChoeJ. SzostakJ.W. GregoryR.I. Mettl1/Wdr4-Mediated m7G tRNA methylome is required for Normal mRNA translation and embryonic stem cell self-renewal and differentiation.Mol. Cell2018712244255.e510.1016/j.molcel.2018.06.001 29983320
    [Google Scholar]
  6. MarchandV. AyadiL. ErnstF.G.M. AlkAniline‐Seq: Profiling of m 7 G and m 3 C RNA modifications at single nucleotide resolution.Angew. Chem. Int. Ed.20185751167851679010.1002/anie.201810946 30370969
    [Google Scholar]
  7. ZhangL.S. LiuC. MaH. Transcriptome-wide mapping of internal N7-methylguanosine methylome in mammalian mRNA.Mol. Cell201974613041316.e810.1016/j.molcel.2019.03.036 31031084
    [Google Scholar]
  8. MalbecL. ZhangT. ChenY.S. Dynamic methylome of internal mRNA N7-methylguanosine and its regulatory role in translation.Cell Res.2019291192794110.1038/s41422‑019‑0230‑z 31520064
    [Google Scholar]
  9. LuoX. ChiW. DengM. Deepprune: Learning efficient and interpretable convolutional networks through weight pruning for predicting DNA-protein binding.Front. Genet.201910114510.3389/fgene.2019.01145 31824562
    [Google Scholar]
  10. ZhangY. QiaoS. JiS. LiY. DeepSite: Bidirectional LSTM and CNN models for predicting DNA-protein binding.Int. J. Mach. Learn. Cybern.202011484185110.1007/s13042‑019‑00990‑x
    [Google Scholar]
  11. ChenW. FengP. SongX. LvH. LinH. iRNA-m7G: Identifying N7-methylguanosine sites by fusing multiple features.Mol. Ther. Nucleic Acids20191826927410.1016/j.omtn.2019.08.022 31581051
    [Google Scholar]
  12. YangY.H. MaC. WangJ.S. Prediction of N7-methylguanosine sites in human RNA based on optimal sequence features.Genomics202011264342434710.1016/j.ygeno.2020.07.035 32721444
    [Google Scholar]
  13. SongB. TangY. ChenK. m7GHub: Deciphering the location, regulation and pathogenesis of internal mRNA N7-methylguanosine (m7G) sites in human.Bioinformatics202036113528353610.1093/bioinformatics/btaa178 32163126
    [Google Scholar]
  14. ZouH. YinZ. m7G-DPP: Identifying N7-methylguanosine sites based on dinucleotide physicochemical properties of RNA.Biophys. Chem.202127910669710.1016/j.bpc.2021.106697 34628276
    [Google Scholar]
  15. LiuX. LiuZ. MaoX. LiQ. m7GPredictor: An improved machine learning-based model for predicting internal m7G modifications using sequence properties.Anal. Biochem.202060911390510.1016/j.ab.2020.113905 32805275
    [Google Scholar]
  16. DaiC. FengP. CuiL. SuR. ChenW. WeiL. Iterative feature representation algorithm to improve the predictive performance of N7-methylguanosine sites.Brief. Bioinform.2021224bbaa27810.1093/bib/bbaa278
    [Google Scholar]
  17. BiY. XiangD. GeZ. LiF. JiaC. SongJ. An interpretable prediction model for identifying N7-methylguanosine sites based on XGBoost and SHAP.Mol. Ther. Nucleic Acids20202236237210.1016/j.omtn.2020.08.022 33230441
    [Google Scholar]
  18. ZhangL. QinX. LiuM. LiuG. RenY. BERT-m7G: A transformer architecture based on BERT and stacking ensemble to identify RNA N7-methylguanosine sites from sequence information.Comput. Math. Methods Med.202120217764764
    [Google Scholar]
  19. ShoombuatongW. BasithS. PittiT. LeeG. ManavalanB. THRONE: A new approach for accurate prediction of human RNA N7-methylguanosine sites.J. Mol. Biol.20224341116754910.1016/j.jmb.2022.167549 35662472
    [Google Scholar]
  20. ZhangY. YuL. JingR. HanB. LuoJ. Fast and efficient design of deep neural networks for predicting N 7 -methylguanosine sites using autobioseqpy.ACS Omega2023822197281974010.1021/acsomega.3c01371 37305295
    [Google Scholar]
  21. NingQ. ShengM. m7G-DLSTM: Intergrating directional Double-LSTM and fully connected network for RNA N7-methlguanosine sites prediction in human.Chemom. Intell. Lab. Syst.202121710439810.1016/j.chemolab.2021.104398
    [Google Scholar]
  22. TahirM. HayatM. KhanR. ChongK.T. An effective deep learning-based architecture for prediction of N7-methylguanosine sites in health systems.Electronics20221112191710.3390/electronics11121917
    [Google Scholar]
  23. ChenZ. ZhaoP. LiF. iLearn: An integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.Brief. Bioinform.20202131047105710.1093/bib/bbz041 31067315
    [Google Scholar]
  24. ChenW. TangH. YeJ. LinH. ChouK-C. iRNA-PseU: Identifying RNA pseudouridine sites.Mol. Ther. Nucleic Acids201657e332 28427142
    [Google Scholar]
  25. WuH. PanX. YangY. ShenH.B. Recognizing binding sites of poorly characterized RNA-binding proteins on circular RNAs using attention Siamese network.Brief. Bioinform.2021226bbab27910.1093/bib/bbab279 34297803
    [Google Scholar]
  26. VacicV. IakouchevaL.M. RadivojacP. Two Sample Logo: A graphical representation of the differences between two sets of sequence alignments.Bioinformatics200622121536153710.1093/bioinformatics/btl151 16632492
    [Google Scholar]
  27. LuoX. TuX. DingY. GaoG. DengM. Expectation pooling: An effective and interpretable pooling method for predicting DNA–protein binding.Bioinformatics20203651405141210.1093/bioinformatics/btz768 31598637
    [Google Scholar]
  28. KeG. MengQ. FinleyT. WangT. ChenW. MaW. Eds. LightGBM: A highly efficient gradient boosting decision tree.31st Annual Conference on Neural Information Processing Systems (NIPS)04-09 DecLong Beach, CA, USA2017
    [Google Scholar]
  29. TangZ. LiZ. HouT. SiGra: Single-cell spatial elucidation through an image-augmented graph transformer.Nat. Commun.2023141561810.1038/s41467‑023‑41437‑w 37699885
    [Google Scholar]
  30. TangZ. LiuX. LiZ. SpaRx: Elucidate single-cell spatial heterogeneity of drug responses for personalized treatment.Brief. Bioinform.2023246bbad33810.1093/bib/bbad338 37798249
    [Google Scholar]
  31. VaswaniA. ShazeerN. ParmarN. UszkoreitJ. JonesL. GomezA.N. Eds. Attention is all you need.31st Annual Conference on Neural Information Processing Systems (NIPS)04-09 DecLong Beach, CA, USA2017
    [Google Scholar]
  32. van der MaatenL. HintonG. Visualizing data using t-SNE.J. Mach. Learn. Res.2008925792605
    [Google Scholar]
/content/journals/cbio/10.2174/0115748936285540240116065719
Loading
/content/journals/cbio/10.2174/0115748936285540240116065719
Loading

Data & Media loading...

Supplements

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test