Skip to content
2000
Volume 19, Issue 10
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

Background

When using clinical data for multi-omics analysis, there are issues such as the insufficient number of omics data types and relatively small sample size due to the protection of patients' privacy, the requirements of data management by various institutions, and the relatively large number of features of each omics data. This paper describes the analysis of multi-omics pathway relationships using statistical data in the absence of clinical data.

Methods

We proposed a novel approach to exploit easily accessible statistics in public databases. This approach introduces phenotypic associations that are not included in the clinical data and uses these data to build a three-layer heterogeneous network. To simplify the analysis, we decomposed the three-layer network into double two-layer networks to predict the weights of the inter-layer associations. By adding a hyperparameter β, the weights of the two layers of the network were merged, and then k-fold cross-validation was used to evaluate the accuracy of this method. In calculating the weights of the two-layer networks, the RWR with fixed restart probability was combined with PBMDA and CIPHER to generate the PCRWR with biased weights and improved accuracy.

Results

The area under the receiver operating characteristic curve was increased by approximately 7% in the case of the RWR with initial weights.

Conclusion

Multi-omics statistical data were used to establish genotype and phenotype correlation networks for analysis, which was similar to the effect of clinical multi-omics analysis.

Loading

Article metrics loading...

/content/journals/cbio/10.2174/0115748936276861240109045208
2024-02-06
2024-11-22
Loading full text...

Full text loading...

References

  1. GuoX. SongY. LiuS. GaoM. QiY. ShangX. Linking genotype to phenotype in multi-omics data of small sample.BMC Genomics202122153710.1186/s12864‑021‑07867‑w 34256701
    [Google Scholar]
  2. GuoX. HanJ. SongY. YinZ. LiuS. ShangX. Using expression quantitative trait loci data and graph-embedded neural networks to uncover genotype–phenotype interactions.Front. Genet.20221392177510.3389/fgene.2022.921775 36046233
    [Google Scholar]
  3. GuoY. LiuS. LiZ. ShangX. BCDForest: A boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data.BMC Bioinformatics201819S5Suppl. 511810.1186/s12859‑018‑2095‑4 29671390
    [Google Scholar]
  4. GuoX. LuY. YinZ. ShangX. IPMM: Cancer subtype clustering model based on multiomics data and pathway and motif information.ChamSpringer International Publishing2020560568
    [Google Scholar]
  5. FisconG. ConteF. FarinaL. PaciP. SAveRUNNER: A network-based algorithm for drug repurposing and its application to COVID-19.PLOS Comput. Biol.2021172e100868610.1371/journal.pcbi.1008686 33544720
    [Google Scholar]
  6. van DrielM.A. BruggemanJ. VriendG. BrunnerH.G. LeunissenJ.A.M. A text-mining analysis of the human phenome.Eur. J. Hum. Genet.200614553554210.1038/sj.ejhg.5201585 16493445
    [Google Scholar]
  7. KimY. ParkJ.H. ChoY.R. Network-based approaches for disease-gene association prediction using protein-protein interaction networks.Int. J. Mol. Sci.20222313741110.3390/ijms23137411 35806415
    [Google Scholar]
  8. WuX. JiangR. ZhangM.Q. LiS. Network-based global inference of human disease genes.Mol. Syst. Biol.20084118910.1038/msb.2008.27 18463613
    [Google Scholar]
  9. GiladY. RifkinS.A. PritchardJ.K. Revealing the architecture of gene regulation: the promise of eQTL studies.Trends Genet.200824840841510.1016/j.tig.2008.06.001 18597885
    [Google Scholar]
  10. SchadtE.E. LambJ. YangX. An integrative genomics approach to infer causal associations between gene expression and disease.Nat. Genet.200537771071710.1038/ng1589 15965475
    [Google Scholar]
  11. ZhuZ. ZhangF. HuH. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.Nat. Genet.201648548148710.1038/ng.3538 27019110
    [Google Scholar]
  12. RoytmanM. KichaevG. GusevA. PasaniucB. Methods for fine-mapping with chromatin and expression data.PLoS Genet.2018142e100724010.1371/journal.pgen.1007240 29481575
    [Google Scholar]
  13. KöhlerS. GarganoM. MatentzogluN. The human phenotype ontology in 2021.Nucleic Acids Res.202149D1D1207D121710.1093/nar/gkaa1043 33264411
    [Google Scholar]
  14. MurtaghF. ContrerasP. Algorithms for hierarchical clustering: An overview.Wiley Interdiscip. Rev. Data Min. Knowl. Discov.201221869710.1002/widm.53
    [Google Scholar]
  15. HavensT.C. BezdekJ.C. LeckieC. HallL.O. PalaniswamiM. Fuzzy c-means algorithms for very large data.IEEE Trans. Fuzzy Syst.20122061130114610.1109/TFUZZ.2012.2201485
    [Google Scholar]
  16. KohonenT. The self-organizing map.Neurocomputing1998211-31610.1016/S0925‑2312(98)00030‑7
    [Google Scholar]
  17. WuF.X. Genetic weighted k-means algorithm for clustering large-scale gene expression data.BMC Bioinformatics20089S6Suppl. 6S1210.1186/1471‑2105‑9‑S6‑S12 18541047
    [Google Scholar]
  18. YouZ.H. HuangZ.A. ZhuZ. PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction.PLOS Comput. Biol.2017133e100545510.1371/journal.pcbi.1005455 28339468
    [Google Scholar]
  19. Ba-alawiW. SoufanO. EssackM. KalnisP. BajicV.B. DASPfind: new efficient method to predict drug–target interactions.J. Cheminform.2016811510.1186/s13321‑016‑0128‑4 26985240
    [Google Scholar]
  20. LuoJ LongY. NTSHMDA: Prediction of human microbe-disease association based on random walk by integrating network topological similarity.IEEE/ACM Trans Comput Biol Bioinform202017134151
    [Google Scholar]
  21. KöhlerS. BauerS. HornD. RobinsonP.N. Walking the interactome for prioritization of candidate disease genes.Am. J. Hum. Genet.200882494995810.1016/j.ajhg.2008.02.013 18371930
    [Google Scholar]
  22. LiY. PatraJ.C. Genome-wide inferring gene–phenotype relationship by walking on the heterogeneous network.Bioinformatics20102691219122410.1093/bioinformatics/btq108 20215462
    [Google Scholar]
  23. ChenX. LiuM.X. YanG.Y. RWRMDA: Predicting novel human microRNA–disease associations.Mol. Biosyst.20128102792279810.1039/c2mb25180a 22875290
    [Google Scholar]
  24. SmedleyD. HaiderS. DurinckS. The BioMart community portal: An innovative alternative to large, centralized data repositories.Nucleic Acids Res.201543W1W589-9810.1093/nar/gkv350 25897122
    [Google Scholar]
  25. Keshava PrasadTS GoelR KandasamyK Human protein reference database-2009 update.Nucleic Acids Res200937DatabaseD7677210.1093/nar/gkn892 18988627
    [Google Scholar]
  26. MathivananS. AhmedM. AhnN.G. Human Proteinpedia enables sharing of human protein data.Nat. Biotechnol.200826216416710.1038/nbt0208‑164 18259167
    [Google Scholar]
  27. PiñeroJ. BravoÀ. Queralt-RosinachN. DisGeNET: A comprehensive platform integrating information on human disease-associated genes and variants.Nucleic Acids Res.201745D1D833D83910.1093/nar/gkw943 27924018
    [Google Scholar]
  28. PengJ. HuiW. LiQ. A learning-based framework for miRNA-disease association identification using neural networks.Bioinformatics201935214364437110.1093/bioinformatics/btz254 30977780
    [Google Scholar]
  29. RamosE.M. HoffmanD. JunkinsH.A. Phenotype–genotype integrator (PheGenI): Synthesizing genome-wide association study (GWAS) data with existing genomic resources.Eur. J. Hum. Genet.201422114414710.1038/ejhg.2013.96 23695286
    [Google Scholar]
  30. CornishA.J. DavidA. SternbergM.J.E. PhenoRank: Reducing study bias in gene prioritization through simulation.Bioinformatics201834122087209510.1093/bioinformatics/bty028 29360927
    [Google Scholar]
  31. ZhangY. LiuJ. LiuX. Prioritizing disease genes with an improved dual label propagation framework.BMC Bioinformatics20181914710.1186/s12859‑018‑2040‑6 29422030
    [Google Scholar]
  32. YangK. WangR. LiuG. HerGePred: Heterogeneous network embedding representation for disease gene prediction.IEEE J. Biomed. Health Inform.20192341805181510.1109/JBHI.2018.2870728 31283472
    [Google Scholar]
/content/journals/cbio/10.2174/0115748936276861240109045208
Loading
/content/journals/cbio/10.2174/0115748936276861240109045208
Loading

Data & Media loading...


  • Article Type:
    Research Article
Keyword(s): CIPHER; Genotype; hyperparameter β; multi-omics statistical data; PBMDA; phenotype
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test