2000
Volume 17, Issue 9
  • ISSN: 1574-8936
  • E-ISSN: 2212-392X

Abstract

This paper presents a pipeline for extracting biological knowledge from microarray gene expression data. The core of the pipeline is a canonical multi-objective Genetic Algorithm (GA) that takes a gene expression matrix and a factor as input. The factor groups the samples according to some criterion, e.g., healthy versus diseased tissue. A single run of the GA yields a gene set with good properties both at the individual level, in terms of differential expression, and at the aggregate level, in terms of correlation between expression profiles. The microarray data are obtained from the Gene Expression Omnibus (GEO). The pipeline analyzes independent runs of the GA, collects the genes common to all runs, and performs over-representation analysis. The process ends with a small number of genes of interest. The methodology is illustrated with a leukemia benchmark dataset, for which a group of genes of interest is obtained.
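
The post-processing steps described above (collecting the genes common to all runs, then testing for over-representation) can be sketched as follows. This is a minimal illustration, not the authors' implementation. The function names and gene identifiers are hypothetical. The sketch also assumes that over-representation is tested on the common genes, and it uses a one-sided hypergeometric test as a common choice for that analysis; the abstract does not say which test is used.

```python
from math import comb


def common_genes(runs):
    """Return the genes selected in every independent GA run.

    `runs` is a list of gene collections, one per run (hypothetical format).
    """
    sets = [set(r) for r in runs]
    return set.intersection(*sets) if sets else set()


def hypergeom_pvalue(k, n, K, N):
    """One-sided hypergeometric p-value P(X >= k).

    N: genes in the background (e.g. all genes on the array)
    K: background genes annotated with the category under test
    n: genes in the selected set
    k: selected genes annotated with the category
    """
    upper = min(n, K)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / comb(N, n)


# Usage with made-up gene identifiers (assumption: three independent runs).
runs = [
    ["G1", "G2", "G3"],
    ["G2", "G3", "G4"],
    ["G3", "G2", "G5"],
]
shared = common_genes(runs)

# Made-up annotation: which background genes belong to some category.
background = {"G1", "G2", "G3", "G4", "G5", "G6", "G7", "G8"}
category = {"G2", "G3", "G7"}
p = hypergeom_pvalue(
    k=len(shared & category),
    n=len(shared),
    K=len(category),
    N=len(background),
)
print(sorted(shared), p)
```

Requiring a gene to appear in every run is the strictest possible filter. A real analysis would also correct the resulting p-values for multiple testing across categories.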

/content/journals/cbio/10.2174/1574893617666220804112743
2022-11-01
2025-05-25