Using Machine Learning Methods to Predict Experimental High Throughput Screening Data

Cherif Mballo; Vladimir Makarenkov

doi:10.2174/138620710791292958

ISSN: 1386-2073
E-ISSN: 1875-5402

Using Machine Learning Methods to Predict Experimental High Throughput Screening Data
Authors: Cherif Mballo¹ and Vladimir Makarenkov²
View Affiliations Hide Affiliations

¹ Departement d'informatique, Universite du Quebec a Montreal, C.P. 8888, Succursale Centre-Ville, Montreal (QC) H3C 3P8, Canada. ² Departement d'informatique, Universite du Quebec a Montreal, C.P. 8888, Succursale Centre-Ville, Montreal (QC) H3C 3P8, Canada.
Source: Combinatorial Chemistry & High Throughput Screening, Volume 13, Issue 5, Jun 2010, p. 430 - 441
DOI: https://doi.org/10.2174/138620710791292958
- Available online: 01 Jun 2010

Abstract

High throughput screening (HTS) remains a very costly process notwithstanding many recent technological advances in the field of biotechnology. In this study we consider the application of machine learning methods for predicting experimental HTS measurements. Such a virtual HTS analysis can be based on the results of real HTS campaigns carried out with similar compounds libraries and similar drug targets. In this way, we analyzed Test assay from McMaster University Data Mining and Docking Competition [1] using binary decision trees, neural networks, support vector machines (SVM), linear discriminant analysis, k-nearest neighbors and partial least squares. First, we studied separately the sets of molecular and atomic descriptors in order to establish which of them provides a better prediction. Then, the comparison of the six considered machine learning methods was made in terms of false positives and false negatives, method's sensitivity and enrichment factor. Finally, a variable selection procedure allowing one to improve the method's sensitivity was implemented and applied in the framework of polynomial SVM.

Article metrics loading...

/content/journals/cchts/10.2174/138620710791292958

2010-06-01

2026-02-11

From This Site

/content/journals/cchts/10.2174/138620710791292958

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/cchts/10.2174/138620710791292958

Article Type: Research Article

Keyword(s): CART; decision trees; drug target; hit; k-nearest neighbors (kNN); linear discriminant analysis (LDA); neural networks (NN); partial least squares (PLS); ROC curve; sampling; support vector machines (SVM); virtual high throughput screening

Using Machine Learning Methods to Predict Experimental High Throughput Screening Data

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Privileged Structures: Applications in Drug Discovery

Computational Methods in Developing Quantitative Structure-Activity Relationships (QSAR): A Review

Recent Advances on Potentiometric Membrane Sensors for Pharmaceutical Analysis

Label-Free Detection of Biomolecular Interactions Using BioLayer Interferometry for Kinetic Characterization

Metalloproteinase Inhibitors for the Disintegrin-Like Metalloproteinases ADAM10 and ADAM17 that Differentially Block Constitutive and Phorbol Ester-Inducible Shedding of Cell Surface Molecules

On Various Metrics Used for Validation of Predictive QSAR Models with Applications in Virtual Screening and Focused Library Design

Diversity Among Microbial Cyclic Lipopeptides: Iturins and Surfactins. Activity-Structure Relationships to Design New Bioactive Agents

Building a Tiered Approach to In Vitro Predictive Toxicity Screening: A Focus on Assays with In Vivo Relevance

Antioxidants and Inflammatory Disease: Synthetic and Natural Antioxidants with Anti-Inflammatory Activity

Machine Learning in Virtual Screening