Virtual High Throughput Screening Using Combined Random Forest and Flexible Docking

Dariusz Plewczynski; Marcin v. Grotthuss; Leszek Rychlewski; Krzysztof Ginalski

doi:10.2174/138620709788489000

ISSN: 1386-2073
E-ISSN: 1875-5402

Authors: Dariusz Plewczynski¹, Marcin v. Grotthuss², Leszek Rychlewski³ and Krzysztof Ginalski⁴

¹,²,³,⁴ Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw, Pawinskiego 5a, 02-106 Warsaw, Poland.
Source: Combinatorial Chemistry & High Throughput Screening, Volume 12, Issue 5, Jun 2009, p. 484 - 489
DOI: https://doi.org/10.2174/138620709788489000
- Available online: 01 Jun 2009

Abstract

We present the random forest supervised machine learning algorithm applied to flexible docking results from five typical virtual high throughput screening (HTS) studies. Our approach aims to: i) reduce the number of compounds that must be tested experimentally against a given protein target and ii) extend the results of flexible docking experiments, performed on only a subset of a chemical library, in order to select promising inhibitors from the whole dataset. The random forest (RF) method is applied and tested here on compounds from the MDL Drug Data Report (MDDR). The recall values for five selected diverse protein targets are over 90% and the performance reaches 100%. Combined with flexible docking, this machine learning method is capable of finding 60% of the active compounds for most protein targets by docking only 10% of the screened ligands. Therefore, our in silico approach is able to scan very large databases rapidly in order to predict the biological activity of small molecule inhibitors, and it provides an effective alternative to more computationally demanding methods in virtual HTS.
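The figures quoted in the abstract can be made concrete with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: it uses the standard definition of recall, and it reads the "60% of the actives by docking 10% of the ligands" claim as an enrichment factor, which is an interpretation on our part. The function names and the toy numbers are hypothetical.

```python
def recall(true_positives, false_negatives):
    """Fraction of the known actives that a classifier recovers."""
    return true_positives / (true_positives + false_negatives)


def enrichment_factor(fraction_of_actives_found, fraction_of_library_docked):
    """Ratio of actives found to what random selection of the same size would find."""
    return fraction_of_actives_found / fraction_of_library_docked


def actives_recovered(scores, is_active, fraction):
    """Fraction of all actives that fall in the top-scoring `fraction` of compounds.

    `scores` are hypothetical model scores (higher means more likely active);
    `is_active` holds 1 for a known active and 0 otherwise.
    """
    n_selected = max(1, round(fraction * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    found = sum(is_active[i] for i in ranked[:n_selected])
    return found / sum(is_active)


if __name__ == "__main__":
    # Hypothetical counts: 95 of 100 known actives recovered.
    print(recall(true_positives=95, false_negatives=5))
    # The abstract's figures: 60% of actives found by docking 10% of ligands.
    print(enrichment_factor(0.60, 0.10))
```

Under this reading, finding 60% of the actives in 10% of the library corresponds to roughly a sixfold enrichment over random selection. `actives_recovered` shows how such a fraction would be measured from a ranked list, given scores from any model.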

Article Type: Research Article

Keyword(s): atom pairs; compound identification; machine-learning methods; MDL drug data report; protein target specificity; random forest; virtual high throughput screening
