TY - JOUR
T1 - Virtual screening of the SAMPL4 blinded HIV integrase inhibitors dataset
AU - Colas, Claire
AU - Iorga, Bogdan I.
N1 - Funding Information:
Acknowledgments Our laboratory is a member of the Laboratory of Excellence in Research on Medication and Innovative Therapeutics (LERMIT) supported by a grant from the French National Research Agency (ANR-10-LABX-33). We would like to thank the SAMPL4 organizers, with a special mention to David L. Mobley, for providing the experimental data required for the evaluation of our predictions, as well as for the alternate analysis of the virtual screening results. The pertinent comments and suggestions of the manuscript reviewers are also kindly acknowledged.
PY - 2014/4
Y1 - 2014/4
N2 - Several combinations of docking software and scoring functions were evaluated for their ability to predict the binding of a dataset of potential HIV integrase inhibitors. We found that different docking software were appropriate for each one of the three binding sites considered (LEDGF, Y3 and fragment sites), and the most suitable two docking protocols, involving Glide SP and Gold ChemScore, were selected using a training set of compounds identified from the structural data available. These protocols could successfully predict respectively 20.0 and 23.6 % of the HIV integrase binders, all of them being present in the LEDGF site. When a different analysis of the results was carried out by removing all alternate isomers of binders from the set, our predictions were dramatically improved, with an overall ROC AUC of 0.73 and enrichment factor at 10 % of 2.89 for the prediction obtained using Gold ChemScore. This study highlighted the ability of the selected docking protocols to correctly position in most cases the ortho-alkoxy-carboxylate core functional group of the ligands in the corresponding binding site, but also their difficulties to correctly rank the docking poses.
AB - Several combinations of docking software and scoring functions were evaluated for their ability to predict the binding of a dataset of potential HIV integrase inhibitors. We found that different docking software were appropriate for each one of the three binding sites considered (LEDGF, Y3 and fragment sites), and the most suitable two docking protocols, involving Glide SP and Gold ChemScore, were selected using a training set of compounds identified from the structural data available. These protocols could successfully predict respectively 20.0 and 23.6 % of the HIV integrase binders, all of them being present in the LEDGF site. When a different analysis of the results was carried out by removing all alternate isomers of binders from the set, our predictions were dramatically improved, with an overall ROC AUC of 0.73 and enrichment factor at 10 % of 2.89 for the prediction obtained using Gold ChemScore. This study highlighted the ability of the selected docking protocols to correctly position in most cases the ortho-alkoxy-carboxylate core functional group of the ligands in the corresponding binding site, but also their difficulties to correctly rank the docking poses.
KW - Docking
KW - HIV integrase inhibitors
KW - SAMPL4 blind challenge
KW - Scoring function
KW - Virtual screening
UR - https://www.scopus.com/pages/publications/84902794813
U2 - 10.1007/s10822-014-9707-5
DO - 10.1007/s10822-014-9707-5
M3 - Article
C2 - 24458507
AN - SCOPUS:84902794813
SN - 0920-654X
VL - 28
SP - 455
EP - 462
JO - Journal of Computer-Aided Molecular Design
JF - Journal of Computer-Aided Molecular Design
IS - 4
ER -