The impact of library size and scale of testing on virtual screening

Fangyu Liu; Olivier Mailhot; Isabella S Glenn; Seth F Vigneron; Violla Bassim; Xinyu Xu; Karla Fonseca-Valencia; Matthew S Smith; Dmytro S Radchenko; James S Fraser; Yurii S Moroz; John J Irwin; Brian K Shoichet

doi:10.1038/s41589-024-01797-w

The impact of library size and scale of testing on virtual screening

Nat Chem Biol. 2025 Jan 3. doi: 10.1038/s41589-024-01797-w. Online ahead of print.

Authors

Affiliations

¹ Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA.
² Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
³ Enamine Ltd, Kyїv, Ukraine.
⁴ Enamine Ltd, Kyїv, Ukraine. [email protected].
⁵ Chemspace LLC, Kyїv, Ukraine. [email protected].
⁶ Department of Chemistry, Taras Shevchenko National University of Kyїv, Kyїv, Ukraine. [email protected].
⁷ Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA. [email protected].
⁸ Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA. [email protected].

PMID: 39753705
DOI: 10.1038/s41589-024-01797-w

Abstract

Virtual ligand libraries for ligand discovery have recently increased 10,000-fold. Whether this has improved hit rates and potencies has not been directly tested. Meanwhile, typically only dozens of docking hits are assayed, clouding hit-rate interpretation. Here we docked a 1.7 billion-molecule virtual library against β-lactamase, testing 1,521 new molecules and comparing the results to a 99 million-molecule screen where 44 molecules were tested. In a larger screen, hit rates improved twofold, more scaffolds were discovered and potency improved. Fifty-fold more inhibitors were found, supporting the idea that the large libraries harbor many more ligands than are being tested. In sampling smaller sets from the 1,521, hit rates only converged when several hundred molecules were tested. Hit rates and affinities improved steadily with docking score. It may be that as the scale of docking libraries and their testing grows, both ligands and our ability to rank them will improve.

Abstract

Grants and funding