Kernel-elastic autoencoder for molecular design

Haote Li; Yu Shee; Brandon Allen; Federica Maschietto; Anton Morgunov; Victor Batista

doi:10.1093/pnasnexus/pgae168

Kernel-elastic autoencoder for molecular design

PNAS Nexus. 2024 Apr 25;3(4):pgae168. doi: 10.1093/pnasnexus/pgae168. eCollection 2024 Apr.

Authors

Haote Li¹, Yu Shee¹, Brandon Allen¹, Federica Maschietto¹, Anton Morgunov¹, Victor Batista¹

Affiliation

¹ Department of Chemistry, Yale University, New Haven, CT 06520, USA.

Abstract

We introduce the kernel-elastic autoencoder (KAE), a self-supervised generative model based on the transformer architecture with enhanced performance for molecular design. KAE employs two innovative loss functions: modified maximum mean discrepancy (m-MMD) and weighted reconstruction ( $L_{WCEL}$ ). The m-MMD loss has significantly improved the generative performance of KAE when compared to using the traditional Kullback-Leibler loss of VAE, or standard maximum mean discrepancy. Including the weighted reconstruction loss $L_{WCEL}$ , KAE achieves valid generation and accurate reconstruction at the same time, allowing for generative behavior that is intermediate between VAE and autoencoder not available in existing generative approaches. Further advancements in KAE include its integration with conditional generation, setting a new state-of-the-art benchmark in constrained optimizations. Moreover, KAE has demonstrated its capability to generate molecules with favorable binding affinities in docking applications, as evidenced by AutoDock Vina and Glide scores, outperforming all existing candidates from the training dataset. Beyond molecular design, KAE holds promise to solve problems by generation across a broad spectrum of applications.

Keywords: generative modeling; molecular docking; molecular optimization.

Published by Oxford University Press on behalf of National Academy of Sciences 2024.

Grants and funding

T32 GM149438/GM/NIGMS NIH HHS/United States