Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Tripp, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07709  [pdf, other

    cs.LG physics.chem-ph stat.ML

    Diagnosing and fixing common problems in Bayesian optimization for molecule design

    Authors: Austin Tripp, José Miguel Hernández-Lobato

    Abstract: Bayesian optimization (BO) is a principled approach to molecular design tasks. In this paper we explain three pitfalls of BO which can cause poor empirical performance: an incorrect prior width, over-smoothing, and inadequate acquisition function maximization. We show that with these issues addressed, even a basic BO setup is able to achieve the highest overall performance on the PMO benchmark for… ▽ More

    Submitted 25 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures. ICML 2024 AI for science workshop (https://openreview.net/forum?id=V4aG4wsoIt). Code at: https://github.com/AustinT/basic-mol-bo-workshop2024

  2. arXiv:2310.20581  [pdf, other

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra… ▽ More

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  3. arXiv:2310.19796  [pdf, other

    cs.LG cs.AI q-bio.QM

    Re-evaluating Retrosynthesis Algorithms with Syntheseus

    Authors: Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl, Marwin Segler

    Abstract: The planning of how to synthesize molecules, also known as retrosynthesis, has been a growing focus of the machine learning and chemistry communities in recent years. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques. To remedy this, we present a benchmarking library called syntheseus which… ▽ More

    Submitted 19 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2310.09270  [pdf, other

    cs.AI cs.LG

    Retro-fallback: retrosynthetic planning in an uncertain world

    Authors: Austin Tripp, Krzysztof Maziarz, Sarah Lewis, Marwin Segler, José Miguel Hernández-Lobato

    Abstract: Retrosynthesis is the task of planning a series of chemical reactions to create a desired molecule from simpler, buyable molecules. While previous works have proposed algorithms to find optimal solutions for a range of metrics (e.g. shortest, lowest-cost), these works generally overlook the fact that we have imperfect knowledge of the space of possible reactions, meaning plans created by algorithm… ▽ More

    Submitted 13 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera ready version (https://openreview.net/forum?id=dl0u4ODCuW). 58 pages total. Code available at: https://github.com/AustinT/retro-fallback-iclr24. This version has 1) updated writing 2) updated figures 3) additional experimental results 4) more complete explanation of AND/OR graphs in the appendices 5) correct typos + error in fig G.5 caption

  5. arXiv:2310.09267  [pdf, ps, other

    cs.NE cs.LG q-bio.QM

    Genetic algorithms are strong baselines for molecule generation

    Authors: Austin Tripp, José Miguel Hernández-Lobato

    Abstract: Generating molecules, both in a directed and undirected fashion, is a huge part of the drug discovery pipeline. Genetic algorithms (GAs) generate molecules by randomly modifying known molecules. In this paper we show that GAs are very strong algorithms for such tasks, outperforming many complicated machine learning methods: a result which many researchers may find surprising. We therefore propose… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Currently under review. Code will be made available at a later date

  6. arXiv:2306.14809  [pdf, other

    cs.LG

    Tanimoto Random Features for Scalable Molecular Machine Learning

    Authors: Austin Tripp, Sergio Bacallado, Sukriti Singh, José Miguel Hernández-Lobato

    Abstract: The Tanimoto coefficient is commonly used to measure the similarity between molecules represented as discrete fingerprints, either as a distance metric or a positive definite kernel. While many kernel methods can be accelerated using random feature approximations, at present there is a lack of such approximations for the Tanimoto kernel. In this paper we propose two kinds of novel random features… ▽ More

    Submitted 13 November, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Camera-ready version presented at NeurIPS 2023. Updates include: notation changes, better description of features in section 4, updated experiments, link to code

  7. arXiv:2301.13755  [pdf, other

    cs.AI cs.LG

    Retrosynthetic Planning with Dual Value Networks

    Authors: Guoqing Liu, Di Xue, Shufang Xie, Yingce Xia, Austin Tripp, Krzysztof Maziarz, Marwin Segler, Tao Qin, Zongzhang Zhang, Tie-Yan Liu

    Abstract: Retrosynthesis, which aims to find a route to synthesize a target molecule from commercially available starting materials, is a critical task in drug discovery and materials design. Recently, the combination of ML-based single-step reaction predictors with multi-step planners has led to promising results. However, the single-step predictors are mostly trained offline to optimize the single-step ac… ▽ More

    Submitted 3 March, 2024; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted to ICML 2023

  8. arXiv:2212.04450  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    GAUCHE: A Library for Gaussian Processes in Chemistry

    Authors: Ryan-Rhys Griffiths, Leo Klarner, Henry B. Moss, Aditya Ravuri, Sang Truong, Samuel Stanton, Gary Tom, Bojana Rankovic, Yuanqi Du, Arian Jamasb, Aryan Deshwal, Julius Schwartz, Austin Tripp, Gregory Kell, Simon Frieder, Anthony Bourached, Alex Chan, Jacob Moss, Chengzhi Guo, Johannes Durholt, Saudamini Chaurasia, Felix Strieth-Kalthoff, Alpha A. Lee, Bingqing Cheng, Alán Aspuru-Guzik , et al. (2 additional authors not shown)

    Abstract: We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings… ▽ More

    Submitted 21 February, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  9. arXiv:2205.02708  [pdf, other

    cs.LG

    Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction

    Authors: Wenlin Chen, Austin Tripp, José Miguel Hernández-Lobato

    Abstract: We propose Adaptive Deep Kernel Fitting with Implicit Function Theorem (ADKF-IFT), a novel framework for learning deep kernel Gaussian processes (GPs) by interpolating between meta-learning and conventional deep kernel learning. Our approach employs a bilevel optimization objective where we meta-learn generally useful feature representations across tasks, in the sense that task-specific GP models… ▽ More

    Submitted 16 February, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at The Eleventh International Conference on Learning Representations (ICLR 2023); code available at: https://github.com/Wenlin-Chen/ADKF-IFT

  10. arXiv:2110.15486  [pdf, other

    stat.ML cs.LG q-bio.BM

    DOCKSTRING: easy molecular docking yields better benchmarks for ligand design

    Authors: Miguel García-Ortegón, Gregor N. C. Simm, Austin J. Tripp, José Miguel Hernández-Lobato, Andreas Bender, Sergio Bacallado

    Abstract: The field of machine learning for drug discovery is witnessing an explosion of novel methods. These methods are often benchmarked on simple physicochemical properties such as solubility or general druglikeness, which can be readily computed. However, these properties are poor representatives of objective functions in drug design, mainly because they do not depend on the candidate's interaction wit… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  11. arXiv:2006.09191  [pdf, other

    cs.LG stat.ML

    Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining

    Authors: Austin Tripp, Erik Daxberger, José Miguel Hernández-Lobato

    Abstract: Many important problems in science and engineering, such as drug design, involve optimizing an expensive black-box objective function over a complex, high-dimensional, and structured input space. Although machine learning techniques have shown promise in solving such problems, existing approaches substantially lack sample efficiency. We introduce an improved method for efficient black-box optimiza… ▽ More

    Submitted 25 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 23 pages, 14 figures; Includes supplementary material; NeurIPS 2020