Showing 1–10 of 10 results for author: Masters, D

  1. arXiv:2312.15796  [pdf, other]

    cs.LG physics.ao-ph

    GenCast: Diffusion-based ensemble forecasting for medium-range weather

    Authors: Ilan Price, Alvaro Sanchez-Gonzalez, Ferran Alet, Tom R. Andersson, Andrew El-Kadi, Dominic Masters, Timo Ewalds, Jacklynn Stott, Shakir Mohamed, Peter Battaglia, Remi Lam, Matthew Willson

    Abstract: Weather forecasts are fundamentally uncertain, so predicting the range of probable weather scenarios is crucial for important decisions, from warning the public about hazardous weather, to planning renewable energy use. Here, we introduce GenCast, a probabilistic weather model with greater skill and speed than the top operational medium-range weather forecast in the world, the European Centre for…

    Submitted 1 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Main text 11 pages, Appendices 76 pages

  2. arXiv:2311.01135  [pdf, other]

    cs.LG physics.chem-ph

    Generating QM1B with PySCF$_{\text{IPU}}$

    Authors: Alexander Mathiasen, Hatem Helal, Kerstin Klaser, Paul Balanca, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew Fitzgibbon, Dominic Masters

    Abstract: The emergence of foundation models in Computer Vision and Natural Language Processing has resulted in immense progress on downstream tasks. This progress was enabled by datasets with billions of training examples. Similar benefits are yet to be unlocked for quantum chemistry, where the potential of deep learning is constrained by comparatively small datasets with 100k to 20M training examples. Th…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures. NeurIPS 2023 Track Datasets and Benchmarks

    ACM Class: I.2.6; J.2

  3. arXiv:2310.04292  [pdf, other]

    cs.LG

    Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

    Authors: Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michał Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris, et al. (10 additional authors not shown)

    Abstract: Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by…

    Submitted 18 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  4. arXiv:2303.16999  [pdf, other]

    cs.LG

    PopSparse: Accelerated block sparse matrix multiplication on IPU

    Authors: Zhiyi Li, Douglas Orr, Valeriu Ohan, Godfrey Da costa, Tom Murray, Adam Sanders, Deniz Beker, Dominic Masters

    Abstract: Reducing the computational cost of running large scale neural networks using sparsity has attracted great attention in the deep learning community. While much success has been achieved in reducing FLOP and parameter counts while maintaining acceptable task performance, achieving actual speed improvements has typically been much more difficult, particularly on general purpose accelerators (GPAs) su…

    Submitted 5 April, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  5. arXiv:2302.02947  [pdf, other]

    cs.LG

    GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Andrew Fitzgibbon, Shenyang Huang, Ladislav Rampášek, Dominique Beaini

    Abstract: We present GPS++, a hybrid Message Passing Neural Network / Graph Transformer model for molecular property prediction. Our model integrates a well-tuned local message passing component and biased global attention with other key ideas from prior literature to achieve state-of-the-art results on the large-scale molecular dataset PCQM4Mv2. Through a thorough ablation study we highlight the impact of indi…

    Submitted 12 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.02229

  6. arXiv:2212.02229  [pdf, other]

    q-bio.QM cs.LG

    GPS++: An Optimised Hybrid MPNN/Transformer for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Ladislav Rampášek, Dominique Beaini

    Abstract: This technical report presents GPS++, the first-place solution to the Open Graph Benchmark Large-Scale Challenge (OGB-LSC 2022) for the PCQM4Mv2 molecular property prediction task. Our approach implements several key principles from the prior literature. At its core our GPS++ method is a hybrid MPNN/Transformer model that incorporates 3D atom positions and an auxiliary denoising task. The effectiv…

    Submitted 6 December, 2022; v1 submitted 18 November, 2022; originally announced December 2022.

  7. arXiv:2206.02915  [pdf, other]

    cs.LG

    8-bit Numerical Formats for Deep Neural Networks

    Authors: Badreddine Noune, Philip Jones, Daniel Justus, Dominic Masters, Carlo Luschi

    Abstract: Given the current trend of increasing size and complexity of machine learning architectures, it has become of critical importance to identify new approaches to improve the computational efficiency of model training. In this context, we address the advantages of floating-point over fixed-point representation, and present an in-depth study on the use of 8-bit floating-point number formats for activa…

    Submitted 6 June, 2022; originally announced June 2022.

  8. arXiv:2106.03743  [pdf, other]

    cs.LG cs.CV stat.ML

    Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence

    Authors: Antoine Labatie, Dominic Masters, Zach Eaton-Rosen, Carlo Luschi

    Abstract: We investigate the reasons for the performance degradation incurred with batch-independent normalization. We find that the prototypical techniques of layer normalization and instance normalization both induce the appearance of failure modes in the neural network's pre-activations: (i) layer normalization induces a collapse towards channel-wise constant functions; (ii) instance normalization induce…

    Submitted 3 April, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready

  9. arXiv:2106.03640  [pdf, other]

    cs.LG cs.CV stat.ML

    Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training

    Authors: Dominic Masters, Antoine Labatie, Zach Eaton-Rosen, Carlo Luschi

    Abstract: Much recent research has been dedicated to improving the efficiency of training and inference for image classification. This effort has commonly focused on explicitly improving theoretical efficiency, often measured as ImageNet validation accuracy per FLOP. These theoretical savings have, however, proven challenging to achieve in practice, particularly on high-performance training accelerators.…

    Submitted 26 August, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  10. arXiv:1804.07612  [pdf, other]

    cs.LG cs.CV stat.ML

    Revisiting Small Batch Training for Deep Neural Networks

    Authors: Dominic Masters, Carlo Luschi

    Abstract: Modern deep neural network training is typically based on mini-batch stochastic gradient optimization. While the use of large mini-batches increases the available computational parallelism, small batch training has been shown to provide improved generalization performance and allows a significantly smaller memory footprint, which might also be exploited to improve machine throughput. In this pap…

    Submitted 20 April, 2018; originally announced April 2018.