Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Beaini, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.11568  [pdf, other

    cs.LG

    On the Scalability of GNNs for Molecular Graphs

    Authors: Maciej Sypetkowski, Frederik Wenkel, Farimah Poursafaei, Nia Dickson, Karush Suri, Philip Fradkin, Dominique Beaini

    Abstract: Scaling deep learning models has been at the heart of recent revolutions in language modelling and image generation. Practitioners have observed a strong relationship between model size, dataset size, and performance. However, structure-based architectures such as Graph Neural Networks (GNNs) are yet to show the benefits of scale mainly due to the lower efficiency of sparse operations, large data… ▽ More

    Submitted 1 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2404.10242  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

    Authors: Oren Kraus, Kian Kenyon-Dean, Saber Saberian, Maryam Fallah, Peter McLean, Jess Leung, Vasudev Sharma, Ayla Khan, Jia Balakrishnan, Safiye Celik, Dominique Beaini, Maciej Sypetkowski, Chi Vicky Cheng, Kristen Morse, Maureen Makes, Ben Mabey, Berton Earnshaw

    Abstract: Featurizing microscopy images for use in biological research remains a significant challenge, especially for large-scale experiments spanning millions of images. This work explores the scaling properties of weakly supervised classifiers and self-supervised masked autoencoders (MAEs) when training with increasingly larger model backbones and microscopy datasets. Our results show that ViT-based MAEs… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Highlight. arXiv admin note: text overlap with arXiv:2309.16064

  3. arXiv:2311.01135  [pdf, other

    cs.LG physics.chem-ph

    Generating QM1B with PySCF$_{\text{IPU}}$

    Authors: Alexander Mathiasen, Hatem Helal, Kerstin Klaser, Paul Balanca, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew Fitzgibbon, Dominic Masters

    Abstract: The emergence of foundation models in Computer Vision and Natural Language Processing have resulted in immense progress on downstream tasks. This progress was enabled by datasets with billions of training examples. Similar benefits are yet to be unlocked for quantum chemistry, where the potential of deep learning is constrained by comparatively small datasets with 100k to 20M training examples. Th… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures. NeurIPS 2023 Track Datasets and Benchmarks

    ACM Class: I.2.6; J.2

  4. arXiv:2311.00862  [pdf, other

    physics.chem-ph cs.LG

    Role of Structural and Conformational Diversity for Machine Learning Potentials

    Authors: Nikhil Shenoy, Prudencio Tossou, Emmanuel Noutahi, Hadrien Mary, Dominique Beaini, Jiarui Ding

    Abstract: In the field of Machine Learning Interatomic Potentials (MLIPs), understanding the intricate relationship between data biases, specifically conformational and structural diversity, and model generalization is critical in improving the quality of Quantum Mechanics (QM) data generation efforts. We investigate these dynamics through two distinct experiments: a fixed budget one, where the dataset size… ▽ More

    Submitted 30 October, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023 AI4D3 and AI4S workshops

  5. arXiv:2310.04292  [pdf, other

    cs.LG

    Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

    Authors: Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michał Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris , et al. (10 additional authors not shown)

    Abstract: Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by… ▽ More

    Submitted 18 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  6. arXiv:2307.07107  [pdf, other

    cs.LG

    Graph Positional and Structural Encoder

    Authors: Semih Cantürk, Renming Liu, Olivier Lapointe-Gagné, Vincent Létourneau, Guy Wolf, Dominique Beaini, Ladislav Rampášek

    Abstract: Positional and structural encodings (PSE) enable better identifiability of nodes within a graph, rendering them essential tools for empowering modern GNNs, and in particular graph Transformers. However, designing PSEs that work optimally for all graph prediction tasks is a challenging and unsolved problem. Here, we present the Graph Positional and Structural Encoder (GPSE), the first-ever graph en… ▽ More

    Submitted 10 June, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted at ICML 2024; 34 pages, 6 figures

  7. arXiv:2302.02947  [pdf, other

    cs.LG

    GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Andrew Fitzgibbon, Shenyang Huang, Ladislav Rampášek, Dominique Beaini

    Abstract: We present GPS++, a hybrid Message Passing Neural Network / Graph Transformer model for molecular property prediction. Our model integrates a well-tuned local message passing component and biased global attention with other key ideas from prior literature to achieve state-of-the-art results on large-scale molecular dataset PCQM4Mv2. Through a thorough ablation study we highlight the impact of indi… ▽ More

    Submitted 12 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.02229

  8. arXiv:2301.11517  [pdf, other

    cs.LG

    Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration

    Authors: Xiangyu Zhao, Hannes Stärk, Dominique Beaini, Yiren Zhao, Pietro Liò

    Abstract: It has been increasingly demanding to develop reliable methods to evaluate the progress of Graph Neural Network (GNN) research for molecular representation learning. Existing GNN benchmarking methods for molecular representation learning focus on comparing the GNNs' performances on some node/graph classification/regression tasks on certain datasets. However, there lacks a principled, task-agnostic… ▽ More

    Submitted 26 March, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 11th International Conference on Learning Representations (ICLR 2023) Machine Learning for Drug Discovery (MLDD) Workshop. 17 pages, 6 figures, 4 tables

  9. arXiv:2212.02229  [pdf, other

    q-bio.QM cs.LG

    GPS++: An Optimised Hybrid MPNN/Transformer for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Ladislav Rampášek, Dominique Beaini

    Abstract: This technical report presents GPS++, the first-place solution to the Open Graph Benchmark Large-Scale Challenge (OGB-LSC 2022) for the PCQM4Mv2 molecular property prediction task. Our approach implements several key principles from the prior literature. At its core our GPS++ method is a hybrid MPNN/Transformer model that incorporates 3D atom positions and an auxiliary denoising task. The effectiv… ▽ More

    Submitted 6 December, 2022; v1 submitted 18 November, 2022; originally announced December 2022.

  10. arXiv:2206.08164  [pdf, other

    cs.LG

    Long Range Graph Benchmark

    Authors: Vijay Prakash Dwivedi, Ladislav Rampášek, Mikhail Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, Dominique Beaini

    Abstract: Graph Neural Networks (GNNs) that are based on the message passing (MP) paradigm generally exchange information between 1-hop neighbors to build node representations at each layer. In principle, such networks are not able to capture long-range interactions (LRI) that may be desired or necessary for learning a given task on graphs. Recently, there has been an increasing interest in development of T… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Added reference to Tönshoff et al., 2023 in Sec. 4.1; NeurIPS 2022 Track on D&B; Open-sourced at: https://github.com/vijaydwivedi75/lrgb

  11. arXiv:2205.12454  [pdf, other

    cs.LG

    Recipe for a General, Powerful, Scalable Graph Transformer

    Authors: Ladislav Rampášek, Mikhail Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, Dominique Beaini

    Abstract: We propose a recipe on how to build a general, powerful, scalable (GPS) graph Transformer with linear complexity and state-of-the-art results on a diverse set of benchmarks. Graph Transformers (GTs) have gained popularity in the field of graph representation learning with a variety of recent publications but they lack a common foundation about what constitutes a good positional or structural encod… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NeurIPS 2022

  12. arXiv:2110.04126  [pdf, other

    cs.LG cs.AI q-bio.BM

    3D Infomax improves GNNs for Molecular Property Prediction

    Authors: Hannes Stärk, Dominique Beaini, Gabriele Corso, Prudencio Tossou, Christian Dallago, Stephan Günnemann, Pietro Liò

    Abstract: Molecular property prediction is one of the fastest-growing applications of deep learning with critical real-world impacts. Including 3D molecular structure as input to learned models improves their performance for many molecular tasks. However, this information is infeasible to compute at the scale required by several real-world applications. We propose pre-training a model to reason about the ge… ▽ More

    Submitted 4 June, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: 39th International Conference on Machine Learning (ICML 2022). Also accepted at NeurIPS 2021 ML4PH, AI4S, and SSL workshops and as oral at ELLIS ML4Molecules. 24 pages, 7 figures, 18 tables

    Journal ref: 39th International Conference on Machine Learning (ICML 2022)

  13. arXiv:2106.03893  [pdf, other

    cs.LG

    Rethinking Graph Transformers with Spectral Attention

    Authors: Devin Kreuzer, Dominique Beaini, William L. Hamilton, Vincent Létourneau, Prudencio Tossou

    Abstract: In recent years, the Transformer architecture has proven to be very successful in sequence processing, but its application to other data structures, such as graphs, has remained limited due to the difficulty of properly defining positions. Here, we present the $\textit{Spectral Attention Network}$ (SAN), which uses a learned positional encoding (LPE) that can take advantage of the full Laplacian s… ▽ More

    Submitted 27 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted in Proceedings of NeurIPS 2021

  14. arXiv:2010.02863  [pdf, other

    cs.LG cs.CG cs.SI

    Directional Graph Networks

    Authors: Dominique Beaini, Saro Passaro, Vincent Létourneau, William L. Hamilton, Gabriele Corso, Pietro Liò

    Abstract: The lack of anisotropic kernels in graph neural networks (GNNs) strongly limits their expressiveness, contributing to well-known issues such as over-smoothing. To overcome this limitation, we propose the first globally consistent anisotropic kernels for GNNs, allowing for graph convolutions that are defined according to topologicaly-derived directional flows. First, by defining a vector field in t… ▽ More

    Submitted 7 April, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 11 pages, 10 pages appendix, 6 figures, subtitle: Anisotropic aggregation in graph neural networks via directional vector fields

  15. arXiv:2004.05718  [pdf, other

    cs.LG cs.CV stat.ML

    Principal Neighbourhood Aggregation for Graph Nets

    Authors: Gabriele Corso, Luca Cavalleri, Dominique Beaini, Pietro Liò, Petar Veličković

    Abstract: Graph Neural Networks (GNNs) have been shown to be effective models for different predictive tasks on graph-structured data. Recent work on their expressive power has focused on isomorphism tasks and countable feature spaces. We extend this theoretical framework to include continuous features - which occur regularly in real-world input domains and within the hidden layers of GNNs - and we demonstr… ▽ More

    Submitted 31 December, 2020; v1 submitted 12 April, 2020; originally announced April 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  16. arXiv:2003.05182  [pdf

    cs.CV cs.AI

    Improving Convolutional Neural Networks Via Conservative Field Regularisation and Integration

    Authors: Dominique Beaini, Sofiane Achiche, Maxime Raison

    Abstract: Current research in convolutional neural networks (CNN) focuses mainly on changing the architecture of the networks, optimizing the hyper-parameters and improving the gradient descent. However, most work use only 3 standard families of operations inside the CNN, the convolution, the activation function, and the pooling. In this work, we propose a new family of operations based on the Green's funct… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: 11 pages, 3 figures

  17. arXiv:2002.04380  [pdf

    cs.CV

    Saliency Enhancement using Gradient Domain Edges Merging

    Authors: Dominique Beaini, Sofiane Achiche, Alexandre Duperre, Maxime Raison

    Abstract: In recent years, there has been a rapid progress in solving the binary problems in computer vision, such as edge detection which finds the boundaries of an image and salient object detection which finds the important object in an image. This progress happened thanks to the rise of deep-learning and convolutional neural networks (CNN) which allow to extract complex and abstract features. However, e… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  18. Deep Green Function Convolution for Improving Saliency in Convolutional Neural Networks

    Authors: Dominique Beaini, Sofiane Achiche, Alexandre Duperré, Maxime Raison

    Abstract: Current saliency methods require to learn large scale regional features using small convolutional kernels, which is not possible with a simple feed-forward network. Some methods solve this problem by using segmentation into superpixels while others downscale the image through the network and rescale it back to its original size. The objective of this paper is to show that saliency convolutional ne… ▽ More

    Submitted 14 November, 2019; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: 15 pages, 11 figures

  19. arXiv:1905.11577  [pdf, other

    cs.LG q-bio.BM stat.ML

    Towards Interpretable Sparse Graph Representation Learning with Laplacian Pooling

    Authors: Emmanuel Noutahi, Dominique Beaini, Julien Horwood, Sébastien Giguère, Prudencio Tossou

    Abstract: Recent work in graph neural networks (GNNs) has led to improvements in molecular activity and property prediction tasks. Unfortunately, GNNs often fail to capture the relative importance of interactions between molecular substructures, in part due to the absence of efficient intermediate pooling steps. To address these issues, we propose LaPool (Laplacian Pooling), a novel, data-driven, and interp… ▽ More

    Submitted 2 April, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 11 pages, with Appendices

  20. arXiv:1902.00176  [pdf

    cs.CV cs.DM

    Fast and Optimal Laplacian Solver for Gradient-Domain Image Editing using Green Function Convolution

    Authors: Dominique Beaini, Sofiane Achiche, Fabrice Nonez, Olivier Brochu Dufour, Cédric Leblond-Ménard, Mahdis Asaadi, Maxime Raison

    Abstract: In computer vision, the gradient and Laplacian of an image are used in different applications, such as edge detection, feature extraction, and seamless image cloning. Computing the gradient of an image is straightforward since numerical derivatives are available in most computer vision toolboxes. However, the reverse problem is more difficult, since computing an image from its gradient requires to… ▽ More

    Submitted 1 July, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Comments: 17 pages, single column scientific paper. Patent submitted

  21. arXiv:1806.07996  [pdf

    cs.CV

    Novel Convolution Kernels for Computer Vision and Shape Analysis based on Electromagnetism

    Authors: Dominique Beaini, Sofiane Achiche, Yann-Seing Law-Kam Cio, Maxime Raison

    Abstract: Computer vision is a growing field with a lot of new applications in automation and robotics, since it allows the analysis of images and shapes for the generation of numerical or analytical information. One of the most used method of information extraction is image filtering through convolution kernels, with each kernel specialized for specific applications. The objective of this paper is to prese… ▽ More

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Keywords: Shape analysis; Stroke analysis; Computer vision; Electromagnetic potential field; Feature extraction; Image filtering; Image convolution Published in PolyPublie: https://publications.polymtl.ca/3162/

    Journal ref: Beaini, D., Achiche, S., Law-Kam Cio, Y.-S. & Raison, M. (2018). Novel convolution kernels for computer vision and shape analysis based on electromagnetism (Report). https://publications.polymtl.ca/3162/

  22. arXiv:1806.01339  [pdf

    cs.CV math.NA

    Computing the Spatial Probability of Inclusion inside Partial Contours for Computer Vision Applications

    Authors: Dominique Beaini, Sofiane Achiche, Fabrice Nonez, Maxime Raison

    Abstract: In Computer Vision, edge detection is one of the favored approaches for feature and object detection in images since it provides information about their objects boundaries. Other region-based approaches use probabilistic analysis such as clustering and Markov random fields, but those methods cannot be used to analyze edges and their interaction. In fact, only image segmentation can produce regions… ▽ More

    Submitted 18 August, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: Keywords: Computer vision; Stroke analysis; Partial contour; Probability of inclusion; Edge interaction; Image convolution; Electromagnetic potential field