
Showing 1–8 of 8 results for author: Ibarz, B

  1. arXiv:2406.09308

    cs.CL cs.LG

    Transformers meet Neural Algorithmic Reasoners

    Authors: Wilfried Bounsi, Borja Ibarz, Andrew Dudzik, Jessica B. Hamrick, Larisa Markeeva, Alex Vitvitskyi, Razvan Pascanu, Petar Veličković

    Abstract: Transformers have revolutionized machine learning with their simple yet effective architecture. Pre-training Transformers on massive text datasets from the Internet has led to unmatched generalization for natural language understanding (NLU) tasks. However, such language models remain fragile when tasked with algorithmic forms of reasoning, where computations must be precise and robust. To address…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: To appear at CVPR 2024 Multimodal Algorithmic Reasoning (MAR) Workshop. 10 pages, 5 figures

  2. arXiv:2406.04229

    cs.LG cs.AI cs.CL cs.DS stat.ML

    The CLRS-Text Algorithmic Reasoning Language Benchmark

    Authors: Larisa Markeeva, Sean McLeish, Borja Ibarz, Wilfried Bounsi, Olga Kozlova, Alex Vitvitskyi, Charles Blundell, Tom Goldstein, Avi Schwarzschild, Petar Veličković

    Abstract: Eliciting reasoning capabilities from language models (LMs) is a critical direction on the path towards building intelligent systems. Most recent studies dedicated to reasoning focus on out-of-distribution performance on procedurally-generated synthetic benchmarks, bespoke-built to evaluate specific skills only. This trend makes results hard to transfer across publications, slowing down progress.…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint, under review. Comments welcome

  3. arXiv:2305.13233

    physics.comp-ph cond-mat.stat-mech cs.LG stat.ML

    Estimating Gibbs free energies via isobaric-isothermal flows

    Authors: Peter Wirnsberger, Borja Ibarz, George Papamakarios

    Abstract: We present a machine-learning model based on normalizing flows that is trained to sample from the isobaric-isothermal ensemble. In our approach, we approximate the joint distribution of a fully-flexible triclinic simulation box and particle coordinates to achieve a desired internal pressure. This novel extension of flow-based sampling to the isobaric-isothermal ensemble yields direct estimates of…

    Submitted 6 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 19 pages, 7 figures

  4. arXiv:2302.10258

    cs.LG cs.AI stat.ME

    Neural Algorithmic Reasoning with Causal Regularisation

    Authors: Beatrice Bevilacqua, Kyriacos Nikiforou, Borja Ibarz, Ioana Bica, Michela Paganini, Charles Blundell, Jovana Mitrovic, Petar Veličković

    Abstract: Recent work on neural algorithmic reasoning has investigated the reasoning capabilities of neural networks, effectively demonstrating they can learn to execute classical algorithms on unseen data coming from the train distribution. However, the performance of existing neural reasoners significantly degrades on out-of-distribution (OOD) test data, where inputs have larger sizes. In this work, we ma…

    Submitted 3 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: ICML 2023, Camera Ready; 17 pages, 7 figures

  5. arXiv:2209.11142

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms…

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  6. arXiv:2111.08696

    physics.comp-ph cond-mat.stat-mech stat.ML

    Normalizing flows for atomic solids

    Authors: Peter Wirnsberger, George Papamakarios, Borja Ibarz, Sébastien Racanière, Andrew J. Ballard, Alexander Pritzel, Charles Blundell

    Abstract: We present a machine-learning approach, based on normalizing flows, for modelling atomic solids. Our model transforms an analytically tractable base distribution into the target solid without requiring ground-truth samples for training. We report Helmholtz free energy estimates for cubic and hexagonal ice modelled as monatomic water as well as for a truncated and shifted Lennard-Jones system, and…

    Submitted 28 April, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: 20 pages, 7 figures

  7. arXiv:1811.06521

    cs.LG cs.AI cs.NE stat.ML

    Reward learning from human preferences and demonstrations in Atari

    Authors: Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

    Abstract: To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions. Instead, we can have humans communicate an objective to the agent directly. In this work, we combine two approaches to learning from human feedback: expert demonstrations and trajectory preferences. We train a deep neural network to model the reward function and use its predicte…

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  8. arXiv:1811.02861

    q-bio.NC cond-mat.dis-nn nlin.AO

    Growing Critical: Self-Organized Criticality in a Developing Neural System

    Authors: Felipe Yaroslav Kalle Kossio, Sven Goedeke, Benjamin van den Akker, Borja Ibarz, Raoul-Martin Memmesheimer

    Abstract: Experiments in various neural systems found avalanches: bursts of activity with characteristics typical for critical dynamics. A possible explanation for their occurrence is an underlying network that self-organizes into a critical state. We propose a simple spiking model for developing neural networks, showing how these may "grow into" criticality. Avalanches generated by our model correspond to…

    Submitted 7 November, 2018; originally announced November 2018.

    Comments: 6 pages, 4 figures, supplemental material: 10 pages, 7 figures

    Journal ref: Phys. Rev. Lett. 121(5), 058301, 2018