Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Rainone, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.04858  [pdf, other

    cs.AI cs.CL cs.LG

    CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

    Authors: Natasha Butt, Blazej Manczak, Auke Wiggers, Corrado Rainone, David W. Zhang, Michaël Defferrard, Taco Cohen

    Abstract: Large language models are increasingly solving tasks that are commonly believed to require human-level reasoning ability. However, these models still perform very poorly on benchmarks of general intelligence such as the Abstraction and Reasoning Corpus (ARC). In this paper, we approach ARC as a programming-by-examples problem, and introduce a novel and scalable method for language model self-impro… ▽ More

    Submitted 1 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: ICML'24 camera-ready version

  2. arXiv:2302.05446  [pdf, other

    cs.AI cs.LG cs.PL

    Robust Scheduling with GFlowNets

    Authors: David W. Zhang, Corrado Rainone, Markus Peschl, Roberto Bondesan

    Abstract: Finding the best way to schedule operations in a computation graph is a classical NP-hard problem which is central to compiler optimization. However, evaluating the goodness of a schedule on the target hardware can be very time-consuming. Traditional approaches as well as previous machine learning ones typically optimize proxy metrics, which are fast to evaluate but can lead to bad schedules when… ▽ More

    Submitted 14 February, 2023; v1 submitted 17 January, 2023; originally announced February 2023.

    Comments: Published at International Conference on Learning Representations (ICLR) 2023; an earlier version appeared at the NeurIPS 2022 workshop ML4Systems

  3. arXiv:2207.05899  [pdf, other

    cs.LG

    Neural Topological Ordering for Computation Graphs

    Authors: Mukul Gagrani, Corrado Rainone, Yang Yang, Harris Teague, Wonseok Jeon, Herke Van Hoof, Weiliang Will Zeng, Piero Zappi, Christopher Lott, Roberto Bondesan

    Abstract: Recent works on machine learning for combinatorial optimization have shown that learning based approaches can outperform heuristic methods in terms of speed and performance. In this paper, we consider the problem of finding an optimal topological order on a directed acyclic graph with focus on the memory minimization problem which arises in compilers. We propose an end-to-end machine learning base… ▽ More

    Submitted 7 October, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: To appear in NeurIPS 2022

  4. arXiv:2207.00283  [pdf, other

    hep-lat cond-mat.stat-mech cs.LG hep-th

    Learning Lattice Quantum Field Theories with Equivariant Continuous Flows

    Authors: Mathis Gerdes, Pim de Haan, Corrado Rainone, Roberto Bondesan, Miranda C. N. Cheng

    Abstract: We propose a novel machine learning method for sampling from the high-dimensional probability distributions of Lattice Field Theories, which is based on a single neural ODE layer and incorporates the full symmetries of the problem. We test our model on the $φ^4$ theory, showing that it systematically outperforms previously proposed flow-based methods in sampling efficiency, and the improvement is… ▽ More

    Submitted 20 December, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: 17 pages, 9 figures, 1 table; slightly expanded published version, added 2 figures and 2 sections to appendix

    Journal ref: SciPost Phys. 15, 238 (2023)

  5. arXiv:2110.02673  [pdf, other

    cs.LG cond-mat.stat-mech hep-lat

    Scaling Up Machine Learning For Quantum Field Theory with Equivariant Continuous Flows

    Authors: Pim de Haan, Corrado Rainone, Miranda C. N. Cheng, Roberto Bondesan

    Abstract: We propose a continuous normalizing flow for sampling from the high-dimensional probability distributions of Quantum Field Theories in Physics. In contrast to the deep architectures used so far for this task, our proposal is based on a shallow design and incorporates the symmetries of the problem. We test our model on the $φ^4$ theory, showing that it systematically outperforms a realNVP baseline… ▽ More

    Submitted 25 November, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: 8 pages, 5 figures. Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)