Showing 1–19 of 19 results for author: Kirsch, L

  1. arXiv:2405.03878

    cs.LG cs.AI

    Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

    Authors: Aditya A. Ramesh, Kenny Young, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity. Temporal difference (TD) learning uses bootstrapping to overcome variance but introduces a bias that can only be corrected through many iterations. TD($λ$) provid…

    Submitted 4 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024 version

  2. arXiv:2402.16823

    cs.AI cs.CL cs.LG cs.MA

    Language Agents as Optimizable Graphs

    Authors: Mingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber

    Abstract: Various human-designed prompt engineering techniques have been proposed to improve problem solvers based on Large Language Models (LLMs), yielding many disparate code bases. We unify these approaches by describing LLM-based agents as computational graphs. The nodes implement functions to process multimodal data or query LLMs, and the edges describe the information flow between operations. Graphs c…

    Submitted 22 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Project Website: https://gptswarm.org ; GitHub Repo: https://github.com/metauto-ai/gptswarm . In Forty-first International Conference on Machine Learning (2024)

  3. arXiv:2402.05828

    cs.LG cs.AI

    Discovering Temporally-Aware Reinforcement Learning Algorithms

    Authors: Matthew Thomas Jackson, Chris Lu, Louis Kirsch, Robert Tjarko Lange, Shimon Whiteson, Jakob Nicolaus Foerster

    Abstract: Recent advancements in meta-learning have enabled the automatic discovery of novel reinforcement learning algorithms parameterized by surrogate objective functions. To improve upon manually designed algorithms, the parameterization of this learned objective function must be expressive enough to represent novel principles of learning (instead of merely recovering already established ones) while sti…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICLR 2024

  4. arXiv:2309.11197

    cs.LG cs.CL

    The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

    Authors: Aleksandar Stanić, Dylan Ashley, Oleg Serikov, Louis Kirsch, Francesco Faccio, Jürgen Schmidhuber, Thomas Hofmann, Imanol Schlag

    Abstract: The Languini Kitchen serves as both a research collective and codebase designed to empower researchers with limited computational resources to contribute meaningfully to the field of language modelling. We introduce an experimental protocol that enables model comparisons based on equivalent compute, measured in accelerator hours. The number of tokens on which a model is trained is defined by the m…

    Submitted 20 September, 2023; originally announced September 2023.

  5. arXiv:2305.17066

    cs.AI cs.CL cs.CV cs.LG cs.MA

    Mindstorms in Natural Language-Based Societies of Mind

    Authors: Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-Ping Fan, Bernard Ghanem, et al. (1 additional author not shown)

    Abstract: Both Minsky's "society of mind" and Schmidhuber's "learning to think" inspire diverse societies of large multimodal neural networks (NNs) that solve problems by interviewing each other in a "mindstorm." Recent implementations of NN-based societies of minds consist of large language models (LLMs) and other NN-based experts communicating through a natural language interface. In doing so, they overco…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 9 pages in main text + 7 pages of references + 38 pages of appendices, 14 figures in main text + 13 in appendices, 7 tables in appendices

    MSC Class: 68T07 ACM Class: I.2.6; I.2.11

  6. arXiv:2305.11252

    cs.NE cs.AI cs.LG q-bio.NC

    Brain-inspired learning in artificial neural networks: a review

    Authors: Samuel Schmidgall, Jascha Achterberg, Thomas Miconi, Louis Kirsch, Rojin Ziaei, S. Pardis Hajiseyedrazi, Jason Eshraghian

    Abstract: Artificial neural networks (ANNs) have emerged as an essential tool in machine learning, achieving remarkable success across diverse domains, including image and speech generation, game playing, and robotics. However, there exist fundamental differences between ANNs' operating mechanisms and those of the biological brain, particularly concerning learning processes. This paper presents a comprehens…

    Submitted 18 May, 2023; originally announced May 2023.

  7. arXiv:2212.14392

    cs.LG cs.AI cs.NE stat.ML

    Eliminating Meta Optimization Through Self-Referential Meta Learning

    Authors: Louis Kirsch, Jürgen Schmidhuber

    Abstract: Meta Learning automates the search for learning algorithms. At the same time, it creates a dependency on human engineering on the meta-level, where meta learning algorithms need to be designed. In this paper, we investigate self-referential meta learning systems that modify themselves without the need for explicit meta optimization. We discuss the relationship of such systems to in-context and mem…

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: The first version appeared at ICML 2022, DARL Workshop

  8. arXiv:2212.14374

    cs.LG cs.AI

    Learning One Abstract Bit at a Time Through Self-Invented Experiments Encoded as Neural Networks

    Authors: Vincent Herrmann, Louis Kirsch, Jürgen Schmidhuber

    Abstract: There are two important things in science: (A) Finding answers to given questions, and (B) Coming up with good questions. Our artificial scientists not only learn to answer given questions, but also continually invent new questions, by proposing hypotheses to be verified or falsified through potentially complex and time-consuming experiments, including thought experiments akin to those of mathemat…

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: 20 pages, 6 figures

  9. arXiv:2212.04458

    cs.LG cs.AI cs.NE stat.ML

    General-Purpose In-Context Learning by Meta-Learning Transformers

    Authors: Louis Kirsch, James Harrison, Jascha Sohl-Dickstein, Luke Metz

    Abstract: Modern machine learning requires system designers to specify aspects of the learning pipeline, such as losses, architectures, and optimizers. Meta-learning, or learning-to-learn, instead aims to learn those aspects, and promises to unlock greater capabilities with less manual effort. One particularly ambitious goal of meta-learning is to train general-purpose in-context learning algorithms from sc…

    Submitted 9 January, 2024; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: Published at the NeurIPS 2022 Workshop on Meta-Learning. Full version currently under review

  10. arXiv:2211.10282

    cs.LG

    Exploring through Random Curiosity with General Value Functions

    Authors: Aditya Ramesh, Louis Kirsch, Sjoerd van Steenkiste, Jürgen Schmidhuber

    Abstract: Efficient exploration in reinforcement learning is a challenging problem commonly addressed through intrinsic rewards. Recent prominent approaches are based on state novelty or variants of artificial curiosity. However, directly applying them to partially observable environments can be ineffective and lead to premature dissipation of intrinsic rewards. Here we propose random curiosity with general…

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022

  11. arXiv:2211.02222

    cs.LG

    The Benefits of Model-Based Generalization in Reinforcement Learning

    Authors: Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Model-Based Reinforcement Learning (RL) is widely believed to have the potential to improve sample efficiency by allowing an agent to synthesize large amounts of imagined experience. Experience Replay (ER) can be considered a simple kind of model, which has proved effective at improving the stability and efficiency of deep RL. In principle, a learned parametric model could improve on ER by general…

    Submitted 10 July, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Update to ICML version

  12. arXiv:2207.01570

    cs.LG stat.ML

    Goal-Conditioned Generators of Deep Policies

    Authors: Francesco Faccio, Vincent Herrmann, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Goal-conditioned Reinforcement Learning (RL) aims at learning optimal policies, given goals encoded in special command inputs. Here we study goal-conditioned neural nets (NNs) that learn to generate deep NN policies in form of context-specific weight matrices, similar to Fast Weight Programmers and other methods from the 1990s. Using context commands of the form "generate a policy that achieves a…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Preprint. Under Review

  13. arXiv:2109.10781

    cs.LG cs.AI cs.NE stat.ML

    Introducing Symmetries to Black Box Meta Reinforcement Learning

    Authors: Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram Friesen, Junhyuk Oh, Yutian Chen

    Abstract: Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the policy and the learning algorithm are jointly represented by a single neural network. These methods are very flexible, but they tend to underperform in terms of generalisation to new, unseen environments. In this paper, we explore the role of sy…

    Submitted 5 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: AAAI 2022

  14. arXiv:2012.14905

    cs.LG cs.AI cs.NE stat.ML

    Meta Learning Backpropagation And Improving It

    Authors: Louis Kirsch, Jürgen Schmidhuber

    Abstract: Many concepts have been proposed for meta learning with neural networks (NNs), e.g., NNs that learn to reprogram fast weights, Hebbian plasticity, learned learning rules, and meta recurrent NNs. Our Variable Shared Meta Learning (VSML) unifies the above and demonstrates that simple weight-sharing and sparsity in an NN is sufficient to express powerful learning algorithms (LAs) in a reusable fashio…

    Submitted 13 March, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: Updated to the NeurIPS 2021 camera ready; fixed typo in eq 4

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  15. arXiv:2006.09226

    cs.LG cs.AI stat.ML

    Parameter-Based Value Functions

    Authors: Francesco Faccio, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Traditional off-policy actor-critic Reinforcement Learning (RL) algorithms learn value functions of a single target policy. However, when value functions are updated to track the learned policy, they forget potentially useful information about old policies. We introduce a class of value functions called Parameter-Based Value Functions (PBVFs) whose inputs include the policy parameters. They can ge…

    Submitted 13 August, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published as a conference paper at ICLR 2021

  16. arXiv:1910.04098

    cs.LG cs.AI cs.NE stat.ML

    Improving Generalization in Meta Reinforcement Learning using Learned Objectives

    Authors: Louis Kirsch, Sjoerd van Steenkiste, Jürgen Schmidhuber

    Abstract: Biological evolution has distilled the experiences of many learners into the general learning algorithms of humans. Our novel meta reinforcement learning algorithm MetaGenRL is inspired by this process. MetaGenRL distills the experiences of many complex agents to meta-learn a low-complexity neural objective function that decides how future individuals will learn. Unlike recent meta-RL algorithms,…

    Submitted 14 February, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted to ICLR 2020

    ACM Class: I.2.6

  17. arXiv:1902.04340

    cs.LG cs.IT stat.ML

    Gaussian Mean Field Regularizes by Limiting Learned Information

    Authors: Julius Kunze, Louis Kirsch, Hippolyt Ritter, David Barber

    Abstract: Variational inference with a factorized Gaussian posterior estimate is a widely used approach for learning parameters and hidden variables. Empirically, a regularizing effect can be observed that is poorly understood. In this work, we show how mean field inference improves generalization by limiting mutual information between learned parameters and the data through noise. We quantify a maximum cap…

    Submitted 12 February, 2019; originally announced February 2019.

  18. arXiv:1811.05249

    cs.LG cs.AI stat.ML

    Modular Networks: Learning to Decompose Neural Computation

    Authors: Louis Kirsch, Julius Kunze, David Barber

    Abstract: Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number of parameters with a relatively small increase in resources. We propose a training algorithm that flexibly chooses neural modules based on the data to be proce…

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  19. arXiv:1706.00290

    cs.LG cs.CL cs.NE stat.ML

    Transfer Learning for Speech Recognition on a Budget

    Authors: Julius Kunze, Louis Kirsch, Ilia Kurenkov, Andreas Krug, Jens Johannsmeier, Sebastian Stober

    Abstract: End-to-end training of automated speech recognition (ASR) systems requires massive data and compute resources. We explore transfer learning based on model adaptation as an approach for training ASR models under constrained GPU memory, throughput and training data. We conduct several systematic experiments adapting a Wav2Letter convolutional neural network originally trained for English ASR to the…

    Submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for 2nd ACL Workshop on Representation Learning for NLP