Showing 1–23 of 23 results for author: Rothfuss, J

  1. arXiv:2403.16644

    cs.RO cs.LG

    Bridging the Sim-to-Real Gap with Bayesian Inference

    Authors: Jonas Rothfuss, Bhavya Sukhija, Lenart Treven, Florian Dörfler, Stelian Coros, Andreas Krause

    Abstract: We present SIM-FSVGD for learning robot dynamics from data. As opposed to traditional methods, SIM-FSVGD leverages low-fidelity physical priors, e.g., in the form of simulators, to regularize the training of neural network models. While learning accurate dynamics already in the low data regime, SIM-FSVGD scales and excels also when more data is available. We empirically show that learning with imp…

    Submitted 25 March, 2024; originally announced March 2024.

  2. arXiv:2403.05530

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2311.07558

    cs.LG cs.RO

    Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning

    Authors: Arjun Bhardwaj, Jonas Rothfuss, Bhavya Sukhija, Yarden As, Marco Hutter, Stelian Coros, Andreas Krause

    Abstract: We introduce PACOH-RL, a novel model-based Meta-Reinforcement Learning (Meta-RL) algorithm designed to efficiently adapt control policies to changing dynamics. PACOH-RL meta-learns priors for the dynamics model, allowing swift adaptation to new dynamics with minimal interaction data. Existing Meta-RL methods require abundant meta-learning data, limiting their applicability in settings such as robo…

    Submitted 6 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  4. arXiv:2303.01076

    cs.LG cs.AI stat.ML

    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

    Authors: Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

    Abstract: We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance. This is crucial when deciding whether a given policy satisfies certain minimal performance/safety criteria before it can be deployed in the real world. To this end, we introduce HA…

    Submitted 26 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2023, first three authors contributed equally

  5. arXiv:2211.07206

    stat.ML cs.LG

    Scalable PAC-Bayesian Meta-Learning via the PAC-Optimal Hyper-Posterior: From Theory to Practice

    Authors: Jonas Rothfuss, Martin Josifoski, Vincent Fortuin, Andreas Krause

    Abstract: Meta-Learning aims to speed up the learning process on new tasks by acquiring useful inductive biases from datasets of related learning tasks. While, in practice, the number of related tasks available is often small, most of the existing approaches assume an abundance of tasks; making them unrealistic and prone to overfitting. A central question in the meta-learning literature is how to regularize…

    Submitted 22 December, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: JMLR, 62 pages, text overlap with arXiv:2002.05551

    Journal ref: Journal of Machine Learning Research (24), 2023, 1-62

  6. arXiv:2211.01258

    stat.ML cs.LG

    Instance-Dependent Generalization Bounds via Optimal Transport

    Authors: Songyan Hou, Parnian Kassraie, Anastasis Kratsios, Andreas Krause, Jonas Rothfuss

    Abstract: Existing generalization bounds fail to explain crucial factors that drive the generalization of modern neural networks. Since such bounds often hold uniformly over all parameters, they suffer from over-parametrization and fail to account for the strong inductive bias of initialization and stochastic gradient descent. As an alternative, we propose a novel optimal transport interpretation of the gen…

    Submitted 13 November, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Journal of Machine Learning Research (JMLR), 51 pages

  7. arXiv:2210.15513

    stat.ML cs.AI cs.LG

    Lifelong Bandit Optimization: No Prior and No Regret

    Authors: Felix Schur, Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Machine learning algorithms are often repeatedly applied to problems with similar structure over and over again. We focus on solving a sequence of bandit optimization tasks and develop LIBO, an algorithm which adapts to the environment by learning from past experience and becomes more sample-efficient in the process. We assume a kernelized structure where the kernel is unknown but shared across al…

    Submitted 20 June, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 35 pages, 6 figures, In Proceedings of UAI 2023

  8. arXiv:2210.13319

    cs.LG stat.ML

    MARS: Meta-Learning as Score Matching in the Function Space

    Authors: Krunoslav Lehman Pavasovic, Jonas Rothfuss, Andreas Krause

    Abstract: Meta-learning aims to extract useful inductive biases from a set of related datasets. In Bayesian meta-learning, this is typically achieved by constructing a prior distribution over neural network parameters. However, specifying families of computationally viable prior distributions over the high-dimensional neural network parameters is difficult. As a result, existing approaches resort to meta-le…

    Submitted 10 June, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: In International Conference on Learning Representations (ICLR), 2023

  9. arXiv:2210.00762

    cs.LG cs.AI cs.RO stat.ML

    Meta-Learning Priors for Safe Bayesian Optimization

    Authors: Jonas Rothfuss, Christopher Koenig, Alisa Rupenyan, Andreas Krause

    Abstract: In robotics, optimizing controller parameters under safety constraints is an important challenge. Safe Bayesian optimization (BO) quantifies uncertainty in the objective and constraints to safely guide exploration in such settings. Hand-designing a suitable probabilistic model can be challenging, however. In the presence of unknown safety constraints, it is crucial to choose reliable model hyper-p…

    Submitted 12 June, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Conference on Robot Learning (CoRL) 2022

  10. arXiv:2206.01665

    cs.LG stat.ME stat.ML

    BaCaDI: Bayesian Causal Discovery with Unknown Interventions

    Authors: Alexander Hägele, Jonas Rothfuss, Lars Lorch, Vignesh Ram Somnath, Bernhard Schölkopf, Andreas Krause

    Abstract: Inferring causal structures from experimentation is a central task in many domains. For example, in biology, recent advances allow us to obtain single-cell expression data under multiple interventions such as drugs or gene knockouts. However, the targets of the interventions are often uncertain or unknown and the number of observations limited. As a result, standard causal discovery methods can no…

    Submitted 23 February, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted to AISTATS 2023. 26 pages

  11. arXiv:2205.12934

    cs.LG stat.ML

    Amortized Inference for Causal Structure Learning

    Authors: Lars Lorch, Scott Sussex, Jonas Rothfuss, Andreas Krause, Bernhard Schölkopf

    Abstract: Inferring causal structure poses a combinatorial search problem that typically involves evaluating structures with a score or independence test. The resulting search is costly, and designing suitable scores or tests that capture prior knowledge is difficult. In this work, we propose to amortize causal structure learning. Rather than searching over structures, we train a variational inference model…

    Submitted 15 December, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022, fixed formatting of Figure 5

  12. arXiv:2202.00602

    stat.ML cs.AI cs.LG

    Meta-Learning Hypothesis Spaces for Sequential Decision-making

    Authors: Parnian Kassraie, Jonas Rothfuss, Andreas Krause

    Abstract: Obtaining reliable, adaptive confidence sets for prediction functions (hypotheses) is a central challenge in sequential decision-making tasks, such as bandits and model-based reinforcement learning. These confidence sets typically rely on prior assumptions on the hypothesis space, e.g., the known kernel of a Reproducing Kernel Hilbert Space (RKHS). Hand-designing such kernels is error prone, and m…

    Submitted 17 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: 23 pages, 11 figures

  13. arXiv:2106.07635

    cs.LG cs.AI stat.ML

    Variational Causal Networks: Approximate Bayesian Inference over Causal Structures

    Authors: Yashas Annadani, Jonas Rothfuss, Alexandre Lacoste, Nino Scherrer, Anirudh Goyal, Yoshua Bengio, Stefan Bauer

    Abstract: Learning the causal structure that underlies data is a crucial step towards robust real-world decision making. The majority of existing work in causal inference focuses on determining a single directed acyclic graph (DAG) or a Markov equivalence class thereof. However, a crucial aspect to acting intelligently upon the knowledge about causal structure which has been inferred from finite data demand…

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages, 6 figures

  14. arXiv:2106.03195

    cs.LG cs.AI stat.ML

    Meta-Learning Reliable Priors in the Function Space

    Authors: Jonas Rothfuss, Dominique Heyn, Jinfan Chen, Andreas Krause

    Abstract: When data are scarce meta-learning can improve a learner's accuracy by harnessing previous experience from related learning tasks. However, existing methods have unreliable uncertainty estimates which are often overconfident. Addressing these shortcomings, we introduce a novel meta-learning framework, called F-PACOH, that treats meta-learned priors as stochastic processes and performs meta-level r…

    Submitted 11 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: In Advances of Neural Information Processing Systems (NeurIPS) 2021

  15. arXiv:2105.11839

    cs.LG stat.ML

    DiBS: Differentiable Bayesian Structure Learning

    Authors: Lars Lorch, Jonas Rothfuss, Bernhard Schölkopf, Andreas Krause

    Abstract: Bayesian structure learning allows inferring Bayesian network structure from data while reasoning about the epistemic uncertainty -- a key element towards enabling active causal discovery and designing interventions in real world systems. In this work, we propose a general, fully differentiable framework for Bayesian structure learning (DiBS) that operates in the continuous space of a latent proba…

    Submitted 16 December, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021; updated run time results

  16. arXiv:2103.06002

    cs.LG stat.ML

    Robustness to Pruning Predicts Generalization in Deep Neural Networks

    Authors: Lorenz Kuhn, Clare Lyle, Aidan N. Gomez, Jonas Rothfuss, Yarin Gal

    Abstract: Existing generalization measures that aim to capture a model's simplicity based on parameter counts or norms fail to explain generalization in overparameterized deep neural networks. In this paper, we introduce a new, theoretically motivated measure of a network's simplicity which we call prunability: the smallest fraction of the network's parameters that can be kept while pruning without a…

    Submitted 10 March, 2021; originally announced March 2021.

  17. arXiv:2002.05551

    stat.ML cs.LG

    PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees

    Authors: Jonas Rothfuss, Vincent Fortuin, Martin Josifoski, Andreas Krause

    Abstract: Meta-learning can successfully acquire useful inductive biases from data. Yet, its generalization properties to unseen learning tasks are poorly understood. Particularly if the number of meta-training tasks is small, this raises concerns about overfitting. We provide a theoretical analysis using the PAC-Bayesian framework and derive novel generalization bounds for meta-learning. Using these bounds…

    Submitted 18 June, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: International Conference on Machine Learning (ICML) 2021

    MSC Class: 68Q32

  18. arXiv:1907.08982

    stat.ML cs.LG

    Noise Regularization for Conditional Density Estimation

    Authors: Jonas Rothfuss, Fabio Ferreira, Simon Boehm, Simon Walther, Maxim Ulrich, Tamim Asfour, Andreas Krause

    Abstract: Modelling statistical relationships beyond the conditional mean is crucial in many settings. Conditional density estimation (CDE) aims to learn the full conditional probability density from data. Though highly expressive, neural network based CDE models can suffer from severe over-fitting when trained with the maximum likelihood objective. Due to the inherent structure of such models, classical re…

    Submitted 14 February, 2020; v1 submitted 21 July, 2019; originally announced July 2019.

  19. arXiv:1903.00954

    stat.ML cs.LG q-fin.CP q-fin.ST

    Conditional Density Estimation with Neural Networks: Best Practices and Benchmarks

    Authors: Jonas Rothfuss, Fabio Ferreira, Simon Walther, Maxim Ulrich

    Abstract: Given a set of empirical observations, conditional density estimation aims to capture the statistical relationship between a conditional variable $\mathbf{x}$ and a dependent variable $\mathbf{y}$ by modeling their conditional probability $p(\mathbf{y}|\mathbf{x})$. The paper develops best practices for conditional density estimation for finance applications with neural networks, grounded on mathe…

    Submitted 13 April, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

  20. arXiv:1810.06784

    cs.LG stat.ML

    ProMP: Proximal Meta-Policy Search

    Authors: Jonas Rothfuss, Dennis Lee, Ignasi Clavera, Tamim Asfour, Pieter Abbeel

    Abstract: Credit assignment in Meta-reinforcement learning (Meta-RL) is still poorly understood. Existing methods either neglect credit assignment to pre-adaptation behavior or implement it naively. This leads to poor sample-efficiency during meta-training as well as ineffective task identification strategies. This paper provides a theoretical analysis of credit assignment in gradient-based Meta-RL. Buildin…

    Submitted 11 February, 2022; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: The first three authors contributed equally. Published at ICLR 2019

  21. arXiv:1809.05214

    cs.LG cs.AI stat.ML

    Model-Based Reinforcement Learning via Meta-Policy Optimization

    Authors: Ignasi Clavera, Jonas Rothfuss, John Schulman, Yasuhiro Fujita, Tamim Asfour, Pieter Abbeel

    Abstract: Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic performance as model-free methods. We propose Model-Based Meta-Policy-Optimization (MB-MPO), an approach that foregoes the strong reliance on accurate learned dyn…

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: First 2 authors contributed equally. Accepted for Conference on Robot Learning (CoRL)

  22. arXiv:1807.00703

    cs.CV cs.AI cs.LG cs.RO

    Introducing the Simulated Flying Shapes and Simulated Planar Manipulator Datasets

    Authors: Fabio Ferreira, Jonas Rothfuss, Eren Erdal Aksoy, You Zhou, Tamim Asfour

    Abstract: We release two artificial datasets, Simulated Flying Shapes and Simulated Planar Manipulator that allow to test the learning ability of video processing systems. In particular, the dataset is meant as a tool which allows to easily assess the sanity of deep neural network models that aim to encode, reconstruct or predict video frame sequences. The datasets each consist of 90000 videos. The Simulate…

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: technical documentation, 2 figures, links to repositories

  23. arXiv:1801.04134

    cs.AI cs.CV cs.RO

    Deep Episodic Memory: Encoding, Recalling, and Predicting Episodic Experiences for Robot Action Execution

    Authors: Jonas Rothfuss, Fabio Ferreira, Eren Erdal Aksoy, You Zhou, Tamim Asfour

    Abstract: We present a novel deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and predicting action experiences. Our proposed unsupervised deep episodic memory model 1) encodes observed actions in a latent vector space and, based on this latent encoding, 2) infers most similar episodes previously experienced, 3) reconstruct…

    Submitted 14 July, 2018; v1 submitted 12 January, 2018; originally announced January 2018.