Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Klink, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.01885  [pdf, other

    cs.LG cs.RO

    Domain Randomization via Entropy Maximization

    Authors: Gabriele Tiboni, Pascal Klink, Jan Peters, Tatiana Tommasi, Carlo D'Eramo, Georgia Chalvatzaki

    Abstract: Varying dynamics parameters in simulation is a popular Domain Randomization (DR) approach for overcoming the reality gap in Reinforcement Learning (RL). Nevertheless, DR heavily hinges on the choice of the sampling distribution of the dynamics parameters, since high variability is crucial to regularize the agent's behavior but notoriously leads to overly conservative policies when randomizing exce… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Published as a conference paper at ICLR 2024. Project website at https://gabrieletiboni.github.io/doraemon/

  2. arXiv:2309.14096  [pdf, other

    cs.LG cs.RO

    Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning

    Authors: Pascal Klink, Florian Wolf, Kai Ploeger, Jan Peters, Joni Pajarinen

    Abstract: Reinforcement Learning (RL) allows learning non-trivial robot control laws purely from data. However, many successful applications of RL have relied on ad-hoc regularizations, such as hand-crafted curricula, to regularize the learning performance. In this paper, we pair a recent algorithm for automatically building curricula with RL on massively parallelized simulations to learn a tracking control… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  3. On the Benefit of Optimal Transport for Curriculum Reinforcement Learning

    Authors: Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

    Abstract: Curriculum reinforcement learning (CRL) allows solving complex tasks by generating a tailored sequence of learning tasks, starting from easy ones and subsequently increasing their difficulty. Although the potential of curricula in RL has been clearly shown in various works, it is less clear how to generate them for a given learning environment, resulting in various methods aiming to automate this… ▽ More

    Submitted 4 May, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  4. arXiv:2307.06055  [pdf, other

    cs.LG stat.ML

    Function-Space Regularization for Deep Bayesian Classification

    Authors: Jihao Andreas Lin, Joe Watson, Pascal Klink, Jan Peters

    Abstract: Bayesian deep learning approaches assume model parameters to be latent random variables and infer posterior distributions to quantify uncertainty, increase safety and trust, and prevent overconfident and unpredictable behavior. However, weight-space priors are model-specific, can be difficult to interpret and are hard to specify. Instead, we apply a Dirichlet prior in predictive space and perform… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Advances in Approximate Bayesian Inference 2023

  5. arXiv:2306.05769  [pdf, other

    cs.LG

    Self-Paced Absolute Learning Progress as a Regularized Approach to Curriculum Learning

    Authors: Tobias Niehues, Ulla Scheler, Pascal Klink

    Abstract: The usability of Reinforcement Learning is restricted by the large computation times it requires. Curriculum Reinforcement Learning speeds up learning by defining a helpful order in which an agent encounters tasks, i.e. from simple to hard. Curricula based on Absolute Learning Progress (ALP) have proven successful in different environments, but waste computation on repeating already learned behavi… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 11 pages, 8 figures. The paper was a result from an Integrated Project at TU Darmstadt for which we received course credit (9 ECTS) and is not meant to be published elsewhere

  6. arXiv:2211.01120  [pdf, other

    cs.LG cs.AI cs.RO

    Variational Hierarchical Mixtures for Probabilistic Learning of Inverse Dynamics

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Well-calibrated probabilistic regression models are a crucial learning component in robotics applications as datasets grow rapidly and tasks become more complex. Unfortunately, classical regression models are usually either probabilistic kernel machines with a flexible structure that does not scale gracefully with data or deterministic and vastly scalable automata, albeit with a restrictive parame… ▽ More

    Submitted 10 September, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2011.05217

  7. arXiv:2104.10986  [pdf, other

    cs.LG

    Reinforcement Learning using Guided Observability

    Authors: Stephan Weigand, Pascal Klink, Jan Peters, Joni Pajarinen

    Abstract: Due to recent breakthroughs, reinforcement learning (RL) has demonstrated impressive performance in challenging sequential decision-making problems. However, an open question is how to make RL cope with partial observability which is prevalent in many real-world problems. Contrary to contemporary RL approaches, which focus mostly on improved memory representations or strong assumptions about the t… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  8. arXiv:2102.13176  [pdf, other

    cs.LG

    A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

    Authors: Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan Peters, Joni Pajarinen

    Abstract: Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives. For reinforcement learning (RL), curricula are especially interesting, as the underlying optimization has a strong tendency to get stuck in local optima due to the exploration-exploitation trade-off. Recently, a number of approaches for a… ▽ More

    Submitted 2 September, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Journal ref: Journal of Machine Learning Research 22 (182), Pages 1-52, 2021

  9. arXiv:2011.05217  [pdf, other

    cs.LG cs.RO

    A Variational Infinite Mixture for Probabilistic Inverse Dynamics Learning

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Probabilistic regression techniques in control and robotics applications have to fulfill different criteria of data-driven adaptability, computational efficiency, scalability to high dimensions, and the capacity to deal with different modalities in the data. Classical regressors usually fulfill only a subset of these properties. In this work, we extend seminal work on Bayesian nonparametric mixtur… ▽ More

    Submitted 30 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  10. arXiv:2004.11812  [pdf, other

    cs.LG cs.AI stat.ML

    Self-Paced Deep Reinforcement Learning

    Authors: Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

    Abstract: Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically generate a curriculum for a given reinforcement learning (RL) agent, avoiding manual design. In this paper, we propose an answer by interpreting the curriculum gen… ▽ More

    Submitted 23 October, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

  11. arXiv:1911.00384  [pdf, other

    cs.AI

    Generalized Mean Estimation in Monte-Carlo Tree Search

    Authors: Tuan Dam, Pascal Klink, Carlo D'Eramo, Jan Peters, Joni Pajarinen

    Abstract: We consider Monte-Carlo Tree Search (MCTS) applied to Markov Decision Processes (MDPs) and Partially Observable MDPs (POMDPs), and the well-known Upper Confidence bound for Trees (UCT) algorithm. In UCT, a tree with nodes (states) and edges (actions) is incrementally built by the expansion of nodes, and the values of nodes are updated through a backup strategy based on the average value of child n… ▽ More

    Submitted 13 July, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

  12. arXiv:1910.02826  [pdf, other

    cs.LG stat.ML

    Self-Paced Contextual Reinforcement Learning

    Authors: Pascal Klink, Hany Abdulsamad, Boris Belousov, Jan Peters

    Abstract: Generalization and adaptation of learned skills to novel situations is a core requirement for intelligent autonomous robots. Although contextual reinforcement learning provides a principled framework for learning and generalization of behaviors across related tasks, it generally relies on uninformed sampling of environments from an unknown, uncontrolled context distribution, thus missing the benef… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.