Showing 1–10 of 10 results for author: Tjanaka, B

  1. Quality Diversity for Robot Learning: Limitations and Future Directions

    Authors: Sumeet Batra, Bryon Tjanaka, Stefanos Nikolaidis, Gaurav Sukhatme

    Abstract: Quality Diversity (QD) has shown great success in discovering high-performing, diverse policies for robot skill learning. While current benchmarks have led to the development of powerful QD methods, we argue that new paradigms must be developed to facilitate open-ended search and generalizability. In particular, many methods focus on learning diverse agents that each move to a different xy positio…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted to GECCO 2024

  2. arXiv:2312.11331  [pdf, other]

    cs.LG cs.NE

    Density Descent for Diversity Optimization

    Authors: David H. Lee, Anishalakshmi V. Palaparthi, Matthew C. Fontaine, Bryon Tjanaka, Stefanos Nikolaidis

    Abstract: Diversity optimization seeks to discover a set of solutions that elicit diverse features. Prior work has proposed Novelty Search (NS), which, given a current set of solutions, seeks to expand the set by finding points in areas of low density in the feature space. However, to estimate density, NS relies on a heuristic that considers the k-nearest neighbors of the search point in the feature space,…

    Submitted 30 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 15 pages, 5 figures, published as a conference paper at the 2024 Genetic and Evolutionary Computation Conference (GECCO '24)

  3. arXiv:2305.13795  [pdf, other]

    cs.LG cs.AI

    Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning

    Authors: Sumeet Batra, Bryon Tjanaka, Matthew C. Fontaine, Aleksei Petrenko, Stefanos Nikolaidis, Gaurav Sukhatme

    Abstract: Training generally capable agents that thoroughly explore their environment and learn new and diverse skills is a long-term goal of robot learning. Quality Diversity Reinforcement Learning (QD-RL) is an emerging research area that blends the best aspects of both fields -- Quality Diversity (QD) provides a principled form of exploration and produces collections of behaviorally diverse agents, while…

    Submitted 29 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted as a spotlight paper at ICLR 2024

  4. arXiv:2304.13787  [pdf, other]

    cs.RO cs.HC cs.LG

    Surrogate Assisted Generation of Human-Robot Interaction Scenarios

    Authors: Varun Bhatt, Heramb Nemlekar, Matthew C. Fontaine, Bryon Tjanaka, Hejia Zhang, Ya-Chuan Hsu, Stefanos Nikolaidis

    Abstract: As human-robot interaction (HRI) systems advance, so does the difficulty of evaluating and understanding the strengths and limitations of these systems in different environments and with different users. To this end, previous methods have algorithmically generated diverse scenarios that reveal system failures in a shared control teleoperation task. However, these methods require directly evaluatin…

    Submitted 31 October, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 27 pages; 12 figures; 3 tables; Accepted for oral presentation at CoRL 2023

  5. arXiv:2303.00191  [pdf, other]

    cs.NE cs.LG cs.SE

    pyribs: A Bare-Bones Python Library for Quality Diversity Optimization

    Authors: Bryon Tjanaka, Matthew C. Fontaine, David H. Lee, Yulun Zhang, Nivedit Reddy Balam, Nathaniel Dennler, Sujay S. Garlanka, Nikitas Dimitri Klapsis, Stefanos Nikolaidis

    Abstract: Recent years have seen a rise in the popularity of quality diversity (QD) optimization, a branch of optimization that seeks to find a collection of diverse, high-performing solutions to a given problem. To grow further, we believe the QD community faces two challenges: developing a framework to represent the field's growing array of algorithms, and implementing that framework in software that supp…

    Submitted 14 April, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    Comments: Published as a conference paper at the 2023 Genetic and Evolutionary Computation Conference (GECCO '23); Pyribs is available at https://pyribs.org

  6. arXiv:2210.02622  [pdf, other]

    cs.RO cs.LG cs.NE

    Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing

    Authors: Bryon Tjanaka, Matthew C. Fontaine, David H. Lee, Aniruddha Kalkar, Stefanos Nikolaidis

    Abstract: Pre-training a diverse set of neural network controllers in simulation has enabled robots to adapt online to damage in robot locomotion tasks. However, finding diverse, high-performing controllers requires expensive network training and extensive tuning of a large number of hyperparameters. On the other hand, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), an evolution strategies (ES)-based…

    Submitted 15 September, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Source code and videos available at https://scalingcmamae.github.io

  7. arXiv:2206.04199  [pdf, other]

    cs.AI cs.LG cs.NE

    Deep Surrogate Assisted Generation of Environments

    Authors: Varun Bhatt, Bryon Tjanaka, Matthew C. Fontaine, Stefanos Nikolaidis

    Abstract: Recent progress in reinforcement learning (RL) has started producing generally capable agents that can solve a distribution of complex environments. These agents are typically tested on fixed, human-authored environments. On the other hand, quality diversity (QD) optimization has been proven to be an effective component of environment generation algorithms, which can generate collections of high-q…

    Submitted 11 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 26 pages, 15 figures, supplemental website at https://dsagepaper.github.io/

  8. arXiv:2202.03666  [pdf, other]

    cs.LG cs.AI cs.NE

    Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning

    Authors: Bryon Tjanaka, Matthew C. Fontaine, Julian Togelius, Stefanos Nikolaidis

    Abstract: Consider the problem of training robustly capable agents. One approach is to generate a diverse collection of agent policies. Training can then be viewed as a quality diversity (QD) optimization problem, where we search for a collection of performant policies that are diverse with respect to quantified behavior. Recent work shows that differentiable quality diversity (DQD) algorithms greatly accele…

    Submitted 15 April, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Published as a conference paper at the 2022 Genetic and Evolutionary Computation Conference (GECCO '22); Online article available at http://dqd-rl.github.io

  9. arXiv:2106.10853  [pdf, other]

    cs.RO cs.AI

    On the Importance of Environments in Human-Robot Coordination

    Authors: Matthew C. Fontaine, Ya-Chuan Hsu, Yulun Zhang, Bryon Tjanaka, Stefanos Nikolaidis

    Abstract: When studying robots collaborating with humans, much of the focus has been on robot policies that coordinate fluently with human teammates in collaborative tasks. However, less emphasis has been placed on the effect of the environment on coordination behaviors. To thoroughly explore environments that result in diverse behaviors, we propose a framework for procedural generation of environments that…

    Submitted 28 June, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2021

  10. Scalable Hierarchical Agglomerative Clustering

    Authors: Nicholas Monath, Avinava Dubey, Guru Guruganesh, Manzil Zaheer, Amr Ahmed, Andrew McCallum, Gokhan Mergen, Marc Najork, Mert Terzihan, Bryon Tjanaka, Yuan Wang, Yuchen Wu

    Abstract: The applicability of agglomerative clustering, for inferring both hierarchical and flat clustering, is limited by its scalability. Existing scalable hierarchical clustering methods sacrifice quality for speed and often lead to over-merging of clusters. In this paper, we present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of da…

    Submitted 30 September, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Appeared in KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining