
Showing 1–50 of 111 results for author: Boots, B

  1. arXiv:2405.16487

    cs.RO

    Dynamics Models in the Aggressive Off-Road Driving Regime

    Authors: Tyler Han, Sidharth Talia, Rohan Panicker, Preet Shah, Neel Jawale, Byron Boots

    Abstract: Current developments in autonomous off-road driving are steadily increasing performance through higher speeds and more challenging, unstructured environments. However, this operating regime subjects the vehicle to larger inertial effects, where consideration of higher-order states is necessary to avoid failures such as rollovers or excessive impact forces. Aggressive driving through Model Predicti…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted to ICRA 2024 Workshop on Resilient Off-road Autonomy

  2. arXiv:2403.18197

    cs.RO

    LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators

    Authors: Changyi Lin, Xingyu Liu, Yuxiang Yang, Yaru Niu, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots, Ding Zhao

    Abstract: Quadrupedal robots have emerged as versatile agents capable of locomoting and manipulating in complex environments. Traditional designs typically rely on the robot's inherent body parts or incorporate top-mounted arms for manipulation tasks. However, these configurations may limit the robot's operational dexterity, efficiency and adaptability, particularly in cluttered or constrained spaces. In th…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page: https://linchangyi1.github.io/LocoMan

  3. arXiv:2403.11298

    cs.RO

    Multi-Sample Long Range Path Planning under Sensing Uncertainty for Off-Road Autonomous Driving

    Authors: Matt Schmittle, Rohan Baijal, Brian Hou, Siddhartha Srinivasa, Byron Boots

    Abstract: We focus on the problem of long-range dynamic replanning for off-road autonomous vehicles, where a robot plans paths through a previously unobserved environment while continuously receiving noisy local observations. An effective approach for planning under sensing uncertainty is determinization, where one converts a stochastic world into a deterministic one and plans under this simplification. Thi…

    Submitted 17 March, 2024; originally announced March 2024.

  4. arXiv:2312.16016

    cs.RO

    V-STRONG: Visual Self-Supervised Traversability Learning for Off-road Navigation

    Authors: Sanghun Jung, JoonHo Lee, Xiangyun Meng, Byron Boots, Alexander Lambert

    Abstract: Reliable estimation of terrain traversability is critical for the successful deployment of autonomous systems in wild, outdoor environments. Given the lack of large-scale annotated datasets for off-road navigation, strictly-supervised learning approaches remain limited in their generalization ability. To this end, we introduce a novel, image-based self-supervised learning method for traversability…

    Submitted 15 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: ICRA 2024; 8 pages

  5. arXiv:2311.12284

    cs.RO

    Model Predictive Control for Aggressive Driving Over Uneven Terrain

    Authors: Tyler Han, Alex Liu, Anqi Li, Alex Spitzer, Guanya Shi, Byron Boots

    Abstract: Terrain traversability in unstructured off-road autonomy has traditionally relied on semantic classification, resource-intensive dynamics models, or purely geometry-based methods to predict vehicle-terrain interactions. While inconsequential at low speeds, uneven terrain subjects our full-scale system to safety-critical challenges at operating speeds of 7–10 m/s. This study focuses particularly o…

    Submitted 7 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to R:SS 2024

  6. arXiv:2310.09053

    cs.RO cs.AI eess.SY

    DATT: Deep Adaptive Trajectory Tracking for Quadrotor Control

    Authors: Kevin Huang, Rwik Rana, Alexander Spitzer, Guanya Shi, Byron Boots

    Abstract: Precise arbitrary trajectory tracking for quadrotors is challenging due to unknown nonlinear dynamics, trajectory infeasibility, and actuation limits. To tackle these challenges, we present Deep Adaptive Trajectory Tracking (DATT), a learning-based approach that can precisely track arbitrary, potentially infeasible trajectories in the presence of large disturbances in the real world. DATT builds o…

    Submitted 13 December, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  7. arXiv:2310.04590

    cs.RO cs.LG

    Deep Model Predictive Optimization

    Authors: Jacob Sacks, Rwik Rana, Kevin Huang, Alex Spitzer, Guanya Shi, Byron Boots

    Abstract: A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly flexible and general but often results in brittle policies. In contrast, model predictive control (MPC) continually re-plans at each time step to remain robust to perturbations and mo…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Main paper is 6 pages with 4 figures and 1 table. Code available at: https://github.com/jisacks/dmpo

  8. arXiv:2309.16652

    cs.RO

    Perceiving Extrinsic Contacts from Touch Improves Learning Insertion Policies

    Authors: Carolina Higuera, Joseph Ortiz, Haozhi Qi, Luis Pineda, Byron Boots, Mustafa Mukadam

    Abstract: Robotic manipulation tasks such as object insertion typically involve interactions between object and environment, namely extrinsic contacts. Prior work on Neural Contact Fields (NCF) use intrinsic tactile sensing between gripper and object to estimate extrinsic contacts in simulation. However, its effectiveness and utility in real-world tasks remains unknown. In this work, we improve NCF to ena…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Under review

  9. arXiv:2309.13523

    cs.CV

    LiDAR-UDA: Self-ensembling Through Time for Unsupervised LiDAR Domain Adaptation

    Authors: Amirreza Shaban, JoonHo Lee, Sanghun Jung, Xiangyun Meng, Byron Boots

    Abstract: We introduce LiDAR-UDA, a novel two-stage self-training-based Unsupervised Domain Adaptation (UDA) method for LiDAR segmentation. Existing self-training methods use a model trained on labeled source data to generate pseudo labels for target data and refine the predictions via fine-tuning the network on the pseudo labels. These methods suffer from domain shifts caused by different LiDAR sensor conf…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted ICCV 2023 (Oral)

  10. arXiv:2306.09557

    cs.RO

    CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller

    Authors: Yuxiang Yang, Guanya Shi, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: We present CAJun, a novel hierarchical learning and control framework that enables legged robots to jump continuously with adaptive jumping distances. CAJun consists of a high-level centroidal policy and a low-level leg controller. In particular, we use reinforcement learning (RL) to train the centroidal policy, which specifies the gait timing, base velocity, and swing foot position for the leg co…

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Please visit https://yxyang.github.io/cajun/ for additional results

  11. arXiv:2305.03735

    cs.AI cs.GT cs.MA cs.RO

    Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula

    Authors: Boling Yang, Liyuan Zheng, Lillian J. Ratliff, Byron Boots, Joshua R. Smith

    Abstract: Autocurricular training is an important sub-area of multi-agent reinforcement learning (MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving scheme. The robotics community has experimented autocurricular training with physically grounded problems, such as robust control and interactive manipulation tasks. However, the asymmetric nature of these tasks makes the…

    Submitted 4 May, 2023; originally announced May 2023.

  12. arXiv:2304.08663

    cs.RO cs.AI cs.LG

    Continuous Versatile Jumping Using Learned Action Residuals

    Authors: Yuxiang Yang, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: Jumping is essential for legged robots to traverse through difficult terrains. In this work, we propose a hierarchical framework that combines optimal control and reinforcement learning to learn continuous jumping motions for quadrupedal robots. The core of our framework is a stance controller, which combines a manually designed acceleration controller with a learned residual policy. As the accele…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: To be presented at L4DC 2023

  13. arXiv:2304.01182

    cs.RO

    Learning to Read Braille: Bridging the Tactile Reality Gap with Diffusion Models

    Authors: Carolina Higuera, Byron Boots, Mustafa Mukadam

    Abstract: Simulating vision-based tactile sensors enables learning models for contact-rich tasks when collecting real world data at scale can be prohibitive. However, modeling the optical response of the gel deformation as well as incorporating the dynamics of the contact makes sim2real challenging. Prior works have explored data augmentation, fine-tuning, or learning generative models to reduce the sim2rea…

    Submitted 3 April, 2023; originally announced April 2023.

  14. arXiv:2303.17156

    cs.LG

    MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations

    Authors: Anqi Li, Byron Boots, Ching-An Cheng

    Abstract: We study a new paradigm for sequential decision making, called offline policy learning from observations (PLfO). Offline PLfO aims to learn policies using datasets with substandard qualities: 1) only a subset of trajectories is labeled with rewards, 2) labeled trajectories may not contain actions, 3) labeled trajectories may not be of high quality, and 4) the data may not have full coverage. Such…

    Submitted 6 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

  15. arXiv:2303.15771

    cs.RO

    TerrainNet: Visual Modeling of Complex Terrain for High-speed, Off-road Navigation

    Authors: Xiangyun Meng, Nathan Hatch, Alexander Lambert, Anqi Li, Nolan Wagener, Matthew Schmittle, JoonHo Lee, Wentao Yuan, Zoey Chen, Samuel Deng, Greg Okopal, Dieter Fox, Byron Boots, Amirreza Shaban

    Abstract: Effective use of camera-based vision systems is essential for robust performance in autonomous off-road driving, particularly in the high-speed regime. Despite success in structured, on-road settings, current end-to-end approaches for scene prediction have yet to be successfully adapted for complex outdoor terrain. To this end, we present TerrainNet, a vision-based terrain perception system for se…

    Submitted 29 May, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  16. arXiv:2302.11048

    cs.LG cs.AI

    Adversarial Model for Offline Reinforcement Learning

    Authors: Mohak Bhardwaj, Tengyang Xie, Byron Boots, Nan Jiang, Ching-An Cheng

    Abstract: We propose a novel model-based offline Reinforcement Learning (RL) framework, called Adversarial Model for Offline Reinforcement Learning (ARMOR), which can robustly learn policies to improve upon an arbitrary reference policy regardless of data coverage. ARMOR is designed to optimize policies for the worst-case performance relative to the reference policy through adversarially training a Markov d…

    Submitted 24 December, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted at the Neural Information Processing Systems (NeurIPS), 2023. Mohak Bhardwaj and Tengyang Xie contributed equally to this work. arXiv admin note: text overlap with arXiv:2211.04538

  17. arXiv:2212.02603

    cs.RO cs.AI cs.LG eess.SY

    Learning to Optimize in Model Predictive Control

    Authors: Jacob Sacks, Byron Boots

    Abstract: Sampling-based Model Predictive Control (MPC) is a flexible control framework that can reason about non-smooth dynamics and cost functions. Recently, significant work has focused on the use of machine learning to improve the performance of MPC, often through learning or fine-tuning the dynamics or cost function. In contrast, we focus on learning to optimize more effectively. In other words, to imp…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Proceedings of the IEEE Conference on Robotics and Automation (ICRA), 2022. Paper is 6 pages with 2 figures and 2 tables

    Journal ref: In 2022 International Conference on Robotics and Automation (ICRA), pp. 10549-10556. IEEE, 2022

  18. arXiv:2212.02587

    cs.RO cs.AI eess.SY

    Learning Sampling Distributions for Model Predictive Control

    Authors: Jacob Sacks, Byron Boots

    Abstract: Sampling-based methods have become a cornerstone of contemporary approaches to Model Predictive Control (MPC), as they make no restrictions on the differentiability of the dynamics or cost function and are straightforward to parallelize. However, their efficacy is highly dependent on the quality of the sampling distribution itself, which is often assumed to be simple, like a Gaussian. This restric…

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted at the Conference on Robot Learning (CoRL), 2022. Main paper is 9 pages with 4 figures. Appendix is 12 pages with 11 figures and 1 table

  19. arXiv:2210.12209

    cs.RO cs.AI

    Motion Policy Networks

    Authors: Adam Fishman, Adithyavairan Murali, Clemens Eppner, Bryan Peele, Byron Boots, Dieter Fox

    Abstract: Collision-free motion generation in unknown environments is a core building block for robot manipulation. Generating such motions is challenging due to multiple objectives; not only should the solutions be optimal, the motion generator itself must be fast enough for real-time performance and reliable enough for practical deployment. A wide variety of methods have been proposed ranging from local c…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: To be published in the Conference on Robot Learning (CoRL) 2022. 10 pages with 4 figures. Appendix has 10 pages and 1 figure

  20. arXiv:2210.09297

    cs.RO cs.CV

    Neural Contact Fields: Tracking Extrinsic Contact with Tactile Sensing

    Authors: Carolina Higuera, Siyuan Dong, Byron Boots, Mustafa Mukadam

    Abstract: We present Neural Contact Fields, a method that brings together neural fields and tactile sensing to address the problem of tracking extrinsic contact between object and environment. Knowing where the external contact occurs is a first step towards methods that can actively control it in facilitating downstream manipulation tasks. Prior work for localizing environmental contacts typically assume a…

    Submitted 13 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 2023 International Conference on Robotics and Automation (ICRA)

  21. arXiv:2206.13631

    cs.RO cs.AI

    Learning Semantics-Aware Locomotion Skills from Human Demonstration

    Authors: Yuxiang Yang, Xiangyun Meng, Wenhao Yu, Tingnan Zhang, Jie Tan, Byron Boots

    Abstract: The semantics of the environment, such as the terrain type and property, reveals important information for legged robots to adjust their behaviors. In this work, we present a framework that learns semantics-aware locomotion skills from perception for quadrupedal robots, such that the robot can traverse through complex offroad terrains with appropriate speeds and gaits using perception information.…

    Submitted 10 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

  22. arXiv:2206.00205

    cs.CV

    CAFA: Class-Aware Feature Alignment for Test-Time Adaptation

    Authors: Sanghun Jung, Jungsoo Lee, Nanhee Kim, Amirreza Shaban, Byron Boots, Jaegul Choo

    Abstract: Despite recent advancements in deep learning, deep neural networks continue to suffer from performance degradation when applied to new data that differs from training data. Test-time adaptation (TTA) aims to address this challenge by adapting a model to unlabeled data at test time. TTA can be applied to pretrained networks without modifying their training procedures, enabling them to utilize a wel…

    Submitted 3 September, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  23. Learning Implicit Priors for Motion Optimization

    Authors: Julen Urain, An T. Le, Alexander Lambert, Georgia Chalvatzaki, Byron Boots, Jan Peters

    Abstract: In this paper, we focus on the problem of integrating Energy-based Models (EBM) as guiding priors for motion optimization. EBMs are a set of neural networks that can represent expressive probability density distributions in terms of a Gibbs distribution parameterized by a suitable energy function. Due to their implicit nature, they can easily be integrated as optimization factors or as initial sam…

    Submitted 11 January, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 17 pages, accepted at IEEE/RSJ IROS 2022, paper website: https://sites.google.com/view/implicit-priors/home

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 7672-7679

  24. arXiv:2202.07068

    cs.RO cs.AI cs.HC cs.MA

    Motivating Physical Activity via Competitive Human-Robot Interaction

    Authors: Boling Yang, Golnaz Habibi, Patrick E. Lancaster, Byron Boots, Joshua R. Smith

    Abstract: This project aims to motivate research in competitive human-robot interaction by creating a robot competitor that can challenge human users in certain scenarios such as physical exercise and games. With this goal in mind, we introduce the Fencing Game, a human-robot competition used to evaluate both the capabilities of the robot competitor and user experience. We develop the robot competitor throu…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Conference on Robot Learning. PMLR, 2022

  25. arXiv:2111.07986

    cs.RO cs.LG eess.SY

    Nonprehensile Riemannian Motion Predictive Control

    Authors: Hamid Izadinia, Byron Boots, Steven M. Seitz

    Abstract: Nonprehensile manipulation involves long horizon underactuated object interactions and physical contact with different objects that can inherently introduce a high degree of uncertainty. In this work, we introduce a novel Real-to-Sim reward analysis technique, called Riemannian Motion Predictive Control (RMPC), to reliably imagine and predict the outcome of taking possible actions for a real robot…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: To appear at International Symposium on Experimental Robotics (ISER)

  26. arXiv:2111.02972

    cs.RO

    Stein Variational Probabilistic Roadmaps

    Authors: Alexander Lambert, Brian Hou, Rosario Scalise, Siddhartha S. Srinivasa, Byron Boots

    Abstract: Efficient and reliable generation of global path plans are necessary for safe execution and deployment of autonomous systems. In order to generate planning graphs which adequately resolve the topology of a given environment, many sampling-based motion planners resort to coarse, heuristically-driven strategies which often fail to generalize to new and varied surroundings. Further, many of these app…

    Submitted 20 May, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: Pre-print

  27. arXiv:2110.04669

    cs.RO cs.LG

    Leveraging Experience in Lazy Search

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots, Siddhartha Srinivasa

    Abstract: Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck. These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found. The order in which edges are selected is critical to minimizing the total number of edge evaluations: a…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Extended journal version accepted for publication at Autonomous Robots; 17 pages. arXiv admin note: substantial text overlap with arXiv:1907.07238

  28. arXiv:2109.10443

    cs.RO eess.SY

    Geometric Fabrics: Generalizing Classical Mechanics to Capture the Physics of Behavior

    Authors: Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana, Buck Babich, Bryan Peele, Qian Wan, Iretiayo Akinola, Balakumar Sundaralingam, Dieter Fox, Byron Boots, Nathan D. Ratliff

    Abstract: Classical mechanical systems are central to controller design in energy shaping methods of geometric control. However, their expressivity is limited by position-only metrics and the intimate link between metric and geometry. Recent work on Riemannian Motion Policies (RMPs) has shown that shedding these restrictions results in powerful design tools, but at the expense of theoretical stability guara…

    Submitted 18 January, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  29. arXiv:2107.05146

    cs.RO

    Entropy Regularized Motion Planning via Stein Variational Inference

    Authors: Alexander Lambert, Byron Boots

    Abstract: Many Imitation and Reinforcement Learning approaches rely on the availability of expert-generated demonstrations for learning policies or value functions from data. Obtaining a reliable distribution of trajectories from motion planners is non-trivial, since it must broadly cover the space of states likely to be encountered during execution while also satisfying task-based constraints. We propose a…

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: RSS 2021 Workshop on Integrating Planning and Learning

  30. arXiv:2106.09110

    cs.LG cs.RO eess.SY

    Safe Reinforcement Learning Using Advantage-Based Intervention

    Authors: Nolan Wagener, Byron Boots, Ching-An Cheng

    Abstract: Many sequential decision problems involve finding a policy that maximizes total reward while obeying safety constraints. Although much recent research has focused on the development of safe reinforcement learning (RL) algorithms that produce a safe policy after training, ensuring safety during training as well remains an open problem. A fundamental challenge is performing exploration while still s…

    Submitted 19 July, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Appearing in ICML 2021. 29 pages, 8 figures

  31. arXiv:2105.03019

    cs.RO cs.LG

    Imitation Learning via Simultaneous Optimization of Policies and Auxiliary Trajectories

    Authors: Mandy Xie, Anqi Li, Karl Van Wyk, Frank Dellaert, Byron Boots, Nathan Ratliff

    Abstract: Imitation learning (IL) is a frequently used approach for data-efficient policy learning. Many IL methods, such as Dataset Aggregation (DAgger), combat challenges like distributional shift by interacting with oracular experts. Unfortunately, assuming access to oracular experts is often unrealistic in practice; data used in IL frequently comes from offline processes such as lead-through or teleoper…

    Submitted 5 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

  32. arXiv:2104.13542

    cs.RO

    STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation

    Authors: Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots

    Abstract: Sampling-based model-predictive control (MPC) is a promising tool for feedback control of robots with complex, non-smooth dynamics, and cost functions. However, the computationally demanding nature of sampling-based MPC algorithms has been a key bottleneck in their application to high-dimensional robotic manipulation problems in the real world. Previous methods have addressed this issue by running…

    Submitted 14 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted for oral presentation at the Conference on Robot Learning (CoRL), 2021. Code available at: https://github.com/NVlabs/storm

    Journal ref: 5th Annual Conference on Robot Learning, 2021

  33. arXiv:2104.04644

    cs.RO cs.LG

    Fast and Efficient Locomotion via Learned Gait Transitions

    Authors: Yuxiang Yang, Tingnan Zhang, Erwin Coumans, Jie Tan, Byron Boots

    Abstract: We focus on the problem of developing energy efficient controllers for quadrupedal robots. Animals can actively switch gaits at different speeds to lower their energy consumption. In this paper, we devise a hierarchical learning framework, in which distinctive locomotion gaits and natural gait transitions emerge automatically with a simple reward of energy minimization. We use evolutionary strateg…

    Submitted 22 November, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Published in CoRL 2021. Website: https://sites.google.com/view/fast-and-efficient Code: https://github.com/yxyang/fast_and_efficient

  34. arXiv:2104.02863

    cs.RO cs.LG

    The Value of Planning for Infinite-Horizon Model Predictive Control

    Authors: Nathan Hatch, Byron Boots

    Abstract: Model Predictive Control (MPC) is a classic tool for optimal control of complex, real-world systems. Although it has been successfully applied to a wide range of challenging tasks in robotics, it is fundamentally limited by the prediction horizon, which, if too short, will result in myopic decisions. Recently, several papers have suggested using a learned value function as the terminal cost for MP…

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 7 pages, 8 figures. To appear in the proceedings of the International Conference on Robotics and Automation (ICRA) 2021

  35. arXiv:2103.14162

    cs.CV

    Few-shot Weakly-Supervised Object Detection via Directional Statistics

    Authors: Amirreza Shaban, Amir Rahimi, Thalaiyasingam Ajanthan, Byron Boots, Richard Hartley

    Abstract: Detecting novel objects from few examples has become an emerging topic in computer vision recently. However, these methods need fully annotated training images to learn new object categories which limits their applicability in real world scenarios such as field robotics. In this work, we propose a probabilistic multiple instance learning approach for few-shot Common Object Localization (COL) and f…

    Submitted 25 March, 2021; originally announced March 2021.

  36. arXiv:2103.12890

    cs.RO cs.LG

    Dual Online Stein Variational Inference for Control and Dynamics

    Authors: Lucas Barcelos, Alexander Lambert, Rafael Oliveira, Paulo Borges, Byron Boots, Fabio Ramos

    Abstract: Model predictive control (MPC) schemes have a proven track record for delivering aggressive and robust performance in many challenging control tasks, coping with nonlinear system dynamics, constraints, and observational noise. Despite their success, these methods often rely on simple control distributions, which can limit their performance in highly uncertain and complex environments. MPC framewor…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Corresponding author: [email protected]

  37. arXiv:2103.05922

    cs.RO cs.LG eess.SY

    RMP2: A Structured Composable Policy Class for Robot Learning

    Authors: Anqi Li, Ching-An Cheng, M. Asif Rana, Man Xie, Karl Van Wyk, Nathan Ratliff, Byron Boots

    Abstract: We consider the problem of learning motion policies for acceleration-based robotics systems with a structured policy class specified by RMPflow. RMPflow is a multi-task control framework that has been successfully applied in many robotics problems. Using RMPflow as a structured policy class in learning has several benefits, such as sufficient expressiveness, the flexibility to inject different lev…

    Submitted 10 March, 2021; originally announced March 2021.

  38. Combining pretrained CNN feature extractors to enhance clustering of complex natural images

    Authors: Joris Guerin, Stephane Thiery, Eric Nyiri, Olivier Gibaru, Byron Boots

    Abstract: Recently, a common starting point for solving complex unsupervised image classification tasks is to use generic features, extracted with deep Convolutional Neural Networks (CNN) pretrained on a large and versatile dataset (ImageNet). However, in most research, the CNN architecture for feature extraction is chosen arbitrarily, without justification. This paper aims at providing insight on the use o…

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 21 pages, 16 figures, 10 tables, preprint of our paper published in Neurocomputing

    Journal ref: Guerin, J., Thiery, S., Nyiri, E., Gibaru, O., & Boots, B. (2021). Combining pretrained CNN feature extractors to enhance clustering of complex natural images. Neurocomputing, 423, 551-571

  39. arXiv:2012.13457

    cs.RO cs.LG

    Towards Coordinated Robot Motions: End-to-End Learning of Motion Policies on Transform Trees

    Authors: M. Asif Rana, Anqi Li, Dieter Fox, Sonia Chernova, Byron Boots, Nathan Ratliff

    Abstract: Generating robot motion that fulfills multiple tasks simultaneously is challenging due to the geometric constraints imposed by the robot. In this paper, we propose to solve multi-task problems through learning structured policies from human demonstrations. Our structured policy is inspired by RMPflow, a framework for combining subtask policies on different spaces. The policy structure provides the…

    Submitted 10 March, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

  40. arXiv:2012.05909

    cs.LG cs.RO

    Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

    Authors: Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

    Abstract: Model-Predictive Control (MPC) is a powerful tool for controlling complex, real-world systems that uses a model to make predictions about future behavior. For each state encountered, MPC solves an online optimization problem to choose a control action that will minimize future cost. This is a surprisingly effective strategy, but real-time performance requirements warrant the use of simple models.…

    Submitted 13 April, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 15 pages

    Journal ref: International Conference on Learning Representations (ICLR), 2021

  41. arXiv:2011.07641

    cs.RO cs.AI

    Stein Variational Model Predictive Control

    Authors: Alexander Lambert, Adam Fishman, Dieter Fox, Byron Boots, Fabio Ramos

    Abstract: Decision making under uncertainty is critical to real-world, autonomous systems. Model Predictive Control (MPC) methods have demonstrated favorable performance in practice, but remain limited when dealing with complex probability distributions. In this paper, we propose a generalization of MPC that represents a multitude of solutions as posterior distributions. By casting MPC as a Bayesian inferen…

    Submitted 12 April, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted to Conference on Robot Learning (CoRL) 2020

  42. arXiv:2011.06719  [pdf, other]

    cs.RO cs.LG

    Grasping with Chopsticks: Combating Covariate Shift in Model-free Imitation Learning for Fine Manipulation

    Authors: Liyiming Ke, Jingqiang Wang, Tapomayukh Bhattacharjee, Byron Boots, Siddhartha Srinivasa

    Abstract: Billions of people use chopsticks, a simple yet versatile tool, for fine manipulation of everyday objects. The small, curved, and slippery tips of chopsticks pose a challenge for picking up small objects, making them a suitably complex test case. This paper leverages human demonstrations to develop an autonomous chopsticks-equipped robotic manipulator. Due to the lack of accurate models for fine m…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Submitted to ICRA 2021

  43. arXiv:2010.14750  [pdf, other]

    cs.RO

    Geometric Fabrics for the Acceleration-based Design of Robotic Motion

    Authors: Mandy Xie, Karl Van Wyk, Anqi Li, Muhammad Asif Rana, Qian Wan, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: This paper describes the pragmatic design and construction of geometric fabrics for shaping a robot's task-independent nominal behavior, capturing behavioral components such as obstacle avoidance, joint limit avoidance, redundancy resolution, global navigation heuristics, etc. Geometric fabrics constitute the most concrete incarnation of a new mathematical formulation for reactive behavior called…

    Submitted 25 June, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  44. arXiv:2010.10653  [pdf, other]

    cs.LG quant-ph

    Quantum Tensor Networks, Stochastic Processes, and Weighted Automata

    Authors: Siddarth Srinivasan, Sandesh Adhikary, Jacob Miller, Guillaume Rabusseau, Byron Boots

    Abstract: Modeling joint probability distributions over sequences has been studied from many perspectives. The physics community developed matrix product states, a tensor-train decomposition for probabilistic modeling, motivated by the need to tractably model many-body systems. But similar models have also been studied in the stochastic processes and weighted automata literature, with little work on how the…

    Submitted 20 October, 2020; originally announced October 2020.

  45. arXiv:2009.10019  [pdf, other]

    cs.RO cs.LG

    Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion

    Authors: Xingye Da, Zhaoming Xie, David Hoeller, Byron Boots, Animashree Anandkumar, Yuke Zhu, Buck Babich, Animesh Garg

    Abstract: We present a hierarchical framework that combines model-based control and reinforcement learning (RL) to synthesize robust controllers for a quadruped (the Unitree Laikago). The system consists of a high-level controller that learns to choose from a set of primitives in response to changes in the environment and a low-level controller that utilizes an established control method to robustly execute…

    Submitted 23 November, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: Supplementary video: https://youtu.be/JJOmFZKpYTo

  46. arXiv:2007.14256  [pdf, other]

    cs.RO

    RMPflow: A Geometric Framework for Generation of Multi-Task Motion Policies

    Authors: Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: Generating robot motion for multiple tasks in dynamic environments is challenging, requiring an algorithm to respond reactively while accounting for complex nonlinear relationships between tasks. In this paper, we develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.07049

  47. arXiv:2007.02520  [pdf, other]

    cs.LG cs.RO stat.ML

    Explaining Fast Improvement in Online Imitation Learning

    Authors: Xinyan Yan, Byron Boots, Ching-An Cheng

    Abstract: Online imitation learning (IL) is an algorithmic framework that leverages interactions with expert policies for efficient policy optimization. Here policies are optimized by performing online learning on a sequence of loss functions that encourage the learner to mimic expert actions, and if the online learning has no regret, the agent can provably learn an expert-like policy. Online IL has demonst…

    Submitted 21 February, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 22 pages, 2 figures

  48. arXiv:2005.13143  [pdf, other]

    cs.RO cs.LG eess.SY

    Euclideanizing Flows: Diffeomorphic Reduction for Learning Stable Dynamical Systems

    Authors: Muhammad Asif Rana, Anqi Li, Dieter Fox, Byron Boots, Fabio Ramos, Nathan Ratliff

    Abstract: Robotic tasks often require motions with complex geometric structures. We present an approach to learn such motions from a limited number of human demonstrations by exploiting the regularity properties of human motions e.g. stability, smoothness, and boundedness. The complex motions are encoded as rollouts of a stable dynamical system, which, under a change of coordinates defined by a diffeomorphi…

    Submitted 21 September, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 2nd Annual Conference on Learning for Dynamics and Control (L4DC) 2020 -- Revised Version

  49. arXiv:2003.08375  [pdf, other]

    cs.CV

    Pairwise Similarity Knowledge Transfer for Weakly Supervised Object Localization

    Authors: Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard Hartley, Byron Boots

    Abstract: Weakly Supervised Object Localization (WSOL) methods only require image level labels as opposed to expensive bounding box annotations required by fully supervised algorithms. We study the problem of learning localization model on target classes with weakly supervised image labels, helped by a fully annotated source dataset. Typically, a WSOL model is first trained to predict class generic objectne…

    Submitted 19 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: ECCV 2020. Formerly "In Defense of Graph Inference Algorithms for Weakly Supervised Object Localization"

  50. arXiv:2003.06820  [pdf, other]

    cs.LG cs.CV stat.ML

    Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

    Authors: Amir Rahimi, Amirreza Shaban, Ching-An Cheng, Richard Hartley, Byron Boots

    Abstract: Predicting calibrated confidence scores for multi-class deep networks is important for avoiding rare but costly mistakes. A common approach is to learn a post-hoc calibration function that transforms the output of the original network into calibrated confidence scores while maintaining the network's accuracy. However, previous post-hoc calibration techniques work only with simple calibration funct…

    Submitted 23 October, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2020