Zum Hauptinhalt springen

Showing 1–39 of 39 results for author: Shkurti, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16025  [pdf, other

    cs.LG cs.AI

    Exploring and Addressing Reward Confusion in Offline Preference Learning

    Authors: Xin Chen, Sam Toyer, Florian Shkurti

    Abstract: Spurious correlations in a reward model's training data can prevent Reinforcement Learning from Human Feedback (RLHF) from identifying the desired goal and induce unwanted behaviors. This paper shows that offline RLHF is susceptible to reward confusion, especially in the presence of spurious correlations in offline data. We create a benchmark to study this problem and propose a method that can sig… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  2. arXiv:2401.06949  [pdf, other

    cs.RO cs.AI

    ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization

    Authors: Kourosh Darvish, Marta Skreta, Yuchi Zhao, Naruki Yoshikawa, Sagnik Som, Miroslav Bogdanovic, Yang Cao, Han Hao, Haoping Xu, Alán Aspuru-Guzik, Animesh Garg, Florian Shkurti

    Abstract: Chemistry experimentation is often resource- and labor-intensive. Despite the many benefits incurred by the integration of advanced and special-purpose lab equipment, many aspects of experimentation are still manually conducted by chemists, for example, polishing an electrode in electrochemistry experiments. Traditional lab automation infrastructure faces challenges when it comes to flexibly adapt… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  3. arXiv:2310.10982  [pdf, other

    cs.RO

    SICNav: Safe and Interactive Crowd Navigation using Model Predictive Control and Bilevel Optimization

    Authors: Sepehr Samavi, James R. Han, Florian Shkurti, Angela P. Schoellig

    Abstract: Robots need to predict and react to human motions to navigate through a crowd without collisions. Many existing methods decouple prediction from planning, which does not account for the interaction between robot and human motions and can lead to the robot getting stuck. We propose SICNav, a Model Predictive Control (MPC) method that jointly solves for robot motion and predicted crowd motion in clo… ▽ More

    Submitted 27 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Currently under review for IEEE Transactions on Robotics (T-RO)

  4. arXiv:2310.01775  [pdf, other

    cs.RO cs.AI

    STAMP: Differentiable Task and Motion Planning via Stein Variational Gradient Descent

    Authors: Yewon Lee, Philip Huang, Krishna Murthy Jatavallabhula, Andrew Z. Li, Fabian Damken, Eric Heiden, Kevin Smith, Derek Nowrouzezahrai, Fabio Ramos, Florian Shkurti

    Abstract: Planning for many manipulation tasks, such as using tools or assembling parts, often requires both symbolic and geometric reasoning. Task and Motion Planning (TAMP) algorithms typically solve these problems by conducting a tree search over high-level task sequences while checking for kinematic and dynamic feasibility. This can be inefficient as the width of the tree can grow exponentially with the… ▽ More

    Submitted 7 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 14 pages, 9 figures, Learning Effective Abstractions for Planning (LEAP) Workshop at CoRL 2023

    ACM Class: I.2.9

  5. arXiv:2309.16650  [pdf, other

    cs.RO cs.CV

    ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning

    Authors: Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull

    Abstract: For robots to perform a wide variety of tasks, they require a 3D representation of the world that is semantically rich, yet compact and efficient for task-driven perception and planning. Recent approaches have attempted to leverage features from large vision-language models to encode semantics in 3D representations. However, these approaches tend to produce maps with per-point feature vectors, whi… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Project page: https://concept-graphs.github.io/ Explainer video: https://youtu.be/mRhNkQwRYnc

  6. arXiv:2309.15770  [pdf, other

    cs.RO

    Generating Transferable Adversarial Simulation Scenarios for Self-Driving via Neural Rendering

    Authors: Yasasa Abeysirigoonawardena, Kevin Xie, Chuhan Chen, Salar Hosseini, Ruiting Chen, Ruiqi Wang, Florian Shkurti

    Abstract: Self-driving software pipelines include components that are learned from a significant number of training examples, yet it remains challenging to evaluate the overall system's safety and generalization performance. Together with scaling up the real-world deployment of autonomous vehicles, it is of critical importance to automatically find simulation scenarios where the driving policies will fail.… ▽ More

    Submitted 23 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Conference paper submitted to CoRL 23

  7. arXiv:2309.14657  [pdf, other

    cs.RO

    Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

    Authors: Philip Huang, Tony Wang, Florian Shkurti, Timothy D. Barfoot

    Abstract: We introduce a multi-sensor navigation system for autonomous surface vessels (ASV) intended for water-quality monitoring in freshwater lakes. Our mission planner uses satellite imagery as a prior map, formulating offline a mission-level policy for global navigation of the ASV and enabling autonomous online execution via local perception and local planning modules. A significant challenge is posed… ▽ More

    Submitted 22 August, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: To appear in IEEE Transactions on Field Robotics (T-FR). 40 pages, 21 figures. Video available at https://youtu.be/KVSTmWFLqjk?si=Gvt1uOgLH-6OUrfD. Journal extension of arXiv:2209.11864

  8. Does Unpredictability Influence Driving Behavior?

    Authors: Sepehr Samavi, Florian Shkurti, Angela P. Schoellig

    Abstract: In this paper we investigate the effect of the unpredictability of surrounding cars on an ego-car performing a driving maneuver. We use Maximum Entropy Inverse Reinforcement Learning to model reward functions for an ego-car conducting a lane change in a highway setting. We define a new feature based on the unpredictability of surrounding cars and use it in the reward function. We learn two reward… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2023

    Report number: https://ieeexplore.ieee.org/document/10342534

  9. arXiv:2303.15342  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Exploring Continual Learning of Diffusion Models

    Authors: Michał Zając, Kamil Deja, Anna Kuzina, Jakub M. Tomczak, Tomasz Trzciński, Florian Shkurti, Piotr Miłoś

    Abstract: Diffusion models have achieved remarkable success in generating high-quality images thanks to their novel training procedures applied to unprecedented amounts of data. However, training a diffusion model from scratch is computationally expensive. This highlights the need to investigate the possibility of training these models iteratively, reusing computation while the data distribution changes. In… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  10. arXiv:2303.14595  [pdf, other

    cs.LG cs.AI cs.CV

    Preserving Linear Separability in Continual Learning by Backward Feature Projection

    Authors: Qiao Gu, Dongsub Shim, Florian Shkurti

    Abstract: Catastrophic forgetting has been a major challenge in continual learning, where the model needs to learn new tasks with limited or no access to data from previously seen tasks. To tackle this challenge, methods based on knowledge distillation in feature space have been proposed and shown to reduce forgetting. However, most feature distillation methods directly constrain the new features to match t… ▽ More

    Submitted 27 June, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. The code can be found at https://github.com/rvl-lab-utoronto/BFP

  11. arXiv:2303.14100  [pdf, other

    cs.RO

    Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting

    Authors: Marta Skreta, Naruki Yoshikawa, Sebastian Arellano-Rubach, Zhi Ji, Lasse Bjørn Kristensen, Kourosh Darvish, Alán Aspuru-Guzik, Florian Shkurti, Animesh Garg

    Abstract: Generating low-level robot task plans from high-level natural language instructions remains a challenging problem. Although large language models have shown promising results in generating plans, the accuracy of the output remains unverified. Furthermore, the lack of domain-specific language data poses a limitation on the applicability of these models. In this paper, we propose CLAIRIFY, a novel a… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  12. arXiv:2303.13755  [pdf, other

    cs.CV cs.AI cs.LG

    Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers

    Authors: Cong Wei, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor, Florian Shkurti

    Abstract: Vision Transformers (ViT) have shown their competitive advantages performance-wise compared to convolutional neural networks (CNNs) though they often come with high computational costs. To this end, previous methods explore different attention patterns by limiting a fixed number of spatially nearby tokens to accelerate the ViT's multi-head self-attention (MHSA) operations. However, such structured… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  13. arXiv:2302.11683  [pdf, other

    cs.RO cs.AI cs.CV

    MVTrans: Multi-View Perception of Transparent Objects

    Authors: Yi Ru Wang, Yuchi Zhao, Haoping Xu, Saggi Eppel, Alan Aspuru-Guzik, Florian Shkurti, Animesh Garg

    Abstract: Transparent object perception is a crucial skill for applications such as robot manipulation in household and laboratory settings. Existing methods utilize RGB-D or stereo inputs to handle a subset of perception tasks including depth and pose estimation. However, transparent object perception remains to be an open problem. In this paper, we forgo the unreliable depth map from RGB-D sensors and ext… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted to ICRA 2023; 6 pages, 4 figures, 4 tables

  14. arXiv:2302.07241  [pdf, other

    cs.CV cs.AI cs.RO

    ConceptFusion: Open-set Multimodal 3D Mapping

    Authors: Krishna Murthy Jatavallabhula, Alihusein Kuwajerwala, Qiao Gu, Mohd Omama, Tao Chen, Alaa Maalouf, Shuang Li, Ganesh Iyer, Soroush Saryazdi, Nikhil Keetha, Ayush Tewari, Joshua B. Tenenbaum, Celso Miguel de Melo, Madhava Krishna, Liam Paull, Florian Shkurti, Antonio Torralba

    Abstract: Building 3D maps of the environment is central to robot navigation, planning, and interaction with objects in a scene. Most existing approaches that integrate semantic concepts with 3D maps largely remain confined to the closed-set setting: they can only reason about a finite set of concepts, pre-defined at training time. Further, these maps can only be queried using class labels, or in recent wor… ▽ More

    Submitted 23 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: RSS 2023. Project page: https://concept-fusion.github.io Explainer video: https://www.youtube.com/watch?v=rkXgws8fiDs Code: https://github.com/concept-fusion/concept-fusion

  15. arXiv:2212.09672  [pdf, other

    cs.RO

    Chemistry Lab Automation via Constrained Task and Motion Planning

    Authors: Naruki Yoshikawa, Andrew Zou Li, Kourosh Darvish, Yuchi Zhao, Haoping Xu, Artur Kuramshin, Alán Aspuru-Guzik, Animesh Garg, Florian Shkurti

    Abstract: Chemists need to perform many laborious and time-consuming experiments in the lab to discover and understand the properties of new materials. To support and accelerate this process, we propose a robot framework for manipulation that autonomously performs chemistry experiments. Our framework receives high-level abstract descriptions of chemistry experiments, perceives the lab workspace, and autonom… ▽ More

    Submitted 26 March, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Equal author contribution from Naruki Yoshikawa, Andrew Zou Li, Kourosh Darvish, Yuchi Zhao and Haoping Xu

  16. arXiv:2211.07545  [pdf, ps, other

    cs.RO cs.CV cs.LG

    NeurIPS 2022 Competition: Driving SMARTS

    Authors: Amir Rasouli, Randy Goebel, Matthew E. Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, Kasra Rezaee, Animesh Garg, David Meger, Jun Luo, Liam Paull, Weinan Zhang, Xinyu Wang, Xi Chen

    Abstract: Driving SMARTS is a regular competition designed to tackle problems caused by the distribution shift in dynamic interaction contexts that are prevalent in real-world autonomous driving (AD). The proposed competition supports methodologically diverse solutions, such as reinforcement learning (RL) and offline learning methods, trained on a combination of naturalistic AD data and open-source simulati… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 10 pages, 8 figures

  17. Policy-Guided Lazy Search with Feedback for Task and Motion Planning

    Authors: Mohamed Khodeir, Atharv Sonwane, Ruthrash Hari, Florian Shkurti

    Abstract: PDDLStream solvers have recently emerged as viable solutions for Task and Motion Planning (TAMP) problems, extending PDDL to problems with continuous action spaces. Prior work has shown how PDDLStream problems can be reduced to a sequence of PDDL planning problems, which can then be solved using off-the-shelf planners. However, this approach can suffer from long runtimes. In this paper we propose… ▽ More

    Submitted 23 August, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA), London, United Kingdom, 2023, pp. 3743-3749

  18. arXiv:2209.11864  [pdf, other

    cs.RO

    Stochastic Planning for ASV Navigation Using Satellite Images

    Authors: Yizhou Huang, Hamza Dugmag, Timothy D. Barfoot, Florian Shkurti

    Abstract: Autonomous surface vessels (ASV) represent a promising technology to automate water-quality monitoring of lakes. In this work, we use satellite images as a coarse map and plan sampling routes for the robot. However, inconsistency between the satellite images and the actual lake, as well as environmental disturbances such as wind, aquatic vegetation, and changing water levels can make it difficult… ▽ More

    Submitted 28 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 7 pages, 5 figures

  19. arXiv:2207.05006  [pdf, other

    cs.RO cs.AI cs.LG

    TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs

    Authors: Christopher Agia, Krishna Murthy Jatavallabhula, Mohamed Khodeir, Ondrej Miksik, Vibhav Vineet, Mustafa Mukadam, Liam Paull, Florian Shkurti

    Abstract: 3D scene graphs (3DSGs) are an emerging description; unifying symbolic, topological, and metric scene representations. However, typical 3DSGs contain hundreds of objects and symbols even for small environments; rendering task planning on the full graph impractical. We construct TASKOGRAPHY, the first large-scale robotic task planning benchmark over 3DSGs. While most benchmarking efforts in this ar… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Video: https://www.youtube.com/watch?v=mM4v5hP4LdA&ab_channel=KrishnaMurthy . Project page: https://taskography.github.io/ . 18 pages, 7 figures. In proceedings of Conference on Robot Learning (CoRL) 2021. The first two authors contributed equally

    ACM Class: I.2.8; I.2.9; I.2.10; I.2.6

    Journal ref: PMLR 164 (2022) 46-58

  20. arXiv:2206.12534  [pdf, other

    cs.CV

    SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos

    Authors: Salar Hosseini Khorasgani, Yuxuan Chen, Florian Shkurti

    Abstract: Self-supervised methods have significantly closed the gap with end-to-end supervised learning for image classification. In the case of human action videos, however, where both appearance and motion are significant factors of variation, this gap remains significant. One of the key reasons for this is that sampling pairs of similar video clips, a required step for many self-supervised contrastive le… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: CVPR2022

  21. Learning to Search in Task and Motion Planning with Streams

    Authors: Mohamed Khodeir, Ben Agro, Florian Shkurti

    Abstract: Task and motion planning problems in robotics combine symbolic planning over discrete task variables with motion optimization over continuous state and action variables. Recent works such as PDDLStream have focused on optimistic planning with an incrementally growing set of objects until a feasible trajectory is found. However, this set is exhaustively expanded in a breadth-first manner, regardles… ▽ More

    Submitted 23 August, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: RAL Camera-Ready

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 4, pp. 1983-1990

  22. arXiv:2110.07668  [pdf, other

    cs.CV cs.RO

    Augmenting Imitation Experience via Equivariant Representations

    Authors: Dhruv Sharma, Alihusein Kuwajerwala, Florian Shkurti

    Abstract: The robustness of visual navigation policies trained through imitation often hinges on the augmentation of the training image-action pairs. Traditionally, this has been done by collecting data from multiple cameras, by using standard data augmentations from computer vision, such as adding random noise to each image, or by synthesizing training images. In this paper we show that there is another pr… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 7 pages (including references), 15 figures

  23. arXiv:2110.00087  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Seeing Glass: Joint Point Cloud and Depth Completion for Transparent Objects

    Authors: Haoping Xu, Yi Ru Wang, Sagi Eppel, Alàn Aspuru-Guzik, Florian Shkurti, Animesh Garg

    Abstract: The basis of many object manipulation algorithms is RGB-D input. Yet, commodity RGB-D sensors can only provide distorted depth maps for a wide range of transparent objects due light refraction and absorption. To tackle the perception challenges posed by transparent objects, we propose TranspareNet, a joint point cloud and depth completion method, with the ability to complete the depth of transpare… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: Accepted for Oral at Conference on Robot Learning (CoRL) 2021; Haoping Xu and Yi Ru Wang contributed equally; 8 pages, 6 figures, 3 tables

  24. arXiv:2109.09913  [pdf, other

    cs.CV

    Physics-based Human Motion Estimation and Synthesis from Videos

    Authors: Kevin Xie, Tingwu Wang, Umar Iqbal, Yunrong Guo, Sanja Fidler, Florian Shkurti

    Abstract: Human motion synthesis is an important problem with applications in graphics, gaming and simulation environments for robotics. Existing methods require accurate motion capture data for training, which is costly to obtain. Instead, we propose a framework for training generative models of physically plausible human motion directly from monocular RGB videos, which are much more widely available. At t… ▽ More

    Submitted 11 August, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: To appear in ICCV 2021

  25. arXiv:2104.02646  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    gradSim: Differentiable simulation for system identification and visuomotor control

    Authors: Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

    Abstract: We consider the problem of estimating an object's physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current solutions require precise 3D labels which are labor-intensive to gather, and infeasible to create for many systems such as deformable… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: ICLR 2021. Project page (and a dynamic web version of the article): https://gradsim.github.io

  26. arXiv:2103.03891  [pdf, other

    cs.CV cs.LG

    LOHO: Latent Optimization of Hairstyles via Orthogonalization

    Authors: Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi

    Abstract: Hairstyle transfer is challenging due to hair structure differences in the source and target hair. Therefore, we propose Latent Optimization of Hairstyles via Orthogonalization (LOHO), an optimization-based approach using GAN inversion to infill missing hair structure details in latent space during hairstyle transfer. Our approach decomposes hair into three attributes: perceptual structure, appear… ▽ More

    Submitted 10 March, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  27. arXiv:2011.13897  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Latent Skill Planning for Exploration and Transfer

    Authors: Kevin Xie, Homanga Bharadhwaj, Danijar Hafner, Animesh Garg, Florian Shkurti

    Abstract: To quickly solve new tasks in complex environments, intelligent agents need to build up reusable knowledge. For example, a learned world model captures knowledge about the environment that applies to new tasks. Similarly, skills capture general behaviors that can apply to new tasks. In this paper, we investigate how these two approaches can be integrated into a single reinforcement learning agent.… ▽ More

    Submitted 2 May, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

    Comments: First two authors contributed equally. Published as a conference paper in ICLR 2021

  28. arXiv:2011.01298  [pdf, other

    cs.RO cs.LG

    Shaping Rewards for Reinforcement Learning with Imperfect Demonstrations using Generative Models

    Authors: Yuchen Wu, Melissa Mozifian, Florian Shkurti

    Abstract: The potential benefits of model-free reinforcement learning to real robotics systems are limited by its uninformed exploration that leads to slow convergence, lack of data-efficiency, and unnecessary interactions with the environment. To address these drawbacks we propose a method that combines reinforcement and imitation learning by shaping the reward function with a state-and-action-dependent po… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: submitted to ICRA 2021

  29. arXiv:2010.14497  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Conservative Safety Critics for Exploration

    Authors: Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart, Sergey Levine, Florian Shkurti, Animesh Garg

    Abstract: Safe exploration presents a major challenge in reinforcement learning (RL): when active data collection requires deploying partially trained policies, we must ensure that these policies avoid catastrophically unsafe regions, while still enabling trial and error learning. In this paper, we target the problem of safe exploration in RL by learning a conservative safety estimate of environment states… ▽ More

    Submitted 26 April, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper in ICLR 2021

  30. arXiv:2009.11997  [pdf, other

    cs.LG cs.AI cs.RO

    Continual Model-Based Reinforcement Learning with Hypernetworks

    Authors: Yizhou Huang, Kevin Xie, Homanga Bharadhwaj, Florian Shkurti

    Abstract: Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is periodically re-trained from scratch on state transition experience collected from the beginning of environment interactions. This implies that the time required to t… ▽ More

    Submitted 29 March, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: 7 pages (+2 pages in appendix), 8 figures. To appear in the proc. of the 2021 IEEE International Conference on Robotics and Automation

  31. arXiv:2009.08577  [pdf

    cs.CY cs.RO

    Making Sense of the Robotized Pandemic Response: A Comparison of Global and Canadian Robot Deployments and Success Factors

    Authors: T. Barfoot, J. Burgner-Kahrs, E. Diller, A. Garg, A. Goldenberg, J. Kelly, X. Liu, H. E. Naguib, G. Nejat, A. P. Schoellig, F. Shkurti, H. Siegel, Y. Sun, S. L. Waslander, .

    Abstract: From disinfection and remote triage, to logistics and delivery, countries around the world are making use of robots to address the unique challenges presented by the COVID-19 pandemic. Robots are being used to manage the pandemic in Canada too, but relative to other regions, we have been more cautious in our adoption -- this despite the important role that robots of Canadian origin are now playing… ▽ More

    Submitted 21 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: 104 pages, 18 figures, 13 tables. Corresponding Author: H Siegel

  32. arXiv:2006.16235  [pdf, other

    cs.RO

    Vision-Based Goal-Conditioned Policies for Underwater Navigation in the Presence of Obstacles

    Authors: Travis Manderson, Juan Camilo Gamboa Higuera, Stefan Wapnick, Jean-François Tremblay, Florian Shkurti, David Meger, Gregory Dudek

    Abstract: We present Nav2Goal, a data-efficient and end-to-end learning method for goal-conditioned visual navigation. Our technique is used to train a navigation policy that enables a robot to navigate close to sparse geographic waypoints provided by a user without any prior map, all while avoiding obstacles and choosing paths that cover user-informed regions of interest. Our approach is based on recent ad… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: RSS 2020. Video and project details can be found at http://www.cim.mcgill.ca/mrl/nav2goal/

  33. arXiv:2005.10934  [pdf, other

    cs.RO cs.AI

    LEAF: Latent Exploration Along the Frontier

    Authors: Homanga Bharadhwaj, Animesh Garg, Florian Shkurti

    Abstract: Self-supervised goal proposal and reaching is a key component for exploration and efficient policy learning algorithms. Such a self-supervised approach without access to any oracle goal sampling distribution requires deep exploration and commitment so that long horizon plans can be efficiently discovered. In this paper, we propose an exploration framework, which learns a dynamics-aware manifold of… ▽ More

    Submitted 26 April, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Published as a conference paper in ICRA 2021

  34. arXiv:2004.08763  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization

    Authors: Homanga Bharadhwaj, Kevin Xie, Florian Shkurti

    Abstract: Recent works in high-dimensional model-predictive control and model-based reinforcement learning with learned dynamics and reward models have resorted to population-based optimization methods, such as the Cross-Entropy Method (CEM), for planning a sequence of actions. To decide on an action to take, CEM conducts a search for the action sequence with the highest return according to the dynamics mod… ▽ More

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: L4DC 2020; Accepted for presentation in the 2nd Annual Conference on Learning for Dynamics and Control

  35. arXiv:2003.10010  [pdf, other

    cs.RO cs.CV cs.LG

    One-Shot Informed Robotic Visual Search in the Wild

    Authors: Karim Koreitem, Florian Shkurti, Travis Manderson, Wei-Di Chang, Juan Camilo Gamboa Higuera, Gregory Dudek

    Abstract: We consider the task of underwater robot navigation for the purpose of collecting scientifically relevant video data for environmental monitoring. The majority of field robots that currently perform monitoring tasks in unstructured natural environments navigate via path-tracking a pre-specified sequence of waypoints. Although this navigation method is often necessary, it is limiting because the ro… ▽ More

    Submitted 3 September, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: Accepted at IROS 2020. Code https://github.com/rvl-lab-utoronto/visual_search_in_the_wild and videos https://www.youtube.com/watch?v=0i_el5XGCus

  36. arXiv:2003.07489  [pdf, other

    cs.RO cs.LG

    Catch the Ball: Accurate High-Speed Motions for Mobile Manipulators via Inverse Dynamics Learning

    Authors: Ke Dong, Karime Pereida, Florian Shkurti, Angela P. Schoellig

    Abstract: Mobile manipulators consist of a mobile platform equipped with one or more robot arms and are of interest for a wide array of challenging tasks because of their extended workspace and dexterity. Typically, mobile manipulators are deployed in slow-motion collaborative robot scenarios. In this paper, we consider scenarios where accurate high-speed motions are required. We introduce a framework for t… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: Paper manuscript submitted to IROS 2020

  37. arXiv:2003.04514  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Diversity inducing Information Bottleneck in Model Ensembles

    Authors: Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal, Hugo Larochelle, Animesh Garg, Florian Shkurti

    Abstract: Although deep learning models have achieved state-of-the-art performance on a number of vision tasks, generalization over high dimensional multi-modal data, and reliable predictive uncertainty estimation are still active areas of research. Bayesian approaches including Bayesian Neural Nets (BNNs) do not scale well to modern computer vision tasks, as they are difficult to train, and have poor gener… ▽ More

    Submitted 8 December, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: AAAI 2021. Samarth Sinha* and Homanga Bharadhwaj* contributed equally to this work

  38. arXiv:1709.08292  [pdf, other

    cs.RO cs.AI cs.LG

    Underwater Multi-Robot Convoying using Visual Tracking by Detection

    Authors: Florian Shkurti, Wei-Di Chang, Peter Henderson, Md Jahidul Islam, Juan Camilo Gamboa Higuera, Jimmy Li, Travis Manderson, Anqi Xu, Gregory Dudek, Junaed Sattar

    Abstract: We present a robust multi-robot convoying approach that relies on visual detection of the leading agent, thus enabling target following in unstructured 3-D environments. Our method is based on the idea of tracking-by-detection, which interleaves efficient model-based object detection with temporal filtering of image-based bounding box estimation. This approach has the important advantage of mitiga… ▽ More

    Submitted 24 September, 2017; originally announced September 2017.

    Comments: Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017

  39. arXiv:1708.04352  [pdf, other

    cs.AI

    Benchmark Environments for Multitask Learning in Continuous Domains

    Authors: Peter Henderson, Wei-Di Chang, Florian Shkurti, Johanna Hansen, David Meger, Gregory Dudek

    Abstract: As demand drives systems to generalize to various domains and problems, the study of multitask, transfer and lifelong learning has become an increasingly important pursuit. In discrete domains, performance on the Atari game suite has emerged as the de facto benchmark for assessing multitask learning. However, in continuous domains there is a lack of agreement on standard multitask evaluation envir… ▽ More

    Submitted 14 August, 2017; originally announced August 2017.

    Comments: Accepted at Lifelong Learning: A Reinforcement Learning Approach Workshop @ ICML, Sydney, Australia, 2017