Showing 1–43 of 43 results for author: Warnell, G

Searching in archive cs.
  1. arXiv:2407.01862  [pdf, other]

    cs.RO

    Autonomous Ground Navigation in Highly Constrained Spaces: Lessons learned from The 3rd BARN Challenge at ICRA 2024

    Authors: Xuesu Xiao, Zifan Xu, Aniket Datar, Garrett Warnell, Peter Stone, Joshua Julian Damanik, Jaewon Jung, Chala Adane Deresa, Than Duc Huy, Chen Jinyu, Chen Yichen, Joshua Adrian Cahyono, Jingda Wu, Longfei Mo, Mingyang Lv, Bowen Lan, Qingyang Meng, Weizhi Tao, Li Cheng

    Abstract: The 3rd BARN (Benchmark Autonomous Robot Navigation) Challenge took place at the 2024 IEEE International Conference on Robotics and Automation (ICRA 2024) in Yokohama, Japan and continued to evaluate the performance of state-of-the-art autonomous ground navigation systems in highly constrained environments. Similar to the trend in The 1st and 2nd BARN Challenge at ICRA 2022 and 2023 in Philadelphi…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.03205

  2. arXiv:2404.18798  [pdf, other]

    cs.MA

    Multi-Agent Synchronization Tasks

    Authors: Rolando Fernandez, Garrett Warnell, Derrik E. Asher, Peter Stone

    Abstract: In multi-agent reinforcement learning (MARL), coordination plays a crucial role in enhancing agents' performance beyond what they could achieve through cooperation alone. The interdependence of agents' actions, coupled with the need for communication, leads to a domain where effective coordination is crucial. In this paper, we introduce and define $\textit{Multi-Agent Synchronization Tasks}$ (MSTs…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Adaptive Learning Agents Workshop at AAMAS 2024

  3. arXiv:2309.15302  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience

    Authors: Haresh Karnan, Elvin Yang, Daniel Farkash, Garrett Warnell, Joydeep Biswas, Peter Stone

    Abstract: Terrain awareness, i.e., the ability to identify and distinguish different types of terrain, is a critical ability that robots must have to succeed at autonomous off-road navigation. Current approaches that provide robots with this awareness either rely on labeled data which is expensive to collect, engineered features and cost functions that may not generalize, or expert human demonstrations whic…

    Submitted 20 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Project website: https://hareshkarnan.github.io/sterling/

    Journal ref: Conference on Robot Learning (CoRL 2023)

  4. arXiv:2309.09912  [pdf, other]

    cs.RO cs.AI cs.LG

    Wait, That Feels Familiar: Learning to Extrapolate Human Preferences for Preference Aligned Path Planning

    Authors: Haresh Karnan, Elvin Yang, Garrett Warnell, Joydeep Biswas, Peter Stone

    Abstract: Autonomous mobility tasks such as lastmile delivery require reasoning about operator indicated preferences over terrains on which the robot should navigate to ensure both robot safety and mission success. However, coping with out of distribution data from novel terrains or appearance changes due to lighting variations remains a fundamental problem in visual terrain adaptive navigation. Existing so…

    Submitted 18 September, 2023; originally announced September 2023.

    Journal ref: Under Submission to ICRA 2024

  5. arXiv:2308.03205  [pdf, other]

    cs.RO

    Autonomous Ground Navigation in Highly Constrained Spaces: Lessons learned from The 2nd BARN Challenge at ICRA 2023

    Authors: Xuesu Xiao, Zifan Xu, Garrett Warnell, Peter Stone, Ferran Gebelli Guinjoan, Romulo T. Rodrigues, Herman Bruyninckx, Hanjaya Mandala, Guilherme Christmann, Jose Luis Blanco-Claraco, Shravan Somashekara Rai

    Abstract: The 2nd BARN (Benchmark Autonomous Robot Navigation) Challenge took place at the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023) in London, UK and continued to evaluate the performance of state-of-the-art autonomous ground navigation systems in highly constrained environments. Compared to The 1st BARN Challenge at ICRA 2022 in Philadelphia, the competition has grown signi…

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2208.10473

  6. arXiv:2211.04005  [pdf, other]

    cs.LG cs.AI

    ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning

    Authors: Eddy Hudson, Ishan Durugkar, Garrett Warnell, Peter Stone

    Abstract: Given a dataset of expert agent interactions with an environment of interest, a viable method to extract an effective agent policy is to estimate the maximum likelihood policy indicated by this data. This approach is commonly referred to as behavioral cloning (BC). In this work, we describe a key disadvantage of BC that arises due to the maximum likelihood objective function; namely that BC is mea…

    Submitted 7 November, 2022; originally announced November 2022.

  7. arXiv:2210.14428  [pdf, other]

    cs.LG cs.AI cs.HC cs.RO

    D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning

    Authors: Caroline Wang, Garrett Warnell, Peter Stone

    Abstract: While combining imitation learning (IL) and reinforcement learning (RL) is a promising way to address poor sample efficiency in autonomous behavior acquisition, methods that do so typically assume that the requisite behavior demonstrations are provided by an expert that behaves optimally with respect to a task reward. If, however, suboptimal demonstrations are provided, a fundamental challenge app…

    Submitted 11 March, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    ACM Class: I.2.8; I.2.9; I.2.6

  8. arXiv:2209.13641  [pdf, other]

    cs.RO

    Learning Perceptual Hallucination for Multi-Robot Navigation in Narrow Hallways

    Authors: Jin-Soo Park, Xuesu Xiao, Garrett Warnell, Harel Yedidsion, Peter Stone

    Abstract: While current systems for autonomous robot navigation can produce safe and efficient motion plans in static environments, they usually generate suboptimal behaviors when multiple robots must navigate together in confined spaces. For example, when two robots meet each other in a narrow hallway, they may either turn around to find an alternative route or collide with each other. This paper presents…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 6+1 pages, 5 figures

  9. arXiv:2208.10473  [pdf, other]

    cs.RO

    Autonomous Ground Navigation in Highly Constrained Spaces: Lessons learned from The BARN Challenge at ICRA 2022

    Authors: Xuesu Xiao, Zifan Xu, Zizhao Wang, Yunlong Song, Garrett Warnell, Peter Stone, Tingnan Zhang, Shravan Ravi, Gary Wang, Haresh Karnan, Joydeep Biswas, Nicholas Mohammad, Lauren Bramblett, Rahul Peddi, Nicola Bezzo, Zhanteng Xie, Philip Dames

    Abstract: The BARN (Benchmark Autonomous Robot Navigation) Challenge took place at the 2022 IEEE International Conference on Robotics and Automation (ICRA 2022) in Philadelphia, PA. The aim of the challenge was to evaluate state-of-the-art autonomous ground navigation systems for moving robots through highly constrained environments in a safe and efficient manner. Specifically, the task was to navigate a st…

    Submitted 22 August, 2022; originally announced August 2022.

  10. arXiv:2203.15983  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics

    Authors: Haresh Karnan, Kavan Singh Sikand, Pranav Atreya, Sadegh Rabiee, Xuesu Xiao, Garrett Warnell, Peter Stone, Joydeep Biswas

    Abstract: One of the key challenges in high speed off road navigation on ground vehicles is that the kinodynamics of the vehicle terrain interaction can differ dramatically depending on the terrain. Previous approaches to addressing this challenge have considered learning an inverse kinodynamics (IKD) model, conditioned on inertial information of the vehicle to sense the kinodynamic interactions. In this pa…

    Submitted 1 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  11. arXiv:2203.15041  [pdf, other]

    cs.RO cs.CV cs.LG eess.SY

    Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation

    Authors: Haresh Karnan, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Soeren Pirk, Alexander Toshev, Justin Hart, Joydeep Biswas, Peter Stone

    Abstract: Social navigation is the capability of an autonomous agent, such as a robot, to navigate in a 'socially compliant' manner in the presence of other intelligent agents such as humans. With the emergence of autonomously navigating mobile robots in human populated environments (e.g., domestic service robots in homes and restaurants and food delivery robots on public sidewalks), incorporating socially…

    Submitted 8 June, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Journal ref: Robotics and Automation Letters (RA-L) 2022

  12. arXiv:2202.00243  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Adversarial Imitation Learning from Video using a State Observer

    Authors: Haresh Karnan, Garrett Warnell, Faraz Torabi, Peter Stone

    Abstract: The imitation learning research community has recently made significant progress towards the goal of enabling artificial agents to imitate behaviors from video demonstrations alone. However, current state-of-the-art approaches developed for this problem exhibit high sample complexity due, in part, to the high-dimensional nature of video observations. Towards addressing this issue, we introduce her…

    Submitted 26 July, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Journal ref: International Conference on Robotics and Automation (ICRA) 2022

  13. arXiv:2109.08968  [pdf, other]

    cs.RO cs.LG

    Visual Representation Learning for Preference-Aware Path Planning

    Authors: Kavan Singh Sikand, Sadegh Rabiee, Adam Uccello, Xuesu Xiao, Garrett Warnell, Joydeep Biswas

    Abstract: Autonomous mobile robots deployed in outdoor environments must reason about different types of terrain for both safety (e.g., prefer dirt over mud) and deployer preferences (e.g., prefer dirt path over flower beds). Most existing solutions to this preference-aware path planning problem use semantic segmentation to classify terrain types from camera images, and then ascribe costs to each type. Unfo…

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: 7 pages, 6 figures

  14. APPLE: Adaptive Planner Parameter Learning from Evaluative Feedback

    Authors: Zizhao Wang, Xuesu Xiao, Garrett Warnell, Peter Stone

    Abstract: Classical autonomous navigation systems can control robots in a collision-free manner, oftentimes with verifiable safety and explainability. When facing new environments, however, fine-tuning of the system parameters by an expert is typically required before the system can navigate as expected. To alleviate this requirement, the recently-proposed Adaptive Planner Parameter Learning paradigm allows…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: 6 pages, 4 figures, accepted in IROS 2021. arXiv admin note: substantial text overlap with arXiv:2105.07620

  15. Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks

    Authors: Ruohan Zhang, Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: A longstanding goal of artificial intelligence is to create artificial agents capable of learning to perform tasks that require sequential decision making. Importantly, while it is the artificial agent that learns and acts, it is still up to humans to specify the particular task to be performed. Classical task-specification approaches typically involve humans providing stationary reward functions…

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Springer journal, Autonomous Agents and Multi-Agent Systems (JAAMAS)

    Journal ref: JAAMAS 35 (2021) 1-39

  16. arXiv:2105.09371  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    VOILA: Visual-Observation-Only Imitation Learning for Autonomous Navigation

    Authors: Haresh Karnan, Garrett Warnell, Xuesu Xiao, Peter Stone

    Abstract: While imitation learning for vision based autonomous mobile robot navigation has recently received a great deal of attention in the research community, existing approaches typically require state action demonstrations that were gathered using the deployment platform. However, what if one cannot easily outfit their platform to record these demonstration signals or worse yet the demonstrator does no…

    Submitted 8 October, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

    Comments: Under Submission to ICRA+RAL 2022

    Journal ref: ICRA 2022

  17. arXiv:2105.07620  [pdf, other]

    cs.RO

    APPL: Adaptive Planner Parameter Learning

    Authors: Xuesu Xiao, Zizhao Wang, Zifan Xu, Bo Liu, Garrett Warnell, Gauraang Dhamankar, Anirudh Nair, Peter Stone

    Abstract: While current autonomous navigation systems allow robots to successfully drive themselves from one point to another in specific environments, they typically require extensive manual parameter re-tuning by human robotics experts in order to function in new environments. Furthermore, even for just one complex environment, a single set of fine-tuned parameters may not work well in different regions o…

    Submitted 18 May, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

  18. arXiv:2105.03756  [pdf, other]

    cs.LG

    RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning

    Authors: Eddy Hudson, Garrett Warnell, Peter Stone

    Abstract: While Adversarial Imitation Learning (AIL) algorithms have recently led to state-of-the-art results on various imitation learning benchmarks, it is unclear as to what impact various design decisions have on performance. To this end, we present here an organizing, modular framework called Reinforcement-learning-based Adversarial Imitation Learning (RAIL) that encompasses and generalizes a popular s…

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2104.07810

  19. arXiv:2104.07810  [pdf, other]

    cs.LG cs.RO

    Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch

    Authors: Eddy Hudson, Garrett Warnell, Faraz Torabi, Peter Stone

    Abstract: Learning from demonstrations in the wild (e.g. YouTube videos) is a tantalizing goal in imitation learning. However, for this goal to be achieved, imitation learning algorithms must deal with the fact that the demonstrators and learners may have bodies that differ from one another. This condition -- "embodiment mismatch" -- is ignored by many recent imitation learning algorithms. Our proposed imit…

    Submitted 12 February, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

  20. arXiv:2104.00163  [pdf, other]

    cs.LG cs.AI

    DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation

    Authors: Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: In imitation learning from observation (IfO), a learning agent seeks to imitate a demonstrating agent using only observations of the demonstrated behavior without access to the control signals generated by the demonstrator. Recent methods based on adversarial imitation learning have led to state-of-the-art performance on IfO problems, but they typically suffer from high sample complexity due to a re…

    Submitted 31 March, 2021; originally announced April 2021.

  21. arXiv:2011.13112  [pdf, other]

    cs.RO

    Motion Planning and Control for Mobile Robot Navigation Using Machine Learning: a Survey

    Authors: Xuesu Xiao, Bo Liu, Garrett Warnell, Peter Stone

    Abstract: Moving in complex environments is an essential capability of intelligent mobile robots. Decades of research and engineering have been dedicated to developing sophisticated navigation systems to move mobile robots from one point to another. Despite their overall success, a recently emerging research thrust is devoted to developing machine learning techniques to address the same problem, based in la…

    Submitted 25 February, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

  22. arXiv:2011.00400  [pdf, other]

    cs.RO

    APPLI: Adaptive Planner Parameter Learning From Interventions

    Authors: Zizhao Wang, Xuesu Xiao, Bo Liu, Garrett Warnell, Peter Stone

    Abstract: While classical autonomous navigation systems can typically move robots from one point to another safely and in a collision-free manner, these systems may fail or produce suboptimal behavior in certain scenarios. The current practice in such scenarios is to manually re-tune the system's parameters, e.g. max speed, sampling rate, inflation radius, to optimize performance. This practice requires exp…

    Submitted 22 August, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures

  23. arXiv:2011.00397  [pdf, other]

    cs.RO

    APPLR: Adaptive Planner Parameter Learning from Reinforcement

    Authors: Zifan Xu, Gauraang Dhamankar, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Bo Liu, Zizhao Wang, Peter Stone

    Abstract: Classical navigation systems typically operate using a fixed set of hand-picked parameters (e.g. maximum speed, sampling rate, inflation radius, etc.) and require heavy expert re-tuning in order to work in new environments. To mitigate this requirement, it has been proposed to learn parameters for different contexts in a new environment using human demonstrations collected via teleoperation. Howev…

    Submitted 31 October, 2020; originally announced November 2020.

  24. arXiv:2009.13736  [pdf, other]

    cs.LG cs.AI stat.ML

    Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy

    Authors: Yunshu Du, Garrett Warnell, Assefaw Gebremedhin, Peter Stone, Matthew E. Taylor

    Abstract: Experience replay (ER) improves the data efficiency of off-policy reinforcement learning (RL) algorithms by allowing an agent to store and reuse its past experiences in a replay buffer. While many techniques have been proposed to enhance ER by biasing how experiences are sampled from the buffer, thus far they have not considered strategies for refreshing experiences inside the buffer. In this work…

    Submitted 3 April, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 29 pages (with appendices), 8 figures, preprint

  25. arXiv:2008.01594  [pdf, other]

    cs.AI cs.LG

    An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

    Authors: Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone

    Abstract: We examine the problem of transferring a policy learned in a source environment to a target environment with different dynamics, particularly in the case where it is critical to reduce the amount of interaction with the target environment during learning. This problem is particularly important in sim-to-real transfer because simulators inevitably model real-world dynamics imperfectly. In this pape…

    Submitted 16 November, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Journal ref: Neural Information Processing Systems (NeurIPS 2020)

  26. arXiv:2008.01281  [pdf, other]

    cs.RO

    Stochastic Grounded Action Transformation for Robot Learning in Simulation

    Authors: Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, Peter Stone

    Abstract: Robot control policies learned in simulation do not often transfer well to the real world. Many existing solutions to this sim-to-real problem, such as the Grounded Action Transformation (GAT) algorithm, seek to correct for or ground these differences by matching the simulator to the real world. However, the efficacy of these approaches is limited if they do not explicitly account for stochasticit…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted at the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020)

  27. arXiv:2008.01279  [pdf, other]

    cs.RO

    Reinforced Grounded Action Transformation for Sim-to-Real Transfer

    Authors: Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, Peter Stone

    Abstract: Robots can learn to do complex tasks in simulation, but often, learned behaviors fail to transfer well to the real world due to simulator imperfections (the reality gap). Some existing solutions to this sim-to-real problem, such as Grounded Action Transformation (GAT), use a small amount of real-world experience to minimize the reality gap by grounding the simulator. While very effective in certai…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted at International Conference on Intelligent Robots and Systems (IROS) 2020

  28. arXiv:2007.14479  [pdf, other]

    cs.RO

    Toward Agile Maneuvers in Highly Constrained Spaces: Learning from Hallucination

    Authors: Xuesu Xiao, Bo Liu, Garrett Warnell, Peter Stone

    Abstract: While classical approaches to autonomous robot navigation currently enable operation in certain environments, they break down in tightly constrained spaces, e.g., where the robot needs to engage in agile maneuvers to squeeze between obstacles. Recent machine learning techniques have the potential to address this shortcoming, but existing approaches require vast amounts of navigation experience for…

    Submitted 19 January, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted by IEEE Robotics and Automation Letters (RA-L)

  29. arXiv:2004.00116  [pdf, other]

    cs.RO cs.LG

    APPLD: Adaptive Planner Parameter Learning from Demonstration

    Authors: Xuesu Xiao, Bo Liu, Garrett Warnell, Jonathan Fink, Peter Stone

    Abstract: Existing autonomous robot navigation systems allow robots to move from one point to another in a collision-free manner. However, when facing new environments, these systems generally require re-tuning by expert roboticists with a good understanding of the inner workings of the navigation system. In contrast, even users who are unversed in the details of robot navigation algorithms can generate des…

    Submitted 15 July, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: Accepted by Robotics and Automation Letters (RAL) and International Conference on Intelligent Robots and Systems (IROS) 2020

  30. arXiv:1911.00497  [pdf, other]

    cs.AI cs.CL cs.LG

    A Narration-based Reward Shaping Approach using Grounded Natural Language Commands

    Authors: Nicholas Waytowich, Sean L. Barton, Vernon Lawhern, Garrett Warnell

    Abstract: While deep reinforcement learning techniques have led to agents that are successfully able to learn to perform a number of tasks that had been previously unlearnable, these techniques are still susceptible to the longstanding problem of reward sparsity. This is especially true for tasks such as training an agent to play StarCraft II, a real-time strategy game where reward is only given at the end…

    Submitted 31 October, 2019; originally announced November 2019.

    Comments: Presented at the Imitation, Intent and Interaction (I3) workshop, ICML 2019. arXiv admin note: substantial text overlap with arXiv:1906.02671

  31. arXiv:1906.07374  [pdf, other]

    cs.LG stat.ML

    Sample-efficient Adversarial Imitation Learning from Observation

    Authors: Faraz Torabi, Sean Geiger, Garrett Warnell, Peter Stone

    Abstract: Imitation from observation is the framework of learning tasks by observing demonstrated state-only trajectories. Recently, adversarial approaches have achieved significant performance improvements over other methods for imitating complex behaviors. However, these adversarial imitation algorithms often require many demonstration examples and learning iterations to produce a policy that is successfu…

    Submitted 18 June, 2019; originally announced June 2019.

  32. arXiv:1906.07372  [pdf, other]

    cs.LG cs.RO stat.ML

    RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

    Authors: Brahma S. Pavse, Faraz Torabi, Josiah P. Hanna, Garrett Warnell, Peter Stone

    Abstract: Augmenting reinforcement learning with imitation learning is often hailed as a method by which to improve upon learning from scratch. However, most existing methods for integrating these two techniques are subject to several strong assumptions---chief among them that information about demonstrator actions is available. In this paper, we investigate the extent to which this assumption is necessary…

    Submitted 21 July, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: IEEE Robotics and Automation Letters, presented at International Conference on Intelligent Robots and Systems (IROS 2020)

  33. arXiv:1906.02671  [pdf, other]

    cs.MM cs.CL cs.LG cs.NE cs.RO

    Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

    Authors: Nicholas Waytowich, Sean L. Barton, Vernon Lawhern, Ethan Stump, Garrett Warnell

    Abstract: While deep reinforcement learning techniques have led to agents that are successfully able to learn to perform a number of tasks that had been previously unlearnable, these techniques are still susceptible to the longstanding problem of {\em reward sparsity}. This is especially true for tasks such as training an agent to play StarCraft II, a real-time strategy game where reward is only given at th…

    Submitted 24 April, 2019; originally announced June 2019.

    Comments: 10 pages, 3 figures. Published at SPIE 2019

  34. arXiv:1905.13566  [pdf, ps, other]

    cs.RO cs.AI cs.LG

    Recent Advances in Imitation Learning from Observation

    Authors: Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: Imitation learning is the process by which one agent tries to learn how to perform a certain task using information generated by another, often more-expert agent performing that same task. Conventionally, the imitator has access to both state and action information generated by an expert performing the task (e.g., the expert may provide a kinesthetic demonstration of object placement using a robot…

    Submitted 18 June, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI 2019)

  35. arXiv:1905.09335  [pdf, other]

    cs.LG stat.ML

    Imitation Learning from Video by Leveraging Proprioception

    Authors: Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: Classically, imitation learning algorithms have been developed for idealized situations, e.g., the demonstrations are often required to be collected in the exact same environment and usually include the demonstrator's actions. Recently, however, the research community has begun to address some of these shortcomings by offering algorithmic solutions that enable imitation learning from observation (…

    Submitted 18 June, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI 2019)

  36. arXiv:1809.05676  [pdf, other]

    cs.AI

    Deterministic Implementations for Reproducibility in Deep Reinforcement Learning

    Authors: Prabhat Nagarajan, Garrett Warnell, Peter Stone

    Abstract: While deep reinforcement learning (DRL) has led to numerous successes in recent years, reproducing these successes can be extremely challenging. One reproducibility challenge particularly relevant to DRL is nondeterminism in the training process, which can substantially affect the results. Motivated by this challenge, we study the positive impacts of deterministic implementations in eliminating no…

    Submitted 9 June, 2019; v1 submitted 15 September, 2018; originally announced September 2018.

    Comments: 17 Pages

  37. arXiv:1807.06158  [pdf, other]

    cs.LG cs.AI stat.ML

    Generative Adversarial Imitation from Observation

    Authors: Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions. The lack of action information both distinguishes IfO from most of the literature in imitation learning, and also sets it apart as a method that may enable agents to learn from a large set of previously inapplicable resources such as internet vide…

    Submitted 18 June, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

  38. arXiv:1805.01954  [pdf, other]

    cs.AI

    Behavioral Cloning from Observation

    Authors: Faraz Torabi, Garrett Warnell, Peter Stone

    Abstract: Humans often learn how to perform tasks via imitation: they observe others perform a task, and then very quickly infer the appropriate actions to take based on their observations. While extending this paradigm to autonomous agents is a well-studied problem in general, there are two particular aspects that have largely been overlooked: (1) that the learning is done from observation only (i.e., with…

    Submitted 11 May, 2018; v1 submitted 4 May, 2018; originally announced May 2018.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI 2018)

  39. arXiv:1709.10163  [pdf, other]

    cs.AI cs.LG

    Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces

    Authors: Garrett Warnell, Nicholas Waytowich, Vernon Lawhern, Peter Stone

    Abstract: While recent advances in deep reinforcement learning have allowed autonomous learning agents to succeed at a variety of complex tasks, existing algorithms generally require a lot of training data. One way to increase the speed at which agents are able to learn to perform tasks is by leveraging the input of human trainers. Although such input can take many forms, real-time, scalar-valued feedback i…

    Submitted 19 January, 2018; v1 submitted 28 September, 2017; originally announced September 2017.

    Comments: 9 pages, 6 figures

  40. arXiv:1612.04111  [pdf, other]

    stat.ML cs.LG

    Parsimonious Online Learning with Kernels via Sparse Projections in Function Space

    Authors: Alec Koppel, Garrett Warnell, Ethan Stump, Alejandro Ribeiro

    Abstract: Despite their attractiveness, popular perception is that techniques for nonparametric function approximation do not scale to streaming data due to an intractable growth in the amount of storage they require. To solve this problem in a memory-affordable way, we propose an online technique based on functional stochastic gradient descent in tandem with supervised sparsification based on greedy functi…

    Submitted 13 December, 2016; originally announced December 2016.

    Comments: Submitted to JMLR on 11/24/2016

  41. arXiv:1609.04794  [pdf, other]

    cs.RO

    Semantics for UGV Registration in GPS-denied Environments

    Authors: Gordon Christie, Garrett Warnell, Kevin Kochersberger

    Abstract: Localization in a global map is critical to success in many autonomous robot missions. This is particularly challenging for multi-robot operations in unknown and adverse environments. Here, we are concerned with providing a small unmanned ground vehicle (UGV) the ability to localize itself within a 2.5D aerial map generated from imagery captured by a low-flying unmanned aerial vehicle (UAV). We co…

    Submitted 19 September, 2016; v1 submitted 15 September, 2016; originally announced September 2016.

  42. arXiv:1605.01107  [pdf, other]

    stat.ML cs.LG

    Decentralized Dynamic Discriminative Dictionary Learning

    Authors: Alec Koppel, Garrett Warnell, Ethan Stump, Alejandro Ribeiro

    Abstract: We consider discriminative dictionary learning in a distributed online setting, where a network of agents aims to learn a common set of dictionary elements of a feature space and model parameters while sequentially receiving observations. We formulate this problem as a distributed stochastic program with a non-convex objective and present a block variant of the Arrow-Hurwicz saddle point algorithm…

    Submitted 3 May, 2016; originally announced May 2016.

  43. Adaptive-Rate Compressive Sensing Using Side Information

    Authors: Garrett Warnell, Sourabh Bhattacharya, Rama Chellappa, Tamer Basar

    Abstract: We provide two novel adaptive-rate compressive sensing (CS) strategies for sparse, time-varying signals using side information. Our first method utilizes extra cross-validation measurements, and the second one exploits extra low-resolution measurements. Unlike the majority of current CS techniques, we do not assume that we know an upper bound on the number of significant coefficients that comprise…

    Submitted 2 January, 2014; originally announced January 2014.