Showing 1–5 of 5 results for author: Castro, M Y

  1. arXiv:2101.11071  [pdf, other]

    cs.LG cs.AI stat.ML

    The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

    Authors: William H. Guss, Mario Ynocente Castro, Sam Devlin, Brandon Houghton, Noboru Sean Kuno, Crissman Loomis, Stephanie Milani, Sharada Mohanty, Keisuke Nakata, Ruslan Salakhutdinov, John Schulman, Shinya Shiroshita, Nicholay Topin, Avinash Ummadisingu, Oriol Vinyals

    Abstract: Although deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples, affording only a shrinking segment of the AI community access to their development. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we propose this second iteration of the MineR…

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 37 pages, initial submission, accepted at NeurIPS. arXiv admin note: substantial text overlap with arXiv:1904.10079

  2. arXiv:2011.11991  [pdf, ps, other]

    cs.LG cs.RO

    Discovering Avoidable Planner Failures of Autonomous Vehicles using Counterfactual Analysis in Behaviorally Diverse Simulation

    Authors: Daisuke Nishiyama, Mario Ynocente Castro, Shirou Maruyama, Shinya Shiroshita, Karim Hamzaoui, Yi Ouyang, Guy Rosman, Jonathan DeCastro, Kuan-Hui Lee, Adrien Gaidon

    Abstract: Automated Vehicles require exhaustive testing in simulation to detect as many safety-critical failures as possible before deployment on public roads. In this work, we focus on the core decision-making component of autonomous robots: their planning algorithm. We introduce a planner testing framework that leverages recent progress in simulating behaviorally diverse traffic participants. Using large…

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 8 pages, 8 figures

    Journal ref: The 23rd IEEE International Conference on Intelligent Transportation Systems (ITSC2020)

  3. arXiv:2011.05741  [pdf, ps, other]

    cs.LG cs.RO

    Behaviorally Diverse Traffic Simulation via Reinforcement Learning

    Authors: Shinya Shiroshita, Shirou Maruyama, Daisuke Nishiyama, Mario Ynocente Castro, Karim Hamzaoui, Guy Rosman, Jonathan DeCastro, Kuan-Hui Lee, Adrien Gaidon

    Abstract: Traffic simulators are important tools in autonomous driving development. While continuous progress has been made to provide developers more options for modeling various traffic participants, tuning these models to increase their behavioral diversity while maintaining quality is often very challenging. This paper introduces an easily-tunable policy generation algorithm for autonomous driving agent…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 8 pages, 16 figures

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 2103-2110

  4. arXiv:2007.08082  [pdf, other]

    cs.RO cs.AI cs.DC cs.LG stat.ML

    Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators

    Authors: Yasuhiro Fujita, Kota Uenishi, Avinash Ummadisingu, Prabhat Nagarajan, Shimpei Masuda, Mario Ynocente Castro

    Abstract: Developing personal robots that can perform a diverse range of manipulation tasks in unstructured environments necessitates solving several challenges for robotic grasping systems. We take a step towards this broader goal by presenting the first RL-based system, to our knowledge, for a mobile manipulator that can (a) achieve targeted grasping generalizing to unseen target objects, (b) learn comple…

    Submitted 14 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Accepted at IROS 2020

  5. arXiv:1803.07616  [pdf, other]

    cs.AI cs.CV

    IntPhys: A Framework and Benchmark for Visual Intuitive Physics Reasoning

    Authors: Ronan Riochet, Mario Ynocente Castro, Mathieu Bernard, Adam Lerer, Rob Fergus, Véronique Izard, Emmanuel Dupoux

    Abstract: In order to reach human performance on complex visual tasks, artificial systems need to incorporate a significant amount of understanding of the world in terms of macroscopic objects, movements, forces, etc. Inspired by work on intuitive physics in infants, we propose an evaluation benchmark which diagnoses how much a given system understands about physics by testing whether it can tell apart well matc…

    Submitted 11 February, 2020; v1 submitted 20 March, 2018; originally announced March 2018.