Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Gothoskar, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.10454  [pdf, other

    cs.RO cs.AI

    Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness

    Authors: Aidan Curtis, George Matheos, Nishad Gothoskar, Vikash Mansinghka, Joshua Tenenbaum, Tomás Lozano-Pérez, Leslie Pack Kaelbling

    Abstract: Integrated task and motion planning (TAMP) has proven to be a valuable approach to generalizable long-horizon robotic manipulation and navigation problems. However, the typical TAMP problem formulation assumes full observability and deterministic action effects. These assumptions limit the ability of the planner to gather information and make decisions that are risk-aware. We propose a strategy fo… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  2. arXiv:2312.08715  [pdf, other

    cs.RO

    Bayes3D: fast learning and inference in structured generative models of 3D objects and scenes

    Authors: Nishad Gothoskar, Matin Ghavami, Eric Li, Aidan Curtis, Michael Noseworthy, Karen Chung, Brian Patton, William T. Freeman, Joshua B. Tenenbaum, Mirko Klukas, Vikash K. Mansinghka

    Abstract: Robots cannot yet match humans' ability to rapidly learn the shapes of novel 3D objects and recognize them robustly despite clutter and occlusion. We present Bayes3D, an uncertainty-aware perception system for structured 3D scenes, that reports accurate posterior uncertainty over 3D object shape, pose, and scene composition in the presence of clutter and occlusion. Bayes3D delivers these capabilit… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  3. arXiv:2302.03744  [pdf, other

    cs.CV

    3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation

    Authors: Guangyao Zhou, Nishad Gothoskar, Lirui Wang, Joshua B. Tenenbaum, Dan Gutfreund, Miguel Lázaro-Gredilla, Dileep George, Vikash K. Mansinghka

    Abstract: The ability to perceive and understand 3D scenes is crucial for many applications in computer vision and robotics. Inverse graphics is an appealing approach to 3D scene understanding that aims to infer the 3D scene structure from 2D images. In this paper, we introduce probabilistic modeling to the inverse graphics framework to quantify uncertainty and achieve robustness in 6D pose estimation tasks… ▽ More

    Submitted 6 September, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: ICCV 2023 camera ready

  4. arXiv:2208.02914  [pdf, other

    cs.AI

    Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind

    Authors: Tan Zhi-Xuan, Nishad Gothoskar, Falk Pollok, Dan Gutfreund, Joshua B. Tenenbaum, Vikash K. Mansinghka

    Abstract: To facilitate the development of new models to bridge the gap between machine and human social intelligence, the recently proposed Baby Intuitions Benchmark (arXiv:2102.11938) provides a suite of tasks designed to evaluate commonsense reasoning about agents' goals and actions that even young infants exhibit. Here we present a principled Bayesian solution to this benchmark, based on a hierarchicall… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 6 pages, 2 figures. Presented at the Robotics: Science and Systems 2022 Workshop on Social Intelligence in Humans and Robots

  5. arXiv:2202.03697  [pdf, other

    cs.RO

    DURableVS: Data-efficient Unsupervised Recalibrating Visual Servoing via online learning in a structured generative model

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Yasemin Bekiroglu, Abhishek Agarwal, Joshua B. Tenenbaum, Vikash K. Mansinghka, Dileep George

    Abstract: Visual servoing enables robotic systems to perform accurate closed-loop control, which is required in many applications. However, existing methods either require precise calibration of the robot kinematic model and cameras or use neural architectures that require large amounts of data to train. In this work, we present a method for unsupervised learning of visual servoing that does not require any… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  6. arXiv:2111.00312  [pdf, other

    cs.CV cs.AI

    3DP3: 3D Scene Perception via Probabilistic Programming

    Authors: Nishad Gothoskar, Marco Cusumano-Towner, Ben Zinberg, Matin Ghavamizadeh, Falk Pollok, Austin Garrett, Joshua B. Tenenbaum, Dan Gutfreund, Vikash K. Mansinghka

    Abstract: We present 3DP3, a framework for inverse graphics that uses inference in a structured generative model of objects, scenes, and images. 3DP3 uses (i) voxel models to represent the 3D shape of objects, (ii) hierarchical scene graphs to decompose scenes into objects and the contacts between them, and (iii) depth image likelihoods based on real-time graphics. Given an observed RGB-D image, 3DP3's infe… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  7. arXiv:2006.06803  [pdf, other

    stat.ML cs.LG

    Query Training: Learning a Worse Model to Infer Better Marginals in Undirected Graphical Models with Hidden Variables

    Authors: Miguel Lázaro-Gredilla, Wolfgang Lehrach, Nishad Gothoskar, Guangyao Zhou, Antoine Dedieu, Dileep George

    Abstract: Probabilistic graphical models (PGMs) provide a compact representation of knowledge that can be queried in a flexible way: after learning the parameters of a graphical model once, new probabilistic queries can be answered at test time without retraining. However, when using undirected PGMS with hidden variables, two sources of error typically compound in all but the simplest models (a) learning er… ▽ More

    Submitted 25 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  8. arXiv:2006.06620  [pdf, other

    cs.RO cs.AI cs.LG

    From proprioception to long-horizon planning in novel environments: A hierarchical RL model

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Dileep George

    Abstract: For an intelligent agent to flexibly and efficiently operate in complex environments, they must be able to reason at multiple levels of temporal, spatial, and conceptual abstraction. At the lower levels, the agent must interpret their proprioceptive inputs and control their muscles, and at the higher levels, the agent must select goals and plan how they will achieve those goals. It is clear that e… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  9. arXiv:2003.04474  [pdf, other

    cs.RO cs.AI

    Learning a generative model for robot control using visual feedback

    Authors: Nishad Gothoskar, Miguel Lázaro-Gredilla, Abhishek Agarwal, Yasemin Bekiroglu, Dileep George

    Abstract: We introduce a novel formulation for incorporating visual feedback in controlling robots. We define a generative model from actions to image observations of features on the end-effector. Inference in the model allows us to infer the robot state corresponding to target locations of the features. This, in turn, guides motion of the robot and allows for matching the target locations of the features i… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  10. arXiv:1905.00507  [pdf, other

    stat.ML cs.LG

    Learning higher-order sequential structure with cloned HMMs

    Authors: Antoine Dedieu, Nishad Gothoskar, Scott Swingle, Wolfgang Lehrach, Miguel Lázaro-Gredilla, Dileep George

    Abstract: Variable order sequence modeling is an important problem in artificial and natural intelligence. While overcomplete Hidden Markov Models (HMMs), in theory, have the capacity to represent long-term temporal structure, they often fail to learn and converge to local minima. We show that by constraining HMMs with a simple sparsity structure inspired by biology, we can make it learn variable order sequ… ▽ More

    Submitted 15 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.