Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Vemprala, S

.
  1. arXiv:2408.05336  [pdf, other

    cs.RO cs.AI

    Logically Constrained Robotics Transformers for Enhanced Perception-Action Planning

    Authors: Parv Kapoor, Sai Vemprala, Ashish Kapoor

    Abstract: With the advent of large foundation model based planning, there is a dire need to ensure their output aligns with the stakeholder's intent. When these models are deployed in the real world, the need for alignment is magnified due to the potential cost to life and infrastructure due to unexpected faliures. Temporal Logic specifications have long provided a way to constrain system behaviors and are… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Robotics Science and Systems: Towards Safe Autonomy

  2. arXiv:2310.02437  [pdf, other

    cs.CV

    EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields

    Authors: Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta

    Abstract: We present EvDNeRF, a pipeline for generating event data and training an event-based dynamic NeRF, for the purpose of faithfully reconstructing eventstreams on scenes with rigid and non-rigid deformations that may be too fast to capture with a standard camera. Event cameras register asynchronous per-pixel brightness changes at MHz rates with high dynamic range, making them ideal for observing fast… ▽ More

    Submitted 6 December, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 16 pages, 20 figures, 2 tables

  3. arXiv:2310.00887  [pdf, other

    cs.RO cs.AI cs.LG

    GRID: A Platform for General Robot Intelligence Development

    Authors: Sai Vemprala, Shuhang Chen, Abhinav Shukla, Dinesh Narayanan, Ashish Kapoor

    Abstract: Developing machine intelligence abilities in robots and autonomous systems is an expensive and time consuming process. Existing solutions are tailored to specific applications and are harder to generalize. Furthermore, scarcity of training data adds a layer of complexity in deploying deep machine learning models. We present a new platform for General Robot Intelligence Development (GRID) to addres… ▽ More

    Submitted 7 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  4. arXiv:2307.07909  [pdf, other

    cs.AI

    Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

    Authors: Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, Shuang Ma

    Abstract: We introduce DualMind, a generalist agent designed to tackle various decision-making tasks that addresses challenges posed by current methods, such as overfitting behaviors and dependence on task-specific fine-tuning. DualMind uses a novel "Dual-phase" training strategy that emulates how humans learn to act in the world. The model first learns fundamental common knowledge through a self-supervised… ▽ More

    Submitted 9 October, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  5. arXiv:2306.17582  [pdf, other

    cs.AI cs.CL cs.HC cs.LG cs.RO

    ChatGPT for Robotics: Design Principles and Model Abilities

    Authors: Sai Vemprala, Rogerio Bonatti, Arthur Bucker, Ashish Kapoor

    Abstract: This paper presents an experimental study regarding the use of OpenAI's ChatGPT for robotics applications. We outline a strategy that combines design principles for prompt engineering and the creation of a high-level function library which allows ChatGPT to adapt to different robotics tasks, simulators, and form factors. We focus our evaluations on the effectiveness of different prompt engineering… ▽ More

    Submitted 19 July, 2023; v1 submitted 20 February, 2023; originally announced June 2023.

  6. arXiv:2303.04212  [pdf, other

    cs.RO cs.LG

    ConBaT: Control Barrier Transformer for Safe Policy Learning

    Authors: Yue Meng, Sai Vemprala, Rogerio Bonatti, Chuchu Fan, Ashish Kapoor

    Abstract: Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  7. arXiv:2211.15286  [pdf, other

    cs.CV

    Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022

    Authors: Jiachen Lei, Shuang Ma, Zhongjie Ba, Sai Vemprala, Ashish Kapoor, Kui Ren

    Abstract: In this report, we present our approach and empirical results of applying masked autoencoders in two egocentric video understanding tasks, namely, Object State Change Classification and PNR Temporal Localization, of Ego4D Challenge 2022. As team TheSSVL, we ranked 2nd place in both tasks. Our code will be made available.

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: 5 pages

  8. arXiv:2210.16294  [pdf

    cs.LG cs.MA

    Learning Modular Simulations for Homogeneous Systems

    Authors: Jayesh K. Gupta, Sai Vemprala, Ashish Kapoor

    Abstract: Complex systems are often decomposed into modular subsystems for engineering tractability. Although various equation based white-box modeling techniques make use of such structure, learning based methods have yet to incorporate these ideas broadly. We present a modular simulation framework for modeling homogeneous multibody dynamical systems, which combines ideas from graph neural networks and neu… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally. Accepted at NeurIPS 2022

  9. arXiv:2209.11133  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training

    Authors: Rogerio Bonatti, Sai Vemprala, Shuang Ma, Felipe Frujeri, Shuhang Chen, Ashish Kapoor

    Abstract: Robotics has long been a field riddled with complex systems architectures whose modules and connections, whether traditional or learning-based, require significant human expertise and prior knowledge. Inspired by large pre-trained language models, this work introduces a paradigm for pre-training a general purpose representation that can serve as a starting point for multiple tasks on a given robot… ▽ More

    Submitted 23 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  10. arXiv:2209.10986  [pdf, other

    cs.RO cs.CV

    Learning to Simulate Realistic LiDARs

    Authors: Benoit Guillard, Sai Vemprala, Jayesh K. Gupta, Ondrej Miksik, Vibhav Vineet, Pascal Fua, Ashish Kapoor

    Abstract: Simulating realistic sensors is a challenging part in data generation for autonomous systems, often involving carefully handcrafted sensor design, scene properties, and physics modeling. To alleviate this, we introduce a pipeline for data-driven simulation of a realistic LiDAR sensor. We propose a model that learns a mapping between RGB images and corresponding LiDAR features such as raydrop or pe… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: IROS2022 paper

  11. arXiv:2208.02918  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    LATTE: LAnguage Trajectory TransformEr

    Authors: Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Sai Vemprala, Rogerio Bonatti

    Abstract: Natural language is one of the most intuitive ways to express human intent. However, translating instructions and commands towards robotic motion generation and deployment in the real world is far from being an easy task. The challenge of combining a robot's inherent low-level geometric and kinodynamic constraints with a human's high-level semantic instructions traditionally is solved using task-s… ▽ More

    Submitted 16 September, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

  12. arXiv:2204.08945  [pdf, other

    cs.CV cs.AI cs.LG

    Missingness Bias in Model Debugging

    Authors: Saachi Jain, Hadi Salman, Eric Wong, Pengchuan Zhang, Vibhav Vineet, Sai Vemprala, Aleksander Madry

    Abstract: Missingness, or the absence of features from an input, is a concept fundamental to many model debugging tools. However, in computer vision, pixels cannot simply be removed from an image. One thus tends to resort to heuristics such as blacking out pixels, which may in turn introduce bias into the debugging process. We study such biases and, in particular, show how transformer-based architectures ca… ▽ More

    Submitted 13 June, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Published at ICLR 2022

  13. arXiv:2203.15788  [pdf, other

    cs.RO

    COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems

    Authors: Shuang Ma, Sai Vemprala, Wenshan Wang, Jayesh K. Gupta, Yale Song, Daniel McDuff, Ashish Kapoor

    Abstract: Learning representations that generalize across tasks and domains is challenging yet necessary for autonomous systems. Although task-driven approaches are appealing, designing models specific to each application can be difficult in the face of limited data, especially when dealing with highly variable multimodal input spaces arising from different tasks in different environments.We introduce the f… ▽ More

    Submitted 19 February, 2022; originally announced March 2022.

  14. arXiv:2106.13364  [pdf, other

    cs.AI cs.CV cs.LG

    CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning

    Authors: Daniel McDuff, Yale Song, Jiyoung Lee, Vibhav Vineet, Sai Vemprala, Nicholas Gyde, Hadi Salman, Shuang Ma, Kwanghoon Sohn, Ashish Kapoor

    Abstract: The ability to perform causal and counterfactual reasoning are central properties of human intelligence. Decision-making systems that can perform these types of reasoning have the potential to be more generalizable and interpretable. Simulations have helped advance the state-of-the-art in this domain, by providing the ability to systematically vary parameters (e.g., confounders) and generate examp… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  15. arXiv:2106.03805  [pdf, other

    cs.CV cs.LG stat.ML

    3DB: A Framework for Debugging Computer Vision Models

    Authors: Guillaume Leclerc, Hadi Salman, Andrew Ilyas, Sai Vemprala, Logan Engstrom, Vibhav Vineet, Kai Xiao, Pengchuan Zhang, Shibani Santurkar, Greg Yang, Ashish Kapoor, Aleksander Madry

    Abstract: We introduce 3DB: an extendable, unified framework for testing and debugging vision models using photorealistic simulation. We demonstrate, through a wide range of use cases, that 3DB allows users to discover vulnerabilities in computer vision systems and gain insights into how models make decisions. 3DB captures and generalizes many robustness analyses from prior work, and enables one to study th… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  16. arXiv:2103.00806  [pdf, other

    cs.CV cs.RO

    Representation Learning for Event-based Visuomotor Policies

    Authors: Sai Vemprala, Sami Mian, Ashish Kapoor

    Abstract: Event-based cameras are dynamic vision sensors that provide asynchronous measurements of changes in per-pixel brightness at a microsecond level. This makes them significantly faster than conventional frame-based cameras, and an appealing choice for high-speed navigation. While an interesting sensor modality, this asynchronously streamed event data poses a challenge for machine learning techniques… ▽ More

    Submitted 29 September, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  17. arXiv:2012.12235  [pdf, other

    cs.CV cs.LG

    Unadversarial Examples: Designing Objects for Robust Vision

    Authors: Hadi Salman, Andrew Ilyas, Logan Engstrom, Sai Vemprala, Aleksander Madry, Ashish Kapoor

    Abstract: We study a class of realistic computer vision settings wherein one can influence the design of the objects being recognized. We develop a framework that leverages this capability to significantly improve vision models' performance and robustness. This framework exploits the sensitivity of modern machine learning algorithms to input perturbations in order to design "robust objects," i.e., objects t… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  18. arXiv:2011.00095  [pdf, other

    cs.RO

    Adversarial Attacks on Optimization based Planners

    Authors: Sai Vemprala, Ashish Kapoor

    Abstract: Trajectory planning is a key piece in the algorithmic architecture of a robot. Trajectory planners typically use iterative optimization schemes for generating smooth trajectories that avoid collisions and are optimal for tracking given the robot's physical specifications. Starting from an initial estimate, the planners iteratively refine the solution so as to satisfy the desired constraints. In th… ▽ More

    Submitted 4 June, 2021; v1 submitted 30 October, 2020; originally announced November 2020.

    Comments: 7 pages. Presented at ICRA 2021

  19. arXiv:2003.05654  [pdf, other

    cs.RO cs.CV

    AirSim Drone Racing Lab

    Authors: Ratnesh Madaan, Nicholas Gyde, Sai Vemprala, Matthew Brown, Keiko Nagami, Tim Taubner, Eric Cristofalo, Davide Scaramuzza, Mac Schwager, Ashish Kapoor

    Abstract: Autonomous drone racing is a challenging research problem at the intersection of computer vision, planning, state estimation, and control. We introduce AirSim Drone Racing Lab, a simulation framework for enabling fast prototyping of algorithms for autonomy and enabling machine learning research in this domain, with the goal of reducing the time, money, and risks associated with field robotics. Our… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: 14 pages, 6 figures

  20. arXiv:2001.08198  [pdf, other

    cs.RO

    Safety Considerations in Deep Control Policies with Safety Barrier Certificates Under Uncertainty

    Authors: Tom Hirshberg, Sai Vemprala, Ashish Kapoor

    Abstract: Recent advances in Deep Machine Learning have shown promise in solving complex perception and control loops via methods such as reinforcement and imitation learning. However, guaranteeing safety for such learned deep policies has been a challenge due to issues such as partial observability and difficulties in characterizing the behavior of the neural networks. While a lot of emphasis in safe learn… ▽ More

    Submitted 2 March, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

  21. arXiv:1905.02648  [pdf, other

    cs.RO cs.MA

    Collaborative Localization for Micro Aerial Vehicles

    Authors: Sai Vemprala, Srikanth Saripalli

    Abstract: In this paper, we present a framework for performing collaborative localization for groups of micro aerial vehicles (MAV) that use vision based sensing. The vehicles are each assumed to be equipped with a forward-facing monocular camera, and to be capable of communicating with each other. This collaborative localization approach is developed as a decentralized algorithm and built in a distributed… ▽ More

    Submitted 10 May, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: Supplementary video at https://www.youtube.com/watch?v=LvaTOWuTOPo

  22. arXiv:1808.00259  [pdf, other

    cs.RO

    Drone Detection Using Depth Maps

    Authors: Adrian Carrio, Sai Vemprala, Andres Ripoll, Srikanth Saripalli, Pascual Campoy

    Abstract: Obstacle avoidance is a key feature for safe Unmanned Aerial Vehicle (UAV) navigation. While solutions have been proposed for static obstacle avoidance, systems enabling avoidance of dynamic objects, such as drones, are hard to implement due to the detection range and field-of-view (FOV) requirements, as well as the constraints for integrating such systems on-board small UAVs. In this work, a data… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: Accepted at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018), Madrid, Spain

  23. arXiv:1804.02510  [pdf, other

    cs.RO

    Monocular Vision based Collaborative Localization for Micro Aerial Vehicle Swarms

    Authors: Sai Vemprala, Srikanth Saripalli

    Abstract: In this paper, we present a vision based collaborative localization framework for groups of micro aerial vehicles (MAV). The vehicles are each assumed to be equipped with a forward-facing monocular camera, and to be capable of communicating with each other. This collaborative localization approach is built upon a distributed algorithm where individual and relative pose estimation techniques are co… ▽ More

    Submitted 7 April, 2018; originally announced April 2018.

    Comments: 9 pages, 12 figures, IEEE conference format