Showing 1–12 of 12 results for author: Agia, C

  1. arXiv:2408.14769

    cs.RO

    Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics

    Authors: Yixuan Huang, Christopher Agia, Jimmy Wu, Tucker Hermans, Jeannette Bohg

    Abstract: We present Points2Plans, a framework for composable planning with a relational dynamics model that enables robots to solve long-horizon manipulation tasks from partial-view point clouds. Given a language instruction and a point cloud of the scene, our framework initiates a hierarchical planning procedure, whereby a language model generates a high-level plan and a sampling-based planner produces co…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: Under review

  2. arXiv:2408.06105

    cs.RO

    Text2Interaction: Establishing Safe and Preferable Human-Robot Interaction

    Authors: Jakob Thumm, Christopher Agia, Marco Pavone, Matthias Althoff

    Abstract: Adjusting robot behavior to human preferences can require intensive human feedback, preventing quick adaptation to new users and changing circumstances. Moreover, current approaches typically treat user preferences as a reward, which requires a manual balance between task success and user satisfaction. To integrate new user preferences in a zero-shot manner, our proposed Text2Interaction framework…

    Submitted 21 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  3. arXiv:2408.05924

    cs.RO cs.AI

    Adapting a Foundation Model for Space-based Tasks

    Authors: Matthew Foutter, Praneet Bhoj, Rohan Sinha, Amine Elhafsi, Somrita Banerjee, Christopher Agia, Justin Kruger, Tommaso Guffanti, Daniele Gammelli, Simone D'Amico, Marco Pavone

    Abstract: Foundation models, e.g., large language models, possess attributes of intelligence which offer promise to endow a robot with the contextual understanding necessary to navigate complex, unstructured tasks in the wild. In the future of space robotics, we see three core challenges which motivate the use of a foundation model adapted to space-based applications: 1) Scalability of ground-in-the-loop op…

    Submitted 12 August, 2024; originally announced August 2024.

  4. arXiv:2407.08735

    cs.RO cs.AI eess.SY

    Real-Time Anomaly Detection and Reactive Planning with Large Language Models

    Authors: Rohan Sinha, Amine Elhafsi, Christopher Agia, Matthew Foutter, Edward Schmerling, Marco Pavone

    Abstract: Foundation models, e.g., large language models (LLMs), trained on internet-scale data possess zero-shot generalization capabilities that make them a promising technology towards detecting and mitigating out-of-distribution failure modes of robotic systems. Fully realizing this promise, however, poses two challenges: (i) mitigating the considerable computational expense of these models such that th…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2024

  5. arXiv:2403.12945

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  6. arXiv:2401.11371

    cs.RO eess.SY

    Modeling Considerations for Developing Deep Space Autonomous Spacecraft and Simulators

    Authors: Christopher Agia, Guillem Casadesus Vila, Saptarshi Bandyopadhyay, David S. Bayard, Kar-Ming Cheung, Charles H. Lee, Eric Wood, Ian Aenishanslin, Steven Ardito, Lorraine Fesq, Marco Pavone, Issa A. D. Nesnas

    Abstract: To extend the limited scope of autonomy used in prior missions for operation in distant and complex environments, there is a need to further develop and mature autonomy that jointly reasons over multiple subsystems, which we term system-level autonomy. System-level autonomy establishes situational awareness that resolves conflicting information across subsystems, which may necessitate the refineme…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Project page: https://sites.google.com/stanford.edu/spacecraft-models. 20 pages, 8 figures. Accepted to the IEEE Conference on Aerospace (AeroConf) 2024

    ACM Class: I.2.8; I.2.9; I.6.1; I.6.3; I.6.4; I.6.6; J.2

  7. arXiv:2310.08864

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  8. arXiv:2305.11307

    cs.RO

    Semantic Anomaly Detection with Large Language Models

    Authors: Amine Elhafsi, Rohan Sinha, Christopher Agia, Edward Schmerling, Issa Nesnas, Marco Pavone

    Abstract: As robots acquire increasingly sophisticated skills and see increasingly complex and varied environments, the threat of an edge case or anomalous failure is ever present. For example, Tesla cars have seen interesting failure modes ranging from autopilot disengagements due to inactive traffic lights carried by trucks to phantom braking caused by images of stop signs on roadside billboards. These sy…

    Submitted 11 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  9. Text2Motion: From Natural Language Instructions to Feasible Plans

    Authors: Kevin Lin, Christopher Agia, Toki Migimatsu, Marco Pavone, Jeannette Bohg

    Abstract: We propose Text2Motion, a language-based planning framework enabling robots to solve sequential manipulation tasks that require long-horizon reasoning. Given a natural language instruction, our framework constructs both a task- and motion-level plan that is verified to reach inferred symbolic goals. Text2Motion uses feasibility heuristics encoded in Q-functions of a library of skills to guide task…

    Submitted 26 November, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Published in Autonomous Robots, Special Issue: Large Language Models in Robotics 2023. Project page: https://sites.google.com/stanford.edu/text2motion. First two authors contributed equally

  10. STAP: Sequencing Task-Agnostic Policies

    Authors: Christopher Agia, Toki Migimatsu, Jiajun Wu, Jeannette Bohg

    Abstract: Advances in robotic skill acquisition have made it possible to build general-purpose libraries of learned skills for downstream manipulation tasks. However, naively executing these skills one after the other is unlikely to succeed without accounting for dependencies between actions prevalent in long-horizon plans. We present Sequencing Task-Agnostic Policies (STAP), a scalable framework for traini…

    Submitted 31 May, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Video: https://drive.google.com/file/d/1zp3qFeZLACNPsGLLP7p6q9X1tuA_PGEo/view. Project page: https://sites.google.com/stanford.edu/stap. 12 pages, 7 figures. In proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2023. The first two authors contributed equally

  11. arXiv:2207.05006

    cs.RO cs.AI cs.LG

    TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs

    Authors: Christopher Agia, Krishna Murthy Jatavallabhula, Mohamed Khodeir, Ondrej Miksik, Vibhav Vineet, Mustafa Mukadam, Liam Paull, Florian Shkurti

    Abstract: 3D scene graphs (3DSGs) are an emerging description; unifying symbolic, topological, and metric scene representations. However, typical 3DSGs contain hundreds of objects and symbols even for small environments; rendering task planning on the full graph impractical. We construct TASKOGRAPHY, the first large-scale robotic task planning benchmark over 3DSGs. While most benchmarking efforts in this ar…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Video: https://www.youtube.com/watch?v=mM4v5hP4LdA&ab_channel=KrishnaMurthy . Project page: https://taskography.github.io/ . 18 pages, 7 figures. In proceedings of Conference on Robot Learning (CoRL) 2021. The first two authors contributed equally

    ACM Class: I.2.8; I.2.9; I.2.10; I.2.6

    Journal ref: PMLR 164 (2022) 46-58

  12. arXiv:2012.09242

    cs.CV cs.AI cs.LG cs.RO

    S3CNet: A Sparse Semantic Scene Completion Network for LiDAR Point Clouds

    Authors: Ran Cheng, Christopher Agia, Yuan Ren, Xinhai Li, Liu Bingbing

    Abstract: With the increasing reliance of self-driving and similar robotic systems on robust 3D vision, the processing of LiDAR scans with deep convolutional neural networks has become a trend in academia and industry alike. Prior attempts on the challenging Semantic Scene Completion task - which entails the inference of dense 3D structure and associated semantic labels from "sparse" representations - have…

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 14 pages

    Journal ref: PMLR 155 (2021) 2148-2161