Zum Hauptinhalt springen

Showing 1–50 of 210 results for author: Kochenderfer, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.21126  [pdf, other

    cs.CV cs.RO

    Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving

    Authors: Bernard Lange, Masha Itkina, Jiachen Li, Mykel J. Kochenderfer

    Abstract: Environment prediction frameworks are critical for the safe navigation of autonomous vehicles (AVs) in dynamic settings. LiDAR-generated occupancy grid maps (L-OGMs) offer a robust bird's-eye view for the scene representation, enabling self-supervised joint scene predictions while exhibiting resilience to partial observability and perception detection failures. Prior approaches have focused on det… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2407.16223  [pdf, ps, other

    cs.RO cs.CV

    Probabilistic Parameter Estimators and Calibration Metrics for Pose Estimation from Image Features

    Authors: Romeo Valentin, Sydney M. Katz, Joonghyun Lee, Don Walker, Matthew Sorgenfrei, Mykel J. Kochenderfer

    Abstract: This paper addresses the challenge of probabilistic parameter estimation given measurement uncertainty in real-time. We provide a general formulation and apply this to pose estimation for an autonomous visual landing system. We present three probabilistic parameter estimators: a least-squares sampling approach, a linear approximation method, and a probabilistic programming estimator. To evaluate t… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted at DASC '24. 9 pages, 4 figures

  3. arXiv:2407.15839  [pdf, other

    cs.RO cs.AI

    Importance Sampling-Guided Meta-Training for Intelligent Agents in Highly Interactive Environments

    Authors: Mansur Arief, Mike Timmerman, Jiachen Li, David Isele, Mykel J Kochenderfer

    Abstract: Training intelligent agents to navigate highly interactive environments presents significant challenges. While guided meta reinforcement learning (RL) approach that first trains a guiding policy to train the ego agent has proven effective in improving generalizability across various levels of interaction, the state-of-the-art method tends to be overly sensitive to extreme cases, impairing the agen… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  4. arXiv:2407.14981  [pdf, other

    cs.CY

    Open Problems in Technical AI Governance

    Authors: Anka Reuel, Ben Bucknall, Stephen Casper, Tim Fist, Lisa Soder, Onni Aarne, Lewis Hammond, Lujain Ibrahim, Alan Chan, Peter Wills, Markus Anderljung, Ben Garfinkel, Lennart Heim, Andrew Trask, Gabriel Mukobi, Rylan Schaeffer, Mauricio Baker, Sara Hooker, Irene Solaiman, Alexandra Sasha Luccioni, Nitarshan Rajkumar, Nicolas Moës, Jeffrey Ladish, Neel Guha, Jessica Newman , et al. (6 additional authors not shown)

    Abstract: AI progress is creating a growing range of risks and opportunities, but it is often unclear how they should be navigated. In many cases, the barriers and uncertainties faced are at least partly technical. Technical AI governance, referring to technical analysis and tools for supporting the effective governance of AI, seeks to address such challenges. It can help to (a) identify areas where interve… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: Ben Bucknall and Anka Reuel contributed equally and share the first author position

  5. arXiv:2407.09447  [pdf, other

    cs.CL

    ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts

    Authors: Amelia F. Hardy, Houjun Liu, Bernard Lange, Mykel J. Kochenderfer

    Abstract: Typical schemes for automated red-teaming large language models (LLMs) focus on discovering prompts that trigger a frozen language model (the defender) to generate toxic text. This often results in the prompting model (the adversary) producing text that is unintelligible and unlikely to arise. Here, we propose a reinforcement learning formulation of the LLM red-teaming task which allows us to disc… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 9 pages, 2 tables, 2 figures

  6. arXiv:2406.17189  [pdf, ps, other

    eess.SY cs.MA

    Hierarchical Framework for Optimizing Wildfire Surveillance and Suppression using Human-Autonomous Teaming

    Authors: Mahdi Al-Husseini, Kyle Wray, Mykel Kochenderfer

    Abstract: The integration of manned and unmanned aircraft can help improve wildfire response. Wildfire containment failures occur when resources available to first responders, who execute the initial stages of wildfire management referred to as the initial attack, are ineffective or insufficient. Initial attack surveillance and suppression models have linked action spaces and objectives, making their optimi… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  7. arXiv:2406.14761  [pdf, other

    cs.RO cs.AI eess.SY

    Diffusion-Based Failure Sampling for Cyber-Physical Systems

    Authors: Harrison Delecki, Marc R. Schlichting, Mansur Arief, Anthony Corso, Marcell Vazquez-Chanlatte, Mykel J. Kochenderfer

    Abstract: Validating safety-critical autonomous systems in high-dimensional domains such as robotics presents a significant challenge. Existing black-box approaches based on Markov chain Monte Carlo may require an enormous number of samples, while methods based on importance sampling often rely on simple parametric families that may struggle to represent the distribution over failures. We propose to sample… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review at RA-L

  8. arXiv:2405.19570  [pdf, other

    cs.MA cs.RO

    Distributed Online Planning for Min-Max Problems in Networked Markov Games

    Authors: Alexandros E. Tzikas, Jinkyoo Park, Mykel J. Kochenderfer, Ross E. Allen

    Abstract: Min-max problems are important in multi-agent sequential decision-making because they improve the performance of the worst-performing agent in the network. However, solving the multi-agent min-max problem is challenging. We propose a modular, distributed, online planning-based algorithm that is able to approximate the solution of the min-max objective in networked Markov games, assuming that the a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to appear in the IEEE Robotics and Automation Letters

  9. arXiv:2405.09822  [pdf, other

    cs.RO

    SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

    Authors: Muhammad Fadhil Ginting, Sung-Kyun Kim, David D. Fan, Matteo Palieri, Mykel J. Kochenderfer, Ali-akbar Agha-Mohammadi

    Abstract: This paper addresses the problem of object-goal navigation in autonomous inspections in real-world environments. Object-goal navigation is crucial to enable effective inspections in various settings, often requiring the robot to identify the target object within a large search space. Current object inspection methods fall short of human efficiency because they typically cannot bootstrap prior and… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  10. arXiv:2405.00644  [pdf, other

    cs.AI

    ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

    Authors: Robert J. Moss, Arec Jamgochian, Johannes Fischer, Anthony Corso, Mykel J. Kochenderfer

    Abstract: To plan safely in uncertain environments, agents must balance utility with safety constraints. Safe planning problems can be modeled as a chance-constrained partially observable Markov decision process (CC-POMDP) and solutions often use expensive rollouts or heuristics to estimate the optimal value and action-selection policy. This work introduces the ConstrainedZero policy iteration algorithm tha… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: In Proceedings of the 2024 International Joint Conference on Artificial Intelligence (IJCAI)

  11. arXiv:2404.18374  [pdf, other

    cs.RO

    Trajectory Optimization for Adaptive Informative Path Planning with Multimodal Sensing

    Authors: Joshua Ott, Edward Balaban, Mykel Kochenderfer

    Abstract: We consider the problem of an autonomous agent equipped with multiple sensors, each with different sensing precision and energy costs. The agent's goal is to explore the environment and gather information subject to its resource constraints in unknown, partially observable environments. The challenge lies in reasoning about the effects of sensing and movement while respecting the agent's resource… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: IEEE International Conference on Control, Decision and Information Technologies

  12. arXiv:2404.06940  [pdf, other

    cs.RO

    Learning-based Methods for Adaptive Informative Path Planning

    Authors: Marija Popovic, Joshua Ott, Julius Rückin, Mykel J. Kochenderfer

    Abstract: Adaptive informative path planning (AIPP) is important to many robotics applications, enabling mobile robots to efficiently collect useful data about initially unknown environments. In addition, learning-based methods are increasingly used in robotics to enhance adaptability, versatility, and robustness across diverse and complex tasks. Our survey explores research on applying robotic learning to… ▽ More

    Submitted 23 July, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 24 pages, 3 figures

  13. arXiv:2403.17358  [pdf, other

    cs.AI

    Addressing Myopic Constrained POMDP Planning with Recursive Dual Ascent

    Authors: Paula Stocco, Suhas Chundi, Arec Jamgochian, Mykel J. Kochenderfer

    Abstract: Lagrangian-guided Monte Carlo tree search with global dual ascent has been applied to solve large constrained partially observable Markov decision processes (CPOMDPs) online. In this work, we demonstrate that these global dual parameters can lead to myopic action selection during exploration, ultimately leading to suboptimal decision making. To address this, we introduce history-dependent dual var… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to the 2024 International Conference on Automated Planning and Scheduling (ICAPS)

  14. arXiv:2402.09388  [pdf, other

    cs.AI

    Entropy-regularized Point-based Value Iteration

    Authors: Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

    Abstract: Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty because they rely on an exact model and tend to commit to a single optimal behavior. Inspired by results in the model-free setting, we propose an entropy-regulari… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  15. arXiv:2402.08841  [pdf, other

    cs.RO

    Approximate Sequential Optimization for Informative Path Planning

    Authors: Joshua Ott, Mykel J. Kochenderfer, Stephen Boyd

    Abstract: We consider the problem of finding an informative path through a graph, given initial and terminal nodes and a given maximum path length. We assume that a linear noise corrupted measurement is taken at each node of an underlying unknown vector that we wish to estimate. The informativeness is measured by the reduction in uncertainty in our estimate, evaluated using several metrics. We present a con… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  16. arXiv:2401.17191  [pdf, other

    cs.RO

    Semantic Belief Behavior Graph: Enabling Autonomous Robot Inspection in Unknown Environments

    Authors: Muhammad Fadhil Ginting, David D. Fan, Sung-Kyun Kim, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

    Abstract: This paper addresses the problem of autonomous robotic inspection in complex and unknown environments. This capability is crucial for efficient and precise inspections in various real-world scenarios, even when faced with perceptual uncertainty and lack of prior knowledge of the environment. Existing methods for real-world autonomous inspections typically rely on predefined targets and waypoints a… ▽ More

    Submitted 9 July, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  17. arXiv:2401.15838  [pdf, other

    stat.ML cs.LG cs.MA math.OC stat.CO

    Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

    Authors: Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

    Abstract: Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literatur… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  18. arXiv:2401.12275  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation

    Authors: Jiachen Li, Chuanbo Hua, Hengbo Ma, Jinkyoo Park, Victoria Dax, Mykel J. Kochenderfer

    Abstract: Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning. While modeling pairwise relations has been widely studied in multi-agent interacting systems, the ability to capture larger-scale group-wise activities is limited. In this paper, we propose a systematic relational reasoning approach with explicit in… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 19 pages, 8 figures, 6 tables

  19. arXiv:2401.10949  [pdf, ps, other

    cs.MA cs.LG eess.SY

    The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning

    Authors: Ali Baheri, Mykel J. Kochenderfer

    Abstract: This paper explores the integration of optimal transport (OT) theory with multi-agent reinforcement learning (MARL). This integration uses OT to handle distributions and transportation problems to enhance the efficiency, coordination, and adaptability of MARL. There are five key areas where OT can impact MARL: (1) policy alignment, where OT's Wasserstein metric is used to align divergent agent str… ▽ More

    Submitted 24 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  20. arXiv:2401.05610  [pdf, other

    cs.LG cs.AI

    Graph Q-Learning for Combinatorial Optimization

    Authors: Victoria M. Dax, Jiachen Li, Kevin Leahy, Mykel J. Kochenderfer

    Abstract: Graph-structured data is ubiquitous throughout natural and social sciences, and Graph Neural Networks (GNNs) have recently been shown to be effective at solving prediction and inference problems on graph data. In this paper, we propose and demonstrate that GNNs can be applied to solve Combinatorial Optimization (CO) problems. CO concerns optimizing a function over a discrete solution space that is… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Journal ref: GLIndA Workshop NeurIPS 2022

  21. Disentangled Neural Relational Inference for Interpretable Motion Prediction

    Authors: Victoria M. Dax, Jiachen Li, Enna Sachdeva, Nakul Agarwal, Mykel J. Kochenderfer

    Abstract: Effective interaction modeling and behavior prediction of dynamic agents play a significant role in interactive motion planning for autonomous robots. Although existing methods have improved prediction accuracy, few research efforts have been devoted to enhancing prediction model interpretability and out-of-distribution (OOD) generalizability. This work addresses these two challenging aspects by d… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Journal ref: IEEE Robotics and Automation Letters, Date: FEBRUARY 2024 , Volume: 9, Issue: 2, ISSN: 2377-3766, pp1452-1459

  22. arXiv:2311.16091  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation

    Authors: Jiachen Li, David Isele, Kanghoon Lee, Jinkyoo Park, Kikuo Fujimura, Mykel J. Kochenderfer

    Abstract: Deep reinforcement learning (DRL) provides a promising way for intelligent agents (e.g., autonomous vehicles) to learn to navigate complex scenarios. However, DRL with neural networks as function approximators is typically considered a black box with little explainability and often suffers from suboptimal performance, especially for autonomous navigation in highly interactive multi-agent environme… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 18 pages, 14 figures

  23. arXiv:2311.00192  [pdf, other

    cs.RO cs.AI

    Large-Scale Multi-Robot Assembly Planning for Autonomous Manufacturing

    Authors: Kyle Brown, Dylan M. Asmar, Mac Schwager, Mykel J. Kochenderfer

    Abstract: Mobile autonomous robots have the potential to revolutionize manufacturing processes. However, employing large robot fleets in manufacturing requires addressing challenges including collision-free movement in a shared workspace, effective multi-robot collaboration to manipulate and transport large payloads, complex task allocation due to coupled manufacturing processes, and spatial planning for pa… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Repository: https://github.com/sisl/ConstructionBots.jl. Under review

  24. arXiv:2310.20054  [pdf, other

    cs.AI cs.RO

    Constrained Hierarchical Monte Carlo Belief-State Planning

    Authors: Arec Jamgochian, Hugo Buurmeijer, Kyle H. Wray, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Optimal plans in Constrained Partially Observable Markov Decision Processes (CPOMDPs) maximize reward objectives while satisfying hard cost constraints, generalizing safe planning under state and transition uncertainty. Unfortunately, online CPOMDP planning is extremely difficult in large or continuous problem domains. In many large robotic domains, hierarchical decomposition can simplify planning… ▽ More

    Submitted 26 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  25. arXiv:2310.01723  [pdf, other

    cs.RO

    Predicting Future Spatiotemporal Occupancy Grids with Semantics for Autonomous Driving

    Authors: Maneekwan Toyungyernsub, Esen Yel, Jiachen Li, Mykel J. Kochenderfer

    Abstract: For autonomous vehicles to proactively plan safe trajectories and make informed decisions, they must be able to predict the future occupancy states of the local environment. However, common issues with occupancy prediction include predictions where moving objects vanish or become blurred, particularly at longer time horizons. We propose an environment prediction framework that incorporates environ… ▽ More

    Submitted 11 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted at the 35th IEEE Intelligent Vehicles Symposium 2024

  26. arXiv:2309.13893  [pdf, other

    cs.RO cs.AI cs.CV

    Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments

    Authors: Bernard Lange, Jiachen Li, Mykel J. Kochenderfer

    Abstract: Navigating complex and dynamic environments requires autonomous vehicles (AVs) to reason about both visible and occluded regions. This involves predicting the future motion of observed agents, inferring occluded ones, and modeling their interactions based on vectorized scene representations of the partially observable environment. However, prior work on occlusion inference and trajectory predictio… ▽ More

    Submitted 8 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)

  27. arXiv:2309.12474  [pdf, other

    cs.RO cs.AI cs.CY cs.ET eess.SY

    SAVME: Efficient Safety Validation for Autonomous Systems Using Meta-Learning

    Authors: Marc R. Schlichting, Nina V. Boord, Anthony L. Corso, Mykel J. Kochenderfer

    Abstract: Discovering potential failures of an autonomous system is important prior to deployment. Falsification-based methods are often used to assess the safety of such systems, but the cost of running many accurate simulation can be high. The validation can be accelerated by identifying critical failure scenarios for the system under test and by reducing the simulation runtime. We propose a Bayesian appr… ▽ More

    Submitted 30 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted for ITSC 2023

  28. arXiv:2309.08778  [pdf, other

    cs.LO cs.MS

    Satisfiability.jl: Satisfiability Modulo Theories in Julia

    Authors: Emiko Soroka, Mykel J. Kochenderfer, Sanjay Lall

    Abstract: Satisfiability modulo theories (SMT) is a core tool in formal verification. While the SMT-LIB specification language can be used to interact with theorem proving software, a high-level interface allows for faster and easier specifications of complex SMT formulae. In this paper we present a novel open-source package for interacting with SMT-LIB compliant solvers in the Julia programming language.

    Submitted 15 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 7 pages, revised from a previous longer version to comply with a conference length requirement. Submitted to NASA Formal Methods 2024

    ACM Class: D.2.4

  29. arXiv:2309.06597  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

    Authors: Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel Kochenderfer, Chiho Choi, Behzad Dariush

    Abstract: The widespread adoption of commercial autonomous vehicles (AVs) and advanced driver assistance systems (ADAS) may largely depend on their acceptance by society, for which their perceived trustworthiness and interpretability to riders are crucial. In general, this task is challenging because modern autonomous systems software relies heavily on black-box artificial intelligence models. Towards this… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  30. arXiv:2307.10586  [pdf, other

    cs.LG

    A Holistic Assessment of the Reliability of Machine Learning Systems

    Authors: Anthony Corso, David Karamadian, Romeo Valentin, Mary Cooper, Mykel J. Kochenderfer

    Abstract: As machine learning (ML) systems increasingly permeate high-stakes settings such as healthcare, transportation, military, and national security, concerns regarding their reliability have emerged. Despite notable progress, the performance of these systems can significantly diminish due to adversarial attacks or environmental changes, leading to overconfident predictions, failures to detect input fa… ▽ More

    Submitted 29 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  31. arXiv:2307.10160  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Robust Driving Policy Learning with Guided Meta Reinforcement Learning

    Authors: Kanghoon Lee, Jiachen Li, David Isele, Jinkyoo Park, Kikuo Fujimura, Mykel J. Kochenderfer

    Abstract: Although deep reinforcement learning (DRL) has shown promising results for autonomous navigation in interactive traffic scenarios, existing work typically adopts a fixed behavior policy to control social vehicles in the training environment. This may cause the learned driving policy to overfit the environment, making it difficult to interact well with vehicles with different, unseen behaviors. In… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: ITSC 2023

  32. arXiv:2307.01371  [pdf, other

    cs.RO cs.AI

    Efficient Determination of Safety Requirements for Perception Systems

    Authors: Sydney M. Katz, Anthony L. Corso, Esen Yel, Mykel J. Kochenderfer

    Abstract: Perception systems operate as a subcomponent of the general autonomy stack, and perception system designers often need to optimize performance characteristics while maintaining safety with respect to the overall closed-loop system. For this reason, it is useful to distill high-level safety requirements into component-level requirements on the perception system. In this work, we focus on efficientl… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 10 pages, 14 figures, submitted to the 2023 Digital Avionics Systems Conference

  33. arXiv:2306.15605  [pdf, other

    cs.RO

    Deep Normalizing Flows for State Estimation

    Authors: Harrison Delecki, Liam A. Kruse, Marc R. Schlichting, Mykel J. Kochenderfer

    Abstract: Safe and reliable state estimation techniques are a critical component of next-generation robotic systems. Agents in such systems must be able to reason about the intentions and trajectories of other agents for safe and efficient motion planning. However, classical state estimation techniques such as Gaussian filters often lack the expressive power to represent complex underlying distributions, es… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted to FUSION 2023

  34. arXiv:2306.11203  [pdf, other

    cs.CV cs.LG

    AVOIDDS: Aircraft Vision-based Intruder Detection Dataset and Simulator

    Authors: Elysia Q. Smyers, Sydney M. Katz, Anthony L. Corso, Mykel J. Kochenderfer

    Abstract: Designing robust machine learning systems remains an open problem, and there is a need for benchmark problems that cover both environmental changes and evaluation on a downstream task. In this work, we introduce AVOIDDS, a realistic object detection benchmark for the vision-based aircraft detect-and-avoid problem. We provide a labeled dataset consisting of 72,000 photorealistic images of intruder… ▽ More

    Submitted 26 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted to and presented at NeurIPS 2023, Datasets and Benchmarks Track; fixed link formatting in the abstract

  35. arXiv:2306.00249  [pdf, other

    cs.AI

    BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations

    Authors: Robert J. Moss, Anthony Corso, Jef Caers, Mykel J. Kochenderfer

    Abstract: Real-world planning problems, including autonomous driving and sustainable energy applications like carbon storage and resource exploration, have recently been modeled as partially observable Markov decision processes (POMDPs) and solved using approximate methods. To solve high-dimensional POMDPs in practice, state-of-the-art methods use online planning with problem-specific heuristics to reduce p… ▽ More

    Submitted 31 July, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Presented at the Reinforcement Learning Conference (RLC) 2024

    Journal ref: RLJ, Volume 1, Issue (Number) 1, 2024

  36. arXiv:2305.18633  [pdf, other

    cs.RO

    Experience Filter: Using Past Experiences on Unseen Tasks or Environments

    Authors: Anil Yildiz, Esen Yel, Anthony L. Corso, Kyle H. Wray, Stefan J. Witwicki, Mykel J. Kochenderfer

    Abstract: One of the bottlenecks of training autonomous vehicle (AV) agents is the variability of training environments. Since learning optimal policies for unseen environments is often very costly and requires substantial data collection, it becomes computationally intractable to train the agent on every possible environment or task the AV may encounter. This paper introduces a zero-shot filtering approach… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted at IEEE Intelligent Vehicles Symposium (IV) 2023

  37. arXiv:2305.09930  [pdf, other

    cs.RO cs.LG stat.ML

    Model-based Validation as Probabilistic Inference

    Authors: Harrison Delecki, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Estimating the distribution over failures is a key step in validating autonomous systems. Existing approaches focus on finding failures for a small range of initial conditions or make restrictive assumptions about the properties of the system under test. We frame estimating the distribution over failure trajectories for sequential systems as Bayesian inference. Our model-based approach represents… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

  38. arXiv:2305.02449  [pdf, other

    cs.LG stat.AP

    Bayesian Safety Validation for Failure Probability Estimation of Black-Box Systems

    Authors: Robert J. Moss, Mykel J. Kochenderfer, Maxime Gariel, Arthur Dubois

    Abstract: Estimating the probability of failure is an important step in the certification of safety-critical systems. Efficient estimation methods are often needed due to the challenges posed by high-dimensional input spaces, risky test scenarios, and computationally expensive simulators. This work frames the problem of black-box safety validation as a Bayesian optimization problem and introduces a method t… ▽ More

    Submitted 29 June, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

    Journal ref: AIAA Journal of Aerospace Information Systems (JAIS) 21.7 (2024): 533-546

  39. arXiv:2304.09352  [pdf, other

    cs.AI eess.SY physics.flu-dyn

    Optimizing Carbon Storage Operations for Long-Term Safety

    Authors: Yizheng Wang, Markus Zechner, Gege Wen, Anthony Louis Corso, John Michael Mern, Mykel J. Kochenderfer, Jef Karel Caers

    Abstract: To combat global warming and mitigate the risks associated with climate change, carbon capture and storage (CCS) has emerged as a crucial technology. However, safely sequestering CO2 in geological formations for long-term storage presents several challenges. In this study, we address these issues by modeling the decision-making process for carbon storage operations as a partially observable Markov… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  40. arXiv:2304.00645  [pdf, other

    cs.RO

    Safe and Efficient Navigation in Extreme Environments using Semantic Belief Graphs

    Authors: Muhammad Fadhil Ginting, Sung-Kyun Kim, Oriana Peltzer, Joshua Ott, Sunggoo Jung, Mykel J. Kochenderfer, Ali-akbar Agha-mohammadi

    Abstract: To achieve autonomy in unknown and unstructured environments, we propose a method for semantic-based planning under perceptual uncertainty. This capability is crucial for safe and efficient robot navigation in environment with mobility-stressing elements that require terrain-specific locomotion policies. We propose the Semantic Belief Graph (SBG), a geometric- and semantic-based representation of… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

  41. arXiv:2303.09981  [pdf, other

    cs.LG

    Inferring Traffic Models in Terminal Airspace from Flight Tracks and Procedures

    Authors: Soyeon Jung, Mykel J. Kochenderfer

    Abstract: Realistic aircraft trajectory models are useful in the design and validation of air traffic management (ATM) systems. Models of aircraft operated under instrument flight rules (IFR) require capturing the variability inherent in how aircraft follow standard flight procedures. The variability in aircraft behavior varies among flight stages. In this paper, we propose a probabilistic model that can le… ▽ More

    Submitted 30 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

  42. arXiv:2303.03530  [pdf, other

    cs.RO

    Incorporating Human Path Preferences in Robot Navigation with Minimal Interventions

    Authors: Oriana Peltzer, Dylan M. Asmar, Mac Schwager, Mykel J. Kochenderfer

    Abstract: Robots that can effectively understand human intentions from actions are crucial for successful human-robot collaboration. In this work, we address the challenge of a robot navigating towards an unknown goal while also accounting for a human's preference for a particular path in the presence of obstacles. This problem is particularly challenging when both the goal and path preference are unknown a… ▽ More

    Submitted 15 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  43. arXiv:2302.03164  [pdf, other

    cs.RO

    Adaptive Coverage Path Planning for Efficient Exploration of Unknown Environments

    Authors: Amanda Bouman, Joshua Ott, Sung-Kyun Kim, Kenny Chen, Mykel J. Kochenderfer, Brett Lopez, Ali-akbar Agha-mohammadi, Joel Burdick

    Abstract: We present a method for solving the coverage problem with the objective of autonomously exploring an unknown environment under mission time constraints. Here, the robot is tasked with planning a path over a horizon such that the accumulated area swept out by its sensor footprint is maximized. Because this problem exhibits a diminishing returns property known as submodularity, we choose to formulat… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  44. arXiv:2301.02362  [pdf, other

    cs.RO

    Fast and Scalable Signal Inference for Active Robotic Source Seeking

    Authors: Christopher E. Denniston, Oriana Peltzer, Joshua Ott, Sangwoo Moon, Sung-Kyun Kim, Gaurav S. Sukhatme, Mykel J. Kochenderfer, Mac Schwager, Ali-akbar Agha-mohammadi

    Abstract: In active source seeking, a robot takes repeated measurements in order to locate a signal source in a cluttered and unknown environment. A key component of an active source seeking robot planner is a model that can produce estimates of the signal at unknown locations with uncertainty quantification. This model allows the robot to plan for future measurements in the environment. Traditionally, this… ▽ More

    Submitted 17 April, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 6 pages, Submitted to ICRA 2023 - Contains Appendix

  45. arXiv:2212.14118  [pdf, other

    eess.SY cs.LG

    Falsification of Learning-Based Controllers through Multi-Fidelity Bayesian Optimization

    Authors: Zahra Shahrooei, Mykel J. Kochenderfer, Ali Baheri

    Abstract: Simulation-based falsification is a practical testing method to increase confidence that the system will meet safety requirements. Because full-fidelity simulations can be computationally demanding, we investigate the use of simulators with different levels of fidelity. As a first step, we express the overall safety specification in terms of environmental parameters and structure this safety speci… ▽ More

    Submitted 28 April, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: 7 pages, 8 figures, Accepted for the 2023 European Control Conference (ECC)

  46. arXiv:2212.12154  [pdf, other

    cs.AI

    Online Planning for Constrained POMDPs with Continuous Spaces through Dual Ascent

    Authors: Arec Jamgochian, Anthony Corso, Mykel J. Kochenderfer

    Abstract: Rather than augmenting rewards with penalties for undesired behavior, Constrained Partially Observable Markov Decision Processes (CPOMDPs) plan safely by imposing inviolable hard constraint value budgets. Previous work performing online planning for CPOMDPs has only been applied to discrete action and observation spaces. In this work, we propose algorithms for online CPOMDP planning for continuous… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Submitted to ICAPS-23

  47. arXiv:2212.00669  [pdf, other

    physics.geo-ph cs.AI

    A POMDP Model for Safe Geological Carbon Sequestration

    Authors: Anthony Corso, Yizheng Wang, Markus Zechner, Jef Caers, Mykel J. Kochenderfer

    Abstract: Geological carbon capture and sequestration (CCS), where CO$_2$ is stored in subsurface formations, is a promising and scalable approach for reducing global emissions. However, if done incorrectly, it may lead to earthquakes and leakage of CO$_2$ back to the surface, harming both humans and the environment. These risks are exacerbated by the large amount of uncertainty in the structure of the stor… ▽ More

    Submitted 24 October, 2022; originally announced December 2022.

    Comments: Accepted at NeurIPS 2022 Workshop on Tackling Climate Change with Machine Learning

  48. arXiv:2211.12470  [pdf, ps, other

    cs.LG cs.AI

    A Deep Reinforcement Learning Approach to Rare Event Estimation

    Authors: Anthony Corso, Kyu-Young Kim, Shubh Gupta, Grace Gao, Mykel J. Kochenderfer

    Abstract: An important step in the design of autonomous systems is to evaluate the probability that a failure will occur. In safety-critical domains, the failure probability is extremely small so that the evaluation of a policy through Monte Carlo sampling is inefficient. Adaptive importance sampling approaches have been developed for rare event estimation but do not scale well to sequential systems with lo… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  49. arXiv:2211.08701  [pdf, other

    cs.RO cs.CV cs.LG

    Interpretable Self-Aware Neural Networks for Robust Trajectory Prediction

    Authors: Masha Itkina, Mykel J. Kochenderfer

    Abstract: Although neural networks have seen tremendous success as predictive models in a variety of domains, they can be overly confident in their predictions on out-of-distribution (OOD) data. To be viable for safety-critical applications, like autonomous vehicles, neural networks must accurately estimate their epistemic or model uncertainty, achieving a level of system self-awareness. Techniques for epis… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Conference on Robot Learning (CoRL) 2022, 15 pages, 4 figures

    ACM Class: I.2.9; I.2.6; I.2.10

  50. arXiv:2210.17540  [pdf, other

    cs.LG cs.MA

    Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning

    Authors: Jennifer She, Jayesh K. Gupta, Mykel J. Kochenderfer

    Abstract: Sparse and delayed rewards pose a challenge to single agent reinforcement learning. This challenge is amplified in multi-agent reinforcement learning (MARL) where credit assignment of these rewards needs to happen not only across time, but also across agents. We propose Agent-Time Attention (ATA), a neural network model with auxiliary losses for redistributing sparse and delayed rewards in collabo… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Full version of the Extended Abstract accepted at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2022