Skip to main content

Showing 1–22 of 22 results for author: Janson, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17748  [pdf, other

    cs.LG math.OC stat.ML

    A New Perspective on Shampoo's Preconditioner

    Authors: Depen Morwani, Itai Shapira, Nikhil Vyas, Eran Malach, Sham Kakade, Lucas Janson

    Abstract: Shampoo, a second-order optimization algorithm which uses a Kronecker product preconditioner, has recently garnered increasing attention from the machine learning community. The preconditioner used by Shampoo can be viewed either as an approximation of the Gauss--Newton component of the Hessian or the covariance matrix of the gradients maintained by Adagrad. We provide an explicit and novel connec… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2402.11771  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Evaluating the Effectiveness of Index-Based Treatment Allocation

    Authors: Niclas Boehmer, Yash Nair, Sanket Shah, Lucas Janson, Aparna Taneja, Milind Tambe

    Abstract: When resources are scarce, an allocation policy is needed to decide who receives a resource. This problem occurs, for instance, when allocating scarce medical resources and is often solved using modern ML methods. This paper introduces methods to evaluate index-based allocation policies -- that allocate a fixed number of resources to those who need them the most -- by using data from a randomized… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  3. arXiv:2402.04933  [pdf, other

    cs.LG stat.AP

    A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health

    Authors: Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson

    Abstract: Public health programs often provide interventions to encourage beneficiary adherence,and effectively allocating interventions is vital for producing the greatest overall health outcomes. Such resource allocation problems are often modeled as restless multi-armed bandits (RMABs) with unknown underlying transition dynamics, hence requiring online reinforcement learning (RL). We present Bayesian Lea… ▽ More

    Submitted 27 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 26 pages, 18 figures

  4. arXiv:2302.13970  [pdf, other

    math.OC cs.CG eess.SY math.DG math.PR math.ST

    Estimating the Convex Hull of the Image of a Set with Smooth Boundary: Error Bounds and Applications

    Authors: Thomas Lew, Riccardo Bonalli, Lucas Janson, Marco Pavone

    Abstract: We study the problem of estimating the convex hull of the image $f(X)\subset\mathbb{R}^n$ of a compact set $X\subset\mathbb{R}^m$ with smooth boundary through a smooth function $f:\mathbb{R}^m\to\mathbb{R}^n$. Assuming that $f$ is a submersion, we derive a new bound on the Hausdorff distance between the convex hull of $f(X)$ and the convex hull of the images $f(x_i)$ of $M$ sampled inputs $x_i$ on… ▽ More

    Submitted 29 February, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 33 pages. Small changes to improve the clarity and presentation of results. Fixed Lemma 3.7

  5. arXiv:2202.07098  [pdf, ps, other

    cs.LG stat.ME

    Statistical Inference After Adaptive Sampling for Longitudinal Data

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: Online reinforcement learning and other adaptive sampling algorithms are increasingly used in digital intervention experiments to optimize treatment delivery for users over time. In this work, we focus on longitudinal user data collected by a large class of adaptive sampling algorithms that are designed to optimize treatment decisions online using accruing data from multiple users. Combining or "p… ▽ More

    Submitted 19 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Fixing typos

  6. arXiv:2202.05799  [pdf, ps, other

    cs.LG eess.SY math.ST

    Rate-matching the regret lower-bound in the linear quadratic regulator with unknown dynamics

    Authors: Feicheng Wang, Lucas Janson

    Abstract: The theory of reinforcement learning currently suffers from a mismatch between its empirical performance and the theoretical characterization of its performance, with consequences for, e.g., the understanding of sample efficiency, safety, and robustness. The linear quadratic regulator with unknown dynamics is a fundamental reinforcement learning setting with significant structure in its dynamics a… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

  7. arXiv:2112.05745  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    A Simple and Efficient Sampling-based Algorithm for General Reachability Analysis

    Authors: Thomas Lew, Lucas Janson, Riccardo Bonalli, Marco Pavone

    Abstract: In this work, we analyze an efficient sampling-based algorithm for general-purpose reachability analysis, which remains a notoriously challenging problem with applications ranging from neural network verification to safety analysis of dynamical systems. By sampling inputs, evaluating their images in the true reachable set, and taking their $ε$-padded convex hull as a set estimator, this algorithm… ▽ More

    Submitted 13 April, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: 4th Annual Learning for Dynamics & Control Conference (L4DC) 2022. Section V: added the assumption $\partial\mathcal{Y}\subseteq f(\partial\mathcal{X})$. If $\partial\mathcal{Y}\nsubseteq f(\partial\mathcal{X})$, then one should sample over the entire set $\mathcal{X}$ to obtain finite-sample bounds

  8. arXiv:2109.11234  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    The Role of Tactile Sensing in Learning and Deploying Grasp Refinement Algorithms

    Authors: Alexander Koenig, Zixi Liu, Lucas Janson, Robert Howe

    Abstract: A long-standing question in robot hand design is how accurate tactile sensing must be. This paper uses simulated tactile signals and the reinforcement learning (RL) framework to study the sensing needs in grasping systems. Our first experiment investigates the need for rich tactile sensing in the rewards of RL-based grasp refinement algorithms for multi-fingered robotic hands. We systematically in… ▽ More

    Submitted 27 March, 2022; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: paper currently under review, 7 pages, 10 figures, video: https://www.youtube.com/watch?v=WKhmOKPEYPc, code: https://github.com/axkoenig/grasp_refinement

  9. arXiv:2104.14074  [pdf, other

    cs.LG

    Statistical Inference with M-Estimators on Adaptively Collected Data

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: Bandit algorithms are increasingly used in real-world sequential decision-making problems. Associated with this is an increased desire to be able to use the resulting datasets to answer scientific questions like: Did one type of ad lead to more purchases? In which contexts is a mobile health intervention effective? However, classical statistical approaches fail to provide valid confidence interval… ▽ More

    Submitted 19 November, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Journal ref: Advances in Neural Information Processing Systems, 2021

  10. arXiv:2011.01364  [pdf, other

    cs.LG eess.SY math.ST

    Exact Asymptotics for Linear Quadratic Adaptive Control

    Authors: Feicheng Wang, Lucas Janson

    Abstract: Recent progress in reinforcement learning has led to remarkable performance in a range of applications, but its deployment in high-stakes settings remains quite rare. One reason is a limited understanding of the behavior of reinforcement algorithms, both in terms of their regret and their ability to learn the underlying system dynamics---existing work is focused almost exclusively on characterizin… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  11. arXiv:2007.12671  [pdf, other

    stat.ML cs.LG math.ST

    Cross-validation Confidence Intervals for Test Error

    Authors: Pierre Bayle, Alexandre Bayle, Lucas Janson, Lester Mackey

    Abstract: This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for $k$-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller $k$-fold test error than another.… ▽ More

    Submitted 31 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020); 40 pages, 15 figures

  12. arXiv:2002.03217  [pdf, other

    cs.LG stat.ML

    Inference for Batched Bandits

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: As bandit algorithms are increasingly utilized in scientific studies and industrial applications, there is an associated increasing need for reliable inference methods based on the resulting adaptively-collected data. In this work, we develop methods for inference on data collected in batches using a bandit algorithm. We first prove that the ordinary least squares estimator (OLS), which is asympto… ▽ More

    Submitted 8 January, 2021; v1 submitted 8 February, 2020; originally announced February 2020.

    Journal ref: NeurIPS 2020

  13. arXiv:1910.08184  [pdf, other

    cs.RO eess.SY

    Map-Predictive Motion Planning in Unknown Environments

    Authors: Amine Elhafsi, Boris Ivanovic, Lucas Janson, Marco Pavone

    Abstract: Algorithms for motion planning in unknown environments are generally limited in their ability to reason about the structure of the unobserved environment. As such, current methods generally navigate unknown environments by relying on heuristic methods to choose intermediate objectives along frontiers. We present a unified method that combines map prediction and motion planning for safe, time-effic… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

  14. arXiv:1909.09688  [pdf, other

    cs.RO math.OC

    Revisiting the Asymptotic Optimality of RRT$^*$

    Authors: Kiril Solovey, Lucas Janson, Edward Schmerling, Emilio Frazzoli, Marco Pavone

    Abstract: RRT* is one of the most widely used sampling-based algorithms for asymptotically-optimal motion planning. This algorithm laid the foundations for optimality in motion planning as a whole, and inspired the development of numerous new algorithms in the field, many of which build upon RRT* itself. In this paper, we first identify a logical gap in the optimality proof of RRT*, which was developed in K… ▽ More

    Submitted 21 April, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: To appear in ICRA2020. This version includes a detailed counterexample that is not present in the conference version

  15. arXiv:1804.05804  [pdf, other

    cs.RO cs.AI

    Safe Motion Planning in Unknown Environments: Optimality Benchmarks and Tractable Policies

    Authors: Lucas Janson, Tommy Hu, Marco Pavone

    Abstract: This paper addresses the problem of planning a safe (i.e., collision-free) trajectory from an initial state to a goal region when the obstacle space is a-priori unknown and is incrementally revealed online, e.g., through line-of-sight perception. Despite its ubiquitous nature, this formulation of motion planning has received relatively little theoretical investigation, as opposed to the setup wher… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

  16. arXiv:1512.01629  [pdf, ps, other

    cs.AI cs.LG math.OC

    Risk-Constrained Reinforcement Learning with Percentile Risk Criteria

    Authors: Yinlam Chow, Mohammad Ghavamzadeh, Lucas Janson, Marco Pavone

    Abstract: In many sequential decision-making problems one is interested in minimizing an expected cumulative cost while taking into account \emph{risk}, i.e., increased awareness of events of small probability and high consequences. Accordingly, the objective of this paper is to present efficient reinforcement learning algorithms for risk-constrained Markov decision processes (MDPs), where risk is represent… ▽ More

    Submitted 6 April, 2017; v1 submitted 5 December, 2015; originally announced December 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1406.3339

  17. An Asymptotically-Optimal Sampling-Based Algorithm for Bi-directional Motion Planning

    Authors: Joseph A. Starek, Javier V. Gomez, Edward Schmerling, Lucas Janson, Luis Moreno, Marco Pavone

    Abstract: Bi-directional search is a widely used strategy to increase the success and convergence rates of sampling-based motion planning algorithms. Yet, few results are available that merge both bi-directional search and asymptotic optimality into existing optimal planners, such as PRM*, RRT*, and FMT*. The objective of this paper is to fill this gap. Specifically, this paper presents a bi-directional, sa… ▽ More

    Submitted 27 July, 2015; originally announced July 2015.

    Comments: Accepted to the 2015 IEEE Intelligent Robotics and Systems Conference in Hamburg, Germany. This submission represents the long version of the conference manuscript, with additional proof details (Section IV) regarding the asymptotic optimality of the BFMT* algorithm

  18. arXiv:1505.00023  [pdf, other

    cs.RO

    Deterministic Sampling-Based Motion Planning: Optimality, Complexity, and Performance

    Authors: Lucas Janson, Brian Ichter, Marco Pavone

    Abstract: Probabilistic sampling-based algorithms, such as the probabilistic roadmap (PRM) and the rapidly-exploring random tree (RRT) algorithms, represent one of the most successful approaches to robotic motion planning, due to their strong theoretical properties (in terms of probabilistic completeness or even asymptotic optimality) and remarkable practical performance. Such algorithms are probabilistic i… ▽ More

    Submitted 3 May, 2016; v1 submitted 30 April, 2015; originally announced May 2015.

  19. arXiv:1504.08053  [pdf, other

    cs.RO

    Monte Carlo Motion Planning for Robot Trajectory Optimization Under Uncertainty

    Authors: Lucas Janson, Edward Schmerling, Marco Pavone

    Abstract: This article presents a novel approach, named MCMP (Monte Carlo Motion Planning), to the problem of motion planning under uncertainty, i.e., to the problem of computing a low-cost path that fulfills probabilistic collision avoidance constraints. MCMP estimates the collision probability (CP) of a given path by sampling via Monte Carlo the execution of a reference tracking controller (in this paper… ▽ More

    Submitted 28 May, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

  20. arXiv:1405.7421  [pdf, other

    cs.RO

    Optimal Sampling-Based Motion Planning under Differential Constraints: the Drift Case with Linear Affine Dynamics

    Authors: Edward Schmerling, Lucas Janson, Marco Pavone

    Abstract: In this paper we provide a thorough, rigorous theoretical framework to assess optimality guarantees of sampling-based algorithms for drift control systems: systems that, loosely speaking, can not stop instantaneously due to momentum. We exploit this framework to design and analyze a sampling-based algorithm (the Differential Fast Marching Tree algorithm) that is asymptotically optimal, that is, it… ▽ More

    Submitted 26 October, 2015; v1 submitted 28 May, 2014; originally announced May 2014.

  21. arXiv:1403.2483  [pdf, other

    cs.RO

    Optimal Sampling-Based Motion Planning under Differential Constraints: the Driftless Case

    Authors: Edward Schmerling, Lucas Janson, Marco Pavone

    Abstract: Motion planning under differential constraints is a classic problem in robotics. To date, the state of the art is represented by sampling-based techniques, with the Rapidly-exploring Random Tree algorithm as a leading example. Yet, the problem is still open in many aspects, including guarantees on the quality of the obtained solution. In this paper we provide a thorough theoretical framework to as… ▽ More

    Submitted 2 March, 2015; v1 submitted 11 March, 2014; originally announced March 2014.

  22. arXiv:1306.3532  [pdf, other

    cs.RO

    Fast Marching Tree: a Fast Marching Sampling-Based Method for Optimal Motion Planning in Many Dimensions

    Authors: Lucas Janson, Edward Schmerling, Ashley Clark, Marco Pavone

    Abstract: In this paper we present a novel probabilistic sampling-based motion planning algorithm called the Fast Marching Tree algorithm (FMT*). The algorithm is specifically aimed at solving complex motion planning problems in high-dimensional configuration spaces. This algorithm is proven to be asymptotically optimal and is shown to converge to an optimal solution faster than its state-of-the-art counter… ▽ More

    Submitted 6 February, 2015; v1 submitted 14 June, 2013; originally announced June 2013.