Zum Hauptinhalt springen

Showing 1–50 of 106 results for author: Smith, S L

.
  1. arXiv:2404.07839  [pdf, other

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti , et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, a family of open language models which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide two sizes of models, containing 2B and 9B parameters, and provide pre-tr… ▽ More

    Submitted 28 August, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  2. Estimating Visibility from Alternate Perspectives for Motion Planning with Occlusions

    Authors: Barry Gilhuly, Armin Sadeghi, Stephen L. Smith

    Abstract: Visibility is a crucial aspect of planning and control of autonomous vehicles (AV), particularly when navigating environments with occlusions. However, when an AV follows a trajectory with multiple occlusions, existing methods evaluate each occlusion individually, calculate a visibility cost for each, and rely on the planner to minimize the overall cost. This can result in conflicting priorities f… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE-RAL for possible publication

  3. arXiv:2403.08295  [pdf, other

    cs.CL cs.AI

    Gemma: Open Models Based on Gemini Research and Technology

    Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari , et al. (83 additional authors not shown)

    Abstract: This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge… ▽ More

    Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  4. arXiv:2402.19427  [pdf, other

    cs.LG cs.CL

    Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

    Authors: Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

    Abstract: Recurrent neural networks (RNNs) have fast inference and scale efficiently on long sequences, but they are difficult to train and hard to scale. We propose Hawk, an RNN with gated linear recurrences, and Griffin, a hybrid model that mixes gated linear recurrences with local attention. Hawk exceeds the reported performance of Mamba on downstream tasks, while Griffin matches the performance of Llama… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

  5. arXiv:2401.01483  [pdf, other

    cs.RO

    To Lead or to Follow? Adaptive Robot Task Planning in Human-Robot Collaboration

    Authors: Ali Noormohammadi-Asl, Stephen L. Smith, Kerstin Dautenhahn

    Abstract: Adaptive task planning is fundamental to ensuring effective and seamless human-robot collaboration. This paper introduces a robot task planning framework that takes into account both human leading/following preferences and performance, specifically focusing on task allocation and scheduling in collaborative settings. We present a proactive task allocation approach with three primary objectives: en… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  6. arXiv:2401.01466  [pdf, other

    cs.RO

    Human Leading or Following Preferences: Effects on Human Perception of the Robot and the Human-Robot Collaboration

    Authors: Ali Noormohammadi-Asl, Kevin Fan, Stephen L. Smith, Kerstin Dautenhahn

    Abstract: Achieving effective and seamless human-robot collaboration requires two key outcomes: enhanced team performance and fostering a positive human perception of both the robot and the collaboration. This paper investigates the capability of the proposed task planning framework to realize these objectives by integrating human leading/following preference and performance into its task allocation and sch… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  7. arXiv:2312.15177  [pdf, other

    eess.SY

    Stochastic Data-Driven Predictive Control with Equivalence to Stochastic MPC

    Authors: Ruiqi Li, John W. Simpson-Porco, Stephen L. Smith

    Abstract: We propose a data-driven receding-horizon control method dealing with the chance-constrained output-tracking problem of unknown stochastic linear time-invariant (LTI) systems with partial state observation. The proposed method takes into account the statistics of the process noise, the measurement noise and the uncertain initial condition, following an analogous framework to Stochastic Model Predi… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: 20 pages, 4 figures. The extended version of a submission to IEEE Transactions on Automatic Control

  8. arXiv:2312.07227  [pdf, other

    cs.RO

    Scalarizing Multi-Objective Robot Planning Problems using Weighted Maximization

    Authors: Nils Wilde, Stephen L. Smith, Javier Alonso-Mora

    Abstract: When designing a motion planner for autonomous robots there are usually multiple objectives to be considered. However, a cost function that yields the desired trade-off between objectives is not easily obtainable. A common technique across many applications is to use a weighted sum of relevant objective functions and then carefully adapt the weights. However, this approach may not find all relevan… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  9. arXiv:2312.05338  [pdf, other

    cs.RO math.OC

    Minimizing Robot Digging Times to Retrieve Bins in Robotic-Based Compact Storage and Retrieval Systems

    Authors: Anni Yue, Stephen L. Smith

    Abstract: Robotic-based compact storage and retrieval systems provide high-density storage in distribution center and warehouse applications. In the system, items are stored in bins, and the bins are organized inside a three-dimensional grid. Robots move on top of the grid to retrieve and deliver bins. To retrieve a bin, a robot removes all bins above one by one with its gripper, called bin digging. The clo… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 35 pages, 16 figures, submitted to Transportation Science (INFORMS)

  10. arXiv:2312.00668  [pdf, other

    math.CV

    A transform pair for bounded convex planar domains

    Authors: Jesse Hulse, Loredana Lanzani, Stefan Llewellyn Smith, Elena Luca

    Abstract: A new transform pair which can be used to solve mixed boundary value problems for Laplace's equation and the complex Helmholtz equation in bounded convex planar domains is presented. This work is an extension of Crowdy (2015, CMFT, 15, 655--687) where new transform techniques were developed for boundary value problems for Laplace's equation in circular domains. The key ingredient of the method is… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  11. arXiv:2311.17837  [pdf, other

    cs.RO

    Anytime Replanning of Robot Coverage Paths for Partially Unknown Environments

    Authors: Megnath Ramesh, Frank Imeson, Baris Fidan, Stephen L. Smith

    Abstract: In this paper, we propose a method to replan coverage paths for a robot operating in an environment with initially unknown static obstacles. Existing coverage approaches reduce coverage time by covering along the minimum number of coverage lines (straight-line paths). However, recomputing such paths online can be computationally expensive resulting in robot stoppages that increase coverage time. A… ▽ More

    Submitted 7 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 16 pages, 18 figures, Paper submitted to IEEE T-RO

  12. arXiv:2311.00136  [pdf, other

    q-bio.NC cs.LG cs.NE

    Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

    Authors: Antonis Antoniades, Yiyi Yu, Joseph Canzano, William Wang, Spencer LaVere Smith

    Abstract: State-of-the-art systems neuroscience experiments yield large-scale multimodal data, and these data sets require new tools for analysis. Inspired by the success of large pretrained models in vision and language domains, we reframe the analysis of large-scale, cellular-resolution neuronal spiking data into an autoregressive spatiotemporal generation problem. Neuroformer is a multimodal, multitask g… ▽ More

    Submitted 15 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 9 pages for main paper. 22 pages in total. 13 figures, 1 table

  13. arXiv:2310.16764  [pdf, other

    cs.CV cs.LG cs.NE

    ConvNets Match Vision Transformers at Scale

    Authors: Samuel L. Smith, Andrew Brock, Leonard Berrada, Soham De

    Abstract: Many researchers believe that ConvNets perform well on small or moderately sized datasets, but are not competitive with Vision Transformers when given access to datasets on the web-scale. We challenge this belief by evaluating a performant ConvNet architecture pre-trained on JFT-4B, a large labelled dataset of images often used for training foundation models. We consider pre-training compute budge… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  14. arXiv:2310.10502  [pdf, other

    cs.RO cs.MA

    Adaptive Robot Assistance: Expertise and Influence in Multi-User Task Planning

    Authors: Abhinav Dahiya, Stephen L. Smith

    Abstract: This paper addresses the challenge of enabling a single robot to effectively assist multiple humans in decision-making for task planning domains. We introduce a comprehensive framework designed to enhance overall team performance by considering both human expertise in making the optimal decisions and robot influence on human decision-making. Our model integrates these factors seamlessly within the… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 7 pages, 5 figures

  15. IAS/CEA Evolution of Dust in Nearby Galaxies (ICED): the spatially-resolved dust properties of NGC4254

    Authors: L. Pantoni, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, G. Ejlali, F. Galliano, A. Gomez, J. Goupy, A. P. Jones, C. Hanser , et al. (35 additional authors not shown)

    Abstract: We present the first preliminary results of the project \textit{ICED}, focusing on the face-on galaxy NGC4254. We use the millimetre maps observed with NIKA2 at IRAM-30m, as part of the IMEGIN Guaranteed Time Large Program, and of a wide collection of ancillary data (multi-wavelength photometry and gas phase spectral lines) that are publicly available. We derive the global and local properties of… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of Conferences 293 (2024) 00038

  16. Constraining Millimeter Dust Emission in Nearby Galaxies with NIKA2: the case of NGC2146 and NGC2976

    Authors: G. Ejlali, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Aussel, M. Baes, A. Beelen, Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, F. Galliano, A. Gomez, J. Goupy, A. P. Jones, C. Hanser, A. Hughes , et al. (35 additional authors not shown)

    Abstract: This study presents the first millimeter continuum mapping observations of two nearby galaxies, the starburst spiral galaxy NGC2146 and the dwarf galaxy NGC2976, at 1.15 mm and 2 mm using the NIKA2 camera on the IRAM 30m telescope, as part of the Guaranteed Time Large Project IMEGIN. These observations provide robust resolved information about the physical properties of dust in nearby galaxies by… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: To appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

    Journal ref: EPJ Web of conferences 293 (2024) 00016

  17. arXiv:2308.10888  [pdf, other

    cs.LG cs.CV cs.CY

    Unlocking Accuracy and Fairness in Differentially Private Image Classification

    Authors: Leonard Berrada, Soham De, Judy Hanwen Shen, Jamie Hayes, Robert Stanforth, David Stutz, Pushmeet Kohli, Samuel L. Smith, Borja Balle

    Abstract: Privacy-preserving machine learning aims to train models on private data without leaking sensitive information. Differential privacy (DP) is considered the gold standard framework for privacy-preserving training, as it provides formal privacy guarantees. However, compared to their non-private counterparts, models trained with DP often have significantly reduced accuracy. Private classifiers are al… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  18. arXiv:2307.11888  [pdf, other

    cs.LG cs.NE

    Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex Eigenvalues

    Authors: Antonio Orvieto, Soham De, Caglar Gulcehre, Razvan Pascanu, Samuel L. Smith

    Abstract: Deep neural networks based on linear RNNs interleaved with position-wise MLPs are gaining traction as competitive approaches for sequence modeling. Examples of such architectures include state-space models (SSMs) like S4, LRU, and Mamba: recently proposed models that achieve promising performance on text, genetics, and other data that require long-range reasoning. Despite experimental evidence hig… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: v1: Accepted at HLD 2023 Workshop @ICML; v2: Preprint; v3: ICML version

  19. arXiv:2307.11192  [pdf, other

    cs.RO

    Adapting to Human Preferences to Lead or Follow in Human-Robot Collaboration: A System Evaluation

    Authors: Ali Noormohammadi-Asl, Ali Ayub, Stephen L. Smith, Kerstin Dautenhahn

    Abstract: With the introduction of collaborative robots, humans and robots can now work together in close proximity and share the same workspace. However, this collaboration presents various challenges that need to be addressed to ensure seamless cooperation between the agents. This paper focuses on task planning for human-robot collaboration, taking into account the human's performance and their preference… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  20. arXiv:2307.04674  [pdf, other

    cs.RO

    Optimal Robot Path Planning In a Collaborative Human-Robot Team with Intermittent Human Availability

    Authors: Abhinav Dahiya, Stephen L. Smith

    Abstract: This paper presents a solution for the problem of optimal planning for a robot in a collaborative human-robot team, where the human supervisor is intermittently available to assist the robot in completing tasks more quickly. Specifically, we address the challenge of computing the fastest path between two configurations in an environment with time constraints on how long the robot can wait for assi… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 9 pages, 7 figures, IEEE ROMAN 2023

  21. Optimizing Task Waiting Times in Dynamic Vehicle Routing

    Authors: Alexander Botros, Barry Gilhuly, Nils Wilde, Armin Sadeghi, Javier Alonso-Mora, Stephen L. Smith

    Abstract: We study the problem of deploying a fleet of mobile robots to service tasks that arrive stochastically over time and at random locations in an environment. This is known as the Dynamic Vehicle Routing Problem (DVRP) and requires robots to allocate incoming tasks among themselves and find an optimal sequence for each robot. State-of-the-art approaches only consider average wait times and focus on h… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters (RA-L)

    MSC Class: 68M20 ACM Class: J.2

  22. arXiv:2306.16501  [pdf, other

    cs.RO cs.HC

    On the Impact of Interruptions During Multi-Robot Supervision Tasks

    Authors: Abhinav Dahiya, Yifan Cai, Oliver Schneider, Stephen L. Smith

    Abstract: Human supervisors in multi-robot systems are primarily responsible for monitoring robots, but can also be assigned with secondary tasks. These tasks can act as interruptions and can be categorized as either intrinsic, i.e., being directly related to the monitoring task, or extrinsic, i.e., being unrelated. In this paper, we investigate the impact of these two types of interruptions through a user… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 7 pages, 10 figures, 2 tables, ICRA 2023

  23. arXiv:2304.02692  [pdf, other

    math.OC eess.SY

    A Unified Approach to Optimally Solving Sensor Scheduling and Sensor Selection Problems in Kalman Filtering

    Authors: Shamak Dutta, Nils Wilde, Stephen L. Smith

    Abstract: We consider a general form of the sensor scheduling problem for state estimation of linear dynamical systems, which involves selecting sensors that minimize the trace of the Kalman filter error covariance (weighted by a positive semidefinite matrix) subject to polyhedral constraints on the selected sensors. This general form captures several well-studied problems including sensor placement, sensor… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  24. arXiv:2303.08935  [pdf, other

    cs.RO

    Multi-Robot Persistent Monitoring: Minimizing Latency and Number of Robots with Recharging Constraints

    Authors: Ahmad Bilal Asghar, Shreyas Sundaram, Stephen L. Smith

    Abstract: In this paper we study multi-robot path planning for persistent monitoring tasks. We consider the case where robots have a limited battery capacity with a discharge time $D$. We represent the areas to be monitored as the vertices of a weighted graph. For each vertex, there is a constraint on the maximum allowable time between robot visits, called the latency. The objective is to find the minimum n… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 13 pages, 10 fiugres. arXiv admin note: substantial text overlap with arXiv:1903.06105

  25. arXiv:2303.06349  [pdf, other

    cs.LG

    Resurrecting Recurrent Neural Networks for Long Sequences

    Authors: Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De

    Abstract: Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to perform remarkably well on long sequence modeling tasks, and have the added benefits of fast parallelizable training and RNN-like fast inference. However, while SSMs are superficially similar to RNNs, there are important diff… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 30 pages, 9 figures

  26. arXiv:2302.13861  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Differentially Private Diffusion Models Generate Useful Synthetic Images

    Authors: Sahra Ghalebikesabi, Leonard Berrada, Sven Gowal, Ira Ktena, Robert Stanforth, Jamie Hayes, Soham De, Samuel L. Smith, Olivia Wiles, Borja Balle

    Abstract: The ability to generate privacy-preserving synthetic versions of sensitive image datasets could unlock numerous ML applications currently constrained by data availability. Due to their astonishing image generation quality, diffusion models are a prime candidate for generating high-quality synthetic data. However, recent studies have found that, by default, the outputs of some diffusion models do n… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  27. Real-Time Navigation for Autonomous Surface Vehicles In Ice-Covered Waters

    Authors: Rodrigue de Schaetzen, Alexander Botros, Robert Gash, Kevin Murrant, Stephen L. Smith

    Abstract: Vessel transit in ice-covered waters poses unique challenges in safe and efficient motion planning. When the concentration of ice is high, it may not be possible to find collision-free trajectories. Instead, ice can be pushed out of the way if it is small or if contact occurs near the edge of the ice. In this work, we propose a real-time navigation framework that minimizes collisions with ice and… ▽ More

    Submitted 23 February, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: 7 pages, 8 figures

  28. arXiv:2302.10322  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

    Authors: Bobby He, James Martens, Guodong Zhang, Aleksandar Botev, Andrew Brock, Samuel L Smith, Yee Whye Teh

    Abstract: Skip connections and normalisation layers form two standard architectural components that are ubiquitous for the training of Deep Neural Networks (DNNs), but whose precise roles are poorly understood. Recent approaches such as Deep Kernel Shaping have made progress towards reducing our reliance on them, using insights from wide NN kernel theory to improve signal propagation in vanilla DNNs (which… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: ICLR 2023

  29. arXiv:2212.05286  [pdf, other

    cs.RO

    A Survey of Multi-Agent Human-Robot Interaction Systems

    Authors: Abhinav Dahiya, Alexander M. Aroyo, Kerstin Dautenhahn, Stephen L. Smith

    Abstract: This article presents a survey of literature in the area of Human-Robot Interaction (HRI), specifically on systems containing more than two agents (i.e., having multiple humans and/or multiple robots). We identify three core aspects of ``Multi-agent" HRI systems that are useful for understanding how these systems differ from dyadic systems and from one another. These are the Team structure, Intera… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: 23 pages, 7 figures

  30. arXiv:2210.08107  [pdf, other

    cs.RO

    Approximation Algorithms for Robot Tours in Random Fields with Guaranteed Estimation Accuracy

    Authors: Shamak Dutta, Nils Wilde, Pratap Tokekar, Stephen L. Smith

    Abstract: We study the sample placement and shortest tour problem for robots tasked with mapping environmental phenomena modeled as stationary random fields. The objective is to minimize the resources used (samples or tour length) while guaranteeing estimation accuracy. We give approximation algorithms for both problems in convex environments. These improve previously known results, both in terms of theoret… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  31. arXiv:2209.03458  [pdf, other

    cs.RO eess.SY

    Scheduling Operator Assistance for Shared Autonomy in Multi-Robot Teams

    Authors: Yifan Cai, Abhinav Dahiya, Nils Wilde, Stephen L. Smith

    Abstract: In this paper, we consider the problem of allocating human operator assistance in a system with multiple autonomous robots. Each robot is required to complete independent missions, each defined as a sequence of tasks. While executing a task, a robot can either operate autonomously or be teleoperated by the human operator to complete the task at a faster rate. We show that the problem of creating a… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  32. arXiv:2206.00663  [pdf, other

    cs.RO

    Error-Bounded Approximation of Pareto Fronts in Robot Planning Problems

    Authors: Alexander Botros, Armin Sadeghi, Nils Wilde, Javier Alonso-Mora, Stephen L. Smith

    Abstract: Many problems in robotics seek to simultaneously optimize several competing objectives under constraints. A conventional approach to solving such multi-objective optimization problems is to create a single cost function comprised of the weighted sum of the individual objectives. Solutions to this scalarized optimization problem are Pareto optimal solutions to the original multi-objective problem.… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  33. arXiv:2205.11513  [pdf, other

    eess.SY

    Data-Driven Learning of Safety-Critical Control with Stochastic Control Barrier Functions

    Authors: Chuanzheng Wang, Yiming Meng, Stephen L. Smith, Jun Liu

    Abstract: Control barrier functions are widely used to synthesize safety-critical controls. The existence of Gaussian-type noise may lead to unsafe actions and result in severe consequences. While studies are widely done in safety-critical control for stochastic systems, in many real-world applications, we do not have the knowledge of the stochastic component of the dynamics. In this paper, we study safety-… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  34. arXiv:2204.13650  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Unlocking High-Accuracy Differentially Private Image Classification through Scale

    Authors: Soham De, Leonard Berrada, Jamie Hayes, Samuel L. Smith, Borja Balle

    Abstract: Differential Privacy (DP) provides a formal privacy guarantee preventing adversaries with access to a machine learning model from extracting information about individual training points. Differentially Private Stochastic Gradient Descent (DP-SGD), the most popular DP training method for deep learning, realizes this protection by injecting noise during training. However previous works have found th… ▽ More

    Submitted 16 June, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

  35. arXiv:2204.09571  [pdf, other

    eess.SY

    Informative Path Planning in Random Fields via Mixed Integer Programming

    Authors: Shamak Dutta, Nils Wilde, Stephen L. Smith

    Abstract: We present a new mixed integer formulation for the discrete informative path planning problem in random fields. The objective is to compute a budget constrained path while collecting measurements whose linear estimate results in minimum error over a finite set of prediction locations. The problem is known to be NP-hard. However, we strive to compute optimal solutions by leveraging advances in mixe… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

  36. arXiv:2203.16423  [pdf, ps, other

    eess.SY

    Data-Driven Model Predictive Control for Linear Time-Periodic Systems

    Authors: Ruiqi Li, John W. Simpson-Porco, Stephen L. Smith

    Abstract: We consider the problem of data-driven predictive control for an unknown discrete-time linear time-periodic (LTP) system of known period. Our proposed strategy generalizes both Data-enabled Predictive Control (DeePC) and Subspace Predictive Control (SPC), which are established data-driven control techniques for linear time-invariant (LTI) systems. The approach is supported by an extensive theoreti… ▽ More

    Submitted 12 September, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 12 pages, 7 figures, the extended version of a CDC 2022 paper

  37. arXiv:2203.16070  [pdf, other

    math.OC cs.LG

    An Improved Greedy Algorithm for Subset Selection in Linear Estimation

    Authors: Shamak Dutta, Nils Wilde, Stephen L. Smith

    Abstract: In this paper, we consider a subset selection problem in a spatial field where we seek to find a set of k locations whose observations provide the best estimate of the field value at a finite set of prediction locations. The measurements can be taken at any location in the continuous field, and the covariance between the field values at different points is given by the widely used squared exponent… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted for publication at European Control Conference, 2022

  38. arXiv:2203.08278  [pdf, other

    hep-ex

    A New Charged Lepton Flavor Violation Program at Fermilab

    Authors: M. Aoki, R. B. Appleby, M. Aslaninejad, R. Barlow, R. H. Bernstein, C. Bloise, L. Calibbi, F. Cervelli, R. Culbertson, Andre Luiz de Gouvea, S. Di Falco, E. Diociaiuti, S. Donati, R. Donghia, B. Echenard, A. Gaponenko, S. Giovannella, C. Group, F. Happacher, M. T. Hedges, D. G. Hitlin, E. Hungerford, C. Johnstone, D. M. Kaplan, M. Kargiantoulakis , et al. (43 additional authors not shown)

    Abstract: The muon has played a central role in establishing the Standard Model of particle physics, and continues to provide valuable information about the nature of new physics. A new complex at Fermilab, the Advanced Muon Facility, would provide the world's most intense positive and negative muon beams by exploiting the full potential of PIP-II and the Booster upgrade. This facility would enable a broad… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: A Contributed Paper for Snowmass 2021

  39. arXiv:2201.00724  [pdf, other

    cs.DS eess.SY math.OC

    Submodular Maximization with Limited Function Access

    Authors: Andrew Downie, Bahman Gharesifard, Stephen L. Smith

    Abstract: We consider a class of submodular maximization problems in which decision-makers have limited access to the objective function. We explore scenarios where the decision-maker can observe only pairwise information, i.e., can evaluate the objective function on sets of size two. We begin with a negative result that no algorithm using only $k$-wise information can guarantee performance better than… ▽ More

    Submitted 7 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 14 pages, 8 figures

  40. Learning Submodular Objectives for Team Environmental Monitoring

    Authors: Nils Wilde, Armin Sadeghi, Stephen L. Smith

    Abstract: In this paper, we study the well-known team orienteering problem where a fleet of robots collects rewards by visiting locations. Usually, the rewards are assumed to be known to the robots; however, in applications such as environmental monitoring or scene reconstruction, the rewards are often subjective and specifying them is challenging. We propose a framework to learn the unknown preferences of… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  41. arXiv:2111.06437  [pdf, other

    cs.RO eess.SY

    Scalable Operator Allocation for Multi-Robot Assistance: A Restless Bandit Approach

    Authors: Abhinav Dahiya, Nima Akbarzadeh, Aditya Mahajan, Stephen L. Smith

    Abstract: In this paper, we consider the problem of allocating human operators in a system with multiple semi-autonomous robots. Each robot is required to perform an independent sequence of tasks, subjected to a chance of failing and getting stuck in a fault state at every task. If and when required, a human operator can assist or teleoperate a robot. Conventional MDP techniques used to solve such problems… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 11 pages + 4 page Appendix, 7 Figures

  42. Dust Emission in Galaxies at Millimeter Wavelengths: Cooling of star forming regions in NGC6946

    Authors: G. Ejlali, R. Adam, P. Ade, H. Ajeddig, P. André, E. Artis, H. Ausse, A. Beelen, A. Benoît, S. Berta, L. Bing, O. Bourrion, M. Calvo, A. Catalano, I. de Looze, M. De Petris, F. -X. Désert, S. Doyle, E. F. C. Driessen, M. Galametz, F. Galliano, A. Gomez, J. Goupy, A. P. Jones, A. Hughes , et al. (32 additional authors not shown)

    Abstract: Interstellar dust plays an important role in the formation of molecular gas and the heating and cooling of the interstellar medium. The spatial distribution of the mm-wavelength dust emission from galaxies is largely unexplored. The NIKA2 Guaranteed Time Project IMEGIN (Interpreting the Millimeter Emission of Galaxies with IRAM and NIKA2) has recently mapped the mm emission in the grand design spi… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: To appear in the Proceedings of the International Conference entitled "mm Universe @ NIKA2", Rome (Italy), June 2021, EPJ Web of conferences

  43. arXiv:2110.00284  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Reward Functions from Scale Feedback

    Authors: Nils Wilde, Erdem Bıyık, Dorsa Sadigh, Stephen L. Smith

    Abstract: Today's robots are increasingly interacting with people and need to efficiently learn inexperienced user's preferences. A common framework is to iteratively query the user about which of two presented robot trajectories they prefer. While this minimizes the users effort, a strict choice does not yield any information on how much one trajectory is preferred. We propose scale feedback, where the use… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: 16 pages, 15 figures, 3 tables. Published at Conference on Robot Learning (CoRL) 2021

  44. arXiv:2109.08185  [pdf, other

    cs.RO eess.SY

    Optimal Partitioning of Non-Convex Environments for Minimum Turn Coverage Planning

    Authors: Megnath Ramesh, Frank Imeson, Baris Fidan, Stephen L. Smith

    Abstract: In this paper, we tackle the problem of planning an optimal coverage path for a robot operating indoors. Many existing approaches attempt to discourage turns in the path by covering the environment along the least number of coverage lines, i.e., straight-line paths. This is because turning not only slows down the robot but also negatively affects the quality of coverage, e.g., tools like cameras a… ▽ More

    Submitted 26 May, 2022; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: 8 pages, 9 figures, submitted to RA-L with IROS 2022 option

  45. arXiv:2107.11467  [pdf, other

    cs.RO

    Spatio-Temporal Lattice Planning Using Optimal Motion Primitives

    Authors: Alexander Botros, Stephen L. Smith

    Abstract: Lattice-based planning techniques simplify the motion planning problem for autonomous vehicles by limiting available motions to a pre-computed set of primitives. These primitives are then combined online to generate more complex maneuvers. A set of motion primitives t-span a lattice if, given a real number t at least 1, any configuration in the lattice can be reached via a sequence of motion primi… ▽ More

    Submitted 17 July, 2023; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: 12 pages, 9 figures, 2 tables, accepted to IEEE Transactions on Intelligent Transportation Systems (preprint)

  46. arXiv:2106.03836  [pdf, other

    cs.RO eess.SY

    Tunable Trajectory Planner Using G3 Curves

    Authors: Alexander Botros, Stephen L. Smith

    Abstract: Trajectory planning is commonly used as part of a local planner in autonomous driving. This paper considers the problem of planning a continuous-curvature-rate trajectory between fixed start and goal states that minimizes a tunable trade-off between passenger comfort and travel time. The problem is an instance of infinite dimensional optimization over two continuous functions: a path, and a veloci… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 13 pages, 11 figures, submitted to IEEE Transactions on Intelligent Vehicles

  47. arXiv:2105.13343  [pdf, other

    cs.LG cs.CV

    Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error

    Authors: Stanislav Fort, Andrew Brock, Razvan Pascanu, Soham De, Samuel L. Smith

    Abstract: In computer vision, it is standard practice to draw a single sample from the data augmentation procedure for each unique image in the mini-batch. However recent work has suggested drawing multiple samples can achieve higher test accuracies. In this work, we provide a detailed empirical evaluation of how the number of augmentation samples per unique image influences model performance on held out da… ▽ More

    Submitted 24 February, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

  48. arXiv:2104.02585  [pdf, other

    eess.SY

    Safety-Critical Control of Stochastic Systems using Stochastic Control Barrier Functions

    Authors: Chuanzheng Wang, Yiming Meng, Stephen L. Smith, Jun Liu

    Abstract: Control barrier functions have been widely used for synthesizing safety-critical controls, often via solving quadratic programs. However, the existence of Gaussian-type noise may lead to unsafe actions and result in severe consequences. In this paper, we study systems modeled by stochastic differential equations (SDEs) driven by Brownian motions. We propose a notion of stochastic control barrier f… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  49. arXiv:2102.06171  [pdf, other

    cs.CV cs.LG stat.ML

    High-Performance Large-Scale Image Recognition Without Normalization

    Authors: Andrew Brock, Soham De, Samuel L. Smith, Karen Simonyan

    Abstract: Batch normalization is a key component of most image classification models, but it has many undesirable properties stemming from its dependence on the batch size and interactions between examples. Although recent work has succeeded in training deep ResNets without normalization layers, these models do not match the test accuracies of the best batch-normalized networks, and are often unstable for l… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  50. arXiv:2101.12176  [pdf, other

    cs.LG stat.ML

    On the Origin of Implicit Regularization in Stochastic Gradient Descent

    Authors: Samuel L. Smith, Benoit Dherin, David G. T. Barrett, Soham De

    Abstract: For infinitesimal learning rates, stochastic gradient descent (SGD) follows the path of gradient flow on the full batch loss function. However moderately large learning rates can achieve higher test accuracies, and this generalization benefit is not explained by convergence bounds, since the learning rate which maximizes test accuracy is often larger than the learning rate which minimizes training… ▽ More

    Submitted 28 January, 2021; originally announced January 2021.

    Comments: Accepted as a conference paper at ICLR 2021