Zum Hauptinhalt springen

Showing 1–50 of 90 results for author: Smith, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11061  [pdf, other

    cs.CL

    StructuredRAG: JSON Response Formatting with Large Language Models

    Authors: Connor Shorten, Charles Pierse, Thomas Benjamin Smith, Erika Cardenas, Akanksha Sharma, John Trengrove, Bob van Luijt

    Abstract: The ability of Large Language Models (LLMs) to generate structured outputs, such as JSON, is crucial for their use in Compound AI Systems. However, evaluating and improving this capability remains challenging. In this work, we introduce StructuredRAG, a benchmark of six tasks designed to assess LLMs' proficiency in following response format instructions. We evaluate two state-of-the-art LLMs, Gemi… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Preprint. 10 pages, 6 figures

  2. arXiv:2408.05604  [pdf, other

    cs.RO

    Cellular Plasticity Model for Bottom-Up Robotic Design

    Authors: Trevor R. Smith, Thomas J. Smith, Nicholas S. Szczecinski, Sergiy Yakovenko, Yu Gu

    Abstract: Traditional top-down robotic design often lacks the adaptability needed to handle real-world complexities, prompting the need for more flexible approaches. Therefore, this study introduces a novel cellular plasticity model tailored for bottom-up robotic design. The proposed model utilizes an activator-inhibitor reaction, a common foundation of Turing patterns, which are fundamental in morphogenesi… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 15 pages, 7 figures, Living Machines 2024

  3. arXiv:2407.19115  [pdf, other

    cs.LG

    Towards Scalable and Stable Parallelization of Nonlinear RNNs

    Authors: Xavier Gonzalez, Andrew Warrington, Jimmy T. H. Smith, Scott W. Linderman

    Abstract: Conventional nonlinear RNNs are not naturally parallelizable across the sequence length, whereas transformers and linear RNNs are. Lim et al. [2024] therefore tackle parallelized evaluation of nonlinear RNNs by posing it as a fixed point problem, solved with Newton's method. By deriving and applying a parallelized form of Newton's method, they achieve huge speedups over sequential evaluation. Howe… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 22 pages, 6 figures

    ACM Class: I.2.6

  4. arXiv:2407.13277  [pdf, other

    eess.IV cs.CV

    URCDM: Ultra-Resolution Image Synthesis in Histopathology

    Authors: Sarah Cechnicka, James Ball, Matthew Baugh, Hadrien Reynaud, Naomi Simmonds, Andrew P. T. Smith, Catherine Horsfield, Candice Roufosse, Bernhard Kainz

    Abstract: Diagnosing medical conditions from histopathology data requires a thorough analysis across the various resolutions of Whole Slide Images (WSI). However, existing generative methods fail to consistently represent the hierarchical structure of WSIs due to a focus on high-fidelity patches. To tackle this, we propose Ultra-Resolution Cascaded Diffusion Models (URCDMs) which are capable of synthesising… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.01152

  5. Air-Ground Collaboration with SPOMP: Semantic Panoramic Online Mapping and Planning

    Authors: Ian D. Miller, Fernando Cladera, Trey Smith, Camillo Jose Taylor, Vijay Kumar

    Abstract: Mapping and navigation have gone hand-in-hand since long before robots existed. Maps are a key form of communication, allowing someone who has never been somewhere to nonetheless navigate that area successfully. In the context of multi-robot systems, the maps and information that flow between robots are necessary for effective collaboration, whether those robots are operating concurrently, sequent… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Video: https://www.youtube.com/watch?v=ieNYH40buBo

    Journal ref: IEEE Transactions on Field Robotics (2024)

  6. arXiv:2407.07279  [pdf, other

    cs.LG stat.ML

    Towards a theory of learning dynamics in deep state space models

    Authors: Jakub Smékal, Jimmy T. H. Smith, Michael Kleinman, Dan Biderman, Scott W. Linderman

    Abstract: State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2405.06147  [pdf, other

    cs.LG eess.SY

    State-Free Inference of State-Space Models: The Transfer Function Approach

    Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

    Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More

    Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

  8. arXiv:2404.07703  [pdf, ps, other

    cs.LG cs.RO eess.SY

    Learning Hamiltonian Dynamics with Reproducing Kernel Hilbert Spaces and Random Features

    Authors: Torbjørn Smith, Olav Egeland

    Abstract: A method for learning Hamiltonian dynamics from a limited and noisy dataset is proposed. The method learns a Hamiltonian vector field on a reproducing kernel Hilbert space (RKHS) of inherently Hamiltonian vector fields, and in particular, odd Hamiltonian vector fields. This is done with a symplectic kernel, and it is shown how the kernel can be modified to an odd symplectic kernel to impose the od… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.09734

  9. arXiv:2404.03489  [pdf, other

    cs.RO

    Design of Stickbug: a Six-Armed Precision Pollination Robot

    Authors: Trevor Smith, Madhav Rijal, Christopher Tatsch, R. Michael Butts, Jared Beard, R. Tyler Cook, Andy Chu, Jason Gross, Yu Gu

    Abstract: This work presents the design of Stickbug, a six-armed, multi-agent, precision pollination robot that combines the accuracy of single-agent systems with swarm parallelization in greenhouses. Precision pollination robots have often been proposed to offset the effects of a decreasing population of natural pollinators, but they frequently lack the required parallelization and scalability. Stickbug ac… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 7 pages, 7 figures

  10. arXiv:2404.03039  [pdf, ps, other

    cs.FL

    Illustrating Finite Automata with Grail+ and TikZ

    Authors: Alastair May, Taylor J. Smith

    Abstract: In this article, we discuss a new software tool that interacts with Grail+, a library of automata-theoretic command-line utilities. Our software, the Grail+ Visualizer, takes the textual representation of a finite automaton produced by Grail+ and generates TikZ code to illustrate the finite automaton, with automatic layout of states and transitions. In addition to giving an overview of the basics… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    MSC Class: 68-04 (primary); 68Q45 (secondary)

  11. arXiv:2403.08707  [pdf, ps, other

    cs.DS cs.FL

    Improved Randomized Approximation of Hard Universality and Emptiness Problems

    Authors: Pantelis Andreou, Stavros Konstantinidis, Taylor J. Smith

    Abstract: We build on recent research on polynomial randomized approximation (PRAX) algorithms for the hard problems of NFA universality and NFA equivalence. Loosely speaking, PRAX algorithms use sampling of infinite domains within any desired accuracy $δ$. In the spirit of experimental mathematics, we extend the concept of PRAX algorithms to be applicable to the emptiness and universality problems in any d… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    MSC Class: 68W25 (primary); 68W20; 68Q45 (secondary)

  12. arXiv:2312.09734  [pdf, ps, other

    cs.RO cs.LG eess.SY

    Learning of Hamiltonian Dynamics with Reproducing Kernel Hilbert Spaces

    Authors: Torbjørn Smith, Olav Egeland

    Abstract: This paper presents a method for learning Hamiltonian dynamics from a limited set of data points. The Hamiltonian vector field is found by regularized optimization over a reproducing kernel Hilbert space of vector fields that are inherently Hamiltonian, and where the vector field is required to be odd or even. This is done with a symplectic kernel, and it is shown how this symplectic kernel can be… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  13. arXiv:2312.02396  [pdf, other

    cs.RO cs.CV cs.LG

    Unsupervised Change Detection for Space Habitats Using 3D Point Clouds

    Authors: Jamie Santos, Holly Dinkel, Julia Di, Paulo V. K. Borges, Marina Moreira, Oleg Alexandrov, Brian Coltin, Trey Smith

    Abstract: This work presents an algorithm for scene change detection from point clouds to enable autonomous robotic caretaking in future space habitats. Autonomous robotic systems will help maintain future deep-space habitats, such as the Gateway space station, which will be uncrewed for extended periods. Existing scene analysis software used on the International Space Station (ISS) relies on manually-label… ▽ More

    Submitted 5 August, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: 15 pages, 7 figures, Manuscript was presented at the AIAA SciTech Forum in Orlando, FL, USA, 8 - 12 January 2024. Video presentation: [https://www.youtube.com/watch?v=7WHp0dQYG4Y]. Code: [https://github.com/nasa/isaac/tree/master/anomaly/gmm-change-detection]

    Report number: AIAA 2024-1960

    Journal ref: AIAA SCITECH 2024 Forum

  14. arXiv:2311.14711  [pdf, other

    cs.CY cs.AI

    Towards Publicly Accountable Frontier LLMs: Building an External Scrutiny Ecosystem under the ASPIRE Framework

    Authors: Markus Anderljung, Everett Thornton Smith, Joe O'Brien, Lisa Soder, Benjamin Bucknall, Emma Bluemke, Jonas Schuett, Robert Trager, Lacey Strahm, Rumman Chowdhury

    Abstract: With the increasing integration of frontier large language models (LLMs) into society and the economy, decisions related to their training, deployment, and use have far-reaching implications. These decisions should not be left solely in the hands of frontier LLM developers. LLM users, civil society and policymakers need trustworthy sources of information to steer such decisions for the better. Inv… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted to Workshop on Socially Responsible Language Modelling Research (SoLaR) at the 2023 Conference on Neural Information Processing Systems (NeurIPS 2023)

    ACM Class: I.2.0

  15. Multi-Agent 3D Map Reconstruction and Change Detection in Microgravity with Free-Flying Robots

    Authors: Holly Dinkel, Julia Di, Jamie Santos, Keenan Albee, Paulo Borges, Marina Moreira, Oleg Alexandrov, Brian Coltin, Trey Smith

    Abstract: Assistive free-flyer robots autonomously caring for future crewed outposts -- such as NASA's Astrobee robots on the International Space Station (ISS) -- must be able to detect day-to-day interior changes to track inventory, detect and diagnose faults, and monitor the outpost status. This work presents a framework for multi-agent cooperative mapping and change detection to enable robotic maintenanc… ▽ More

    Submitted 6 August, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 11 pages, 8 figures, Manuscript presented at the 74th International Astronautical Congress, IAC 2023, Baku, Azerbaijan, 2 - 6 October 2023. Video presentation: [https://www.youtube.com/watch?v=VfjV-zwFEtU]. Code: [https://github.com/hollydinkel/astrobeecd]

    Journal ref: Acta Astronautica 223 (2024) 98-107

  16. arXiv:2310.19694  [pdf, other

    cs.LG

    Convolutional State Space Models for Long-Range Spatiotemporal Modeling

    Authors: Jimmy T. H. Smith, Shalini De Mello, Jan Kautz, Scott W. Linderman, Wonmin Byeon

    Abstract: Effectively modeling long spatiotemporal sequences is challenging due to the need to model complex spatial correlations and long-range temporal dependencies simultaneously. ConvLSTMs attempt to address this by updating tensor-valued states with recurrent neural networks, but their sequential computation makes them slow to train. In contrast, Transformers can process an entire spatiotemporal sequen… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  17. arXiv:2310.07322  [pdf

    cs.HC cs.CV

    A webcam-based machine learning approach for three-dimensional range of motion evaluation

    Authors: Xiaoye Michael Wang, Derek T. Smith, Qin Zhu

    Abstract: Background. Joint range of motion (ROM) is an important quantitative measure for physical therapy. Commonly relying on a goniometer, accurate and reliable ROM measurement requires extensive training and practice. This, in turn, imposes a significant barrier for those who have limited in-person access to healthcare. Objective. The current study presents and evaluates an alternative machine learni… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  18. arXiv:2310.02183  [pdf, other

    cs.DC

    Puddles: Application-Independent Recovery and Location-Independent Data for Persistent Memory

    Authors: Suyash Mahar, Mingyao Shen, TJ Smith, Joseph Izraelevitz, Steven Swanson

    Abstract: In this paper, we argue that current work has failed to provide a comprehensive and maintainable in-memory representation for persistent memory. PM data should be easily mappable into a process address space, shareable across processes, shippable between machines, consistent after a crash, and accessible to legacy code with fast, efficient pointers as first-class abstractions. While existing s… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: To appear in EuroSys 2024

  19. arXiv:2310.00206  [pdf, other

    cs.RO

    An Investigation of Multi-feature Extraction and Super-resolution with Fast Microphone Arrays

    Authors: Eric T. Chang, Runsheng Wang, Peter Ballentine, Jingxi Xu, Trey Smith, Brian Coltin, Ioannis Kymissis, Matei Ciocarlie

    Abstract: In this work, we use MEMS microphones as vibration sensors to simultaneously classify texture and estimate contact position and velocity. Vibration sensors are an important facet of both human and robotic tactile sensing, providing fast detection of contact and onset of slip. Microphones are an attractive option for implementing vibration sensing as they offer a fast response and can be sampled qu… ▽ More

    Submitted 7 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 6 pages, 4 figures, accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)

  20. arXiv:2308.13135  [pdf, other

    stat.ML cs.LG

    Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery

    Authors: Patrick Emedom-Nnamdi, Timothy R. Smith, Jukka-Pekka Onnela, Junwei Lu

    Abstract: We propose a nonparametric additive model for estimating interpretable value functions in reinforcement learning. Learning effective adaptive clinical interventions that rely on digital phenotyping features is a major for concern medical practitioners. With respect to spine surgery, different post-operative recovery recommendations concerning patient mobilization can lead to significant variation… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 28 pages, 13 figures

  21. arXiv:2306.12629  [pdf, other

    cs.RO cs.MA

    Swarm of One: Bottom-up Emergence of Stable Robot Bodies from Identical Cells

    Authors: Trevor Smith, R. Michael Butts, Nathan Adkins, Yu Gu

    Abstract: Unlike most human-engineered systems, biological systems are emergent from low-level interactions, allowing much broader diversity and superior adaptation to the complex environments. Inspired by the process of morphogenesis in nature, a bottom-up design approach for robot morphology is proposed to treat a robot's body as an emergent response to underlying processes rather than a predefined shape.… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 6 pages, 6 figures, IROS 2023

  22. arXiv:2306.05810  [pdf, other

    cs.LG

    Explaining Reinforcement Learning with Shapley Values

    Authors: Daniel Beechey, Thomas M. S. Smith, Özgür Şimşek

    Abstract: For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Rei… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures. Accepted at ICML 2023

  23. arXiv:2305.00100  [pdf, other

    cs.LG physics.ao-ph physics.flu-dyn

    Temporal Subsampling Diminishes Small Spatial Scales in Recurrent Neural Network Emulators of Geophysical Turbulence

    Authors: Timothy A. Smith, Stephen G. Penny, Jason A. Platt, Tse-Chun Chen

    Abstract: The immense computational cost of traditional numerical weather and climate models has sparked the development of machine learning (ML) based emulators. Because ML methods benefit from long records of training data, it is common to use datasets that are temporally subsampled relative to the time steps required for the numerical integration of differential equations. Here, we investigate how this o… ▽ More

    Submitted 21 September, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

  24. arXiv:2304.12865  [pdf, other

    cs.LG math.DS physics.geo-ph

    Constraining Chaos: Enforcing dynamical invariants in the training of recurrent neural networks

    Authors: Jason A. Platt, Stephen G. Penny, Timothy A. Smith, Tse-Chun Chen, Henry D. I. Abarbanel

    Abstract: Drawing on ergodic theory, we introduce a novel training method for machine learning based forecasting methods for chaotic dynamical systems. The training enforces dynamical invariants--such as the Lyapunov exponent spectrum and fractal dimension--in the systems of interest, enabling longer and more stable forecasts when operating with limited data. The technique is demonstrated in detail using th… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  25. arXiv:2301.03708  [pdf, ps, other

    cs.FL

    Descriptional Complexity of Finite Automata -- Selected Highlights

    Authors: Arto Salomaa, Kai Salomaa, Taylor J. Smith

    Abstract: The state complexity, respectively, nondeterministic state complexity of a regular language $L$ is the number of states of the minimal deterministic, respectively, of a minimal nondeterministic finite automaton for $L$. Some of the most studied state complexity questions deal with size comparisons of nondeterministic finite automata of differing degree of ambiguity. More generally, if for a regula… ▽ More

    Submitted 4 July, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

  26. arXiv:2212.06701  [pdf

    cs.CV cs.AI cs.GR

    A Novel Approach For Generating Customizable Light Field Datasets for Machine Learning

    Authors: Julia Huang, Toure Smith, Aloukika Patro, Vidhi Chhabra

    Abstract: To train deep learning models, which often outperform traditional approaches, large datasets of a specified medium, e.g., images, are used in numerous areas. However, for light field-specific machine learning tasks, there is a lack of such available datasets. Therefore, we create our own light field datasets, which have great potential for a variety of applications due to the abundance of informat… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 5 pages, 5 figures, accepted to and presented at MIT URTC Conference, and will be published in IEEE proceedings

    ACM Class: I.3.6

  27. How to select an objective function using information theory

    Authors: Timothy O. Hodson, Thomas M. Over, Tyler J. Smith, Lucy M. Marshall

    Abstract: In machine learning or scientific computing, model performance is measured with an objective function. But why choose one objective over another? Information theory gives one answer: To maximize the information in the model, select the objective function that represents the error in the fewest bits. To evaluate different objectives, transform them into likelihood functions. As likelihoods, their r… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 17 pages, 3 figures, 1 table

    Journal ref: Water Resources Research, 60, e2023WR035803 (2024)

  28. arXiv:2209.09313  [pdf, ps, other

    cs.DS math.NT

    Natural Wave Numbers, Natural Wave Co-numbers, and the Computation of the Primes

    Authors: Terence R. Smith

    Abstract: The paper exploits an isomorphism between the natural numbers N and a space U of periodic sequences of the roots of unity in constructing a recursive procedure for representing and computing the prime numbers. The nth wave number ${\bf u}_n$ is the countable sequence of the nth roots of unity having frequencies k/n for all integer phases k. The space U is closed under a commutative and associative… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 16 pages

  29. arXiv:2208.04933  [pdf, other

    cs.LG

    Simplified State Space Layers for Sequence Modeling

    Authors: Jimmy T. H. Smith, Andrew Warrington, Scott W. Linderman

    Abstract: Models using structured state space sequence (S4) layers have achieved state-of-the-art performance on long-range sequence modeling tasks. An S4 layer combines linear state space models (SSMs), the HiPPO framework, and deep learning to achieve high performance. We build on the design of the S4 layer and introduce a new state space layer, the S5 layer. Whereas an S4 layer uses many independent sing… ▽ More

    Submitted 3 March, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

  30. arXiv:2207.10016  [pdf, ps, other

    cs.FL

    Two-Dimensional Typewriter Automata

    Authors: Taylor J. Smith

    Abstract: A typewriter automaton is a special variant of a two-dimensional automaton that receives two-dimensional words as input and is only capable of moving its input head through its input word in three directions: downward, leftward, and rightward. In addition, downward and leftward moves may only be made via a special "reset" move that simulates the action of a typewriter's carriage return. In this… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    MSC Class: 68Q45 (primary); 68Q15; 68Q19 (secondary)

  31. arXiv:2206.14289  [pdf, other

    cs.RO

    Stronger Together: Air-Ground Robotic Collaboration Using Semantics

    Authors: Ian D. Miller, Fernando Cladera, Trey Smith, Camillo Jose Taylor, Vijay Kumar

    Abstract: In this work, we present an end-to-end heterogeneous multi-robot system framework where ground robots are able to localize, plan, and navigate in a semantic map created in real time by a high-altitude quadrotor. The ground robots choose and deconflict their targets independently, without any external intervention. Moreover, they perform cross-view localization by matching their local maps with the… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Sumbitted to RA-L and IROS

  32. arXiv:2206.08972  [pdf, other

    stat.ML cs.LG

    Shallow and Deep Nonparametric Convolutions for Gaussian Processes

    Authors: Thomas M. McDonald, Magnus Ross, Michael T. Smith, Mauricio A. Álvarez

    Abstract: A key challenge in the practical application of Gaussian processes (GPs) is selecting a proper covariance function. The moving average, or process convolutions, construction of GPs allows some additional flexibility, but still requires choosing a proper smoothing kernel, which is non-trivial. Previous approaches have built covariance functions by using GP priors over the smoothing kernel, and by e… ▽ More

    Submitted 18 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: 19 pages, 7 figures. NP-DGP results and discussion updated

  33. arXiv:2205.01988  [pdf, other

    cs.LG

    Modelling calibration uncertainty in networks of environmental sensors

    Authors: Michael Thomas Smith, Magnus Ross, Joel Ssematimba, Pablo A. Alvarado, Mauricio Alvarez, Engineer Bainomugisha, Richard Wilkinson

    Abstract: Networks of low-cost sensors are becoming ubiquitous, but often suffer from poor accuracies and drift. Regular colocation with reference sensors allows recalibration but is complicated and expensive. Alternatively the calibration can be transferred using low-cost, mobile sensors. However inferring the calibration (with uncertainty) becomes difficult. We propose a variational approach to model the… ▽ More

    Submitted 9 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 31 pages (23 pages of content, 4 pages of references, 4 supplementary). 11 figures. 4 tables. Submitted to Journal of the Royal Statistical Society. Series C

    MSC Class: 60G15

  34. Fast Hybrid Image Retargeting

    Authors: Daniel Valdez-Balderas, Oleg Muraveynyk, Timothy Smith

    Abstract: Image retargeting changes the aspect ratio of images while aiming to preserve content and minimise noticeable distortion. Fast and high-quality methods are particularly relevant at present, due to the large variety of image and display aspect ratios. We propose a retargeting method that quantifies and limits warping distortions with the use of content-aware cropping. The pipeline of the proposed a… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 5 pages

    ACM Class: I.2.10; I.4.0; I.5.4

    Journal ref: 2021 IEEE International Conference on Image Processing (ICIP), 2021, pp. 1849-1853

  35. Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR

    Authors: Ian D. Miller, Anthony Cowley, Ravi Konkimalla, Shreyas S. Shivakumar, Ty Nguyen, Trey Smith, Camillo Jose Taylor, Vijay Kumar

    Abstract: Currently, GPS is by far the most popular global localization method. However, it is not always reliable or accurate in all environments. SLAM methods enable local state estimation but provide no means of registering the local map to a global one, which can be important for inter-robot collaboration or human interaction. In this work, we present a real-time method for utilizing semantics to global… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Published in the IEEE Robotics and Automation Letters and presented at the IEEE 2021 International Conference on Robotics and Automation. See https://www.youtube.com/watch?v=_qwAoYK9iGU for accompanying video

    Journal ref: in IEEE Robotics and Automation Letters, vol. 6, no. 2, pp. 2397-2404, April 2021

  36. arXiv:2201.08910  [pdf, other

    cs.NE

    A Systematic Exploration of Reservoir Computing for Forecasting Complex Spatiotemporal Dynamics

    Authors: Jason A. Platt, Stephen G. Penny, Timothy A. Smith, Tse-Chun Chen, Henry D. I. Abarbanel

    Abstract: A reservoir computer (RC) is a type of simplified recurrent neural network architecture that has demonstrated success in the prediction of spatiotemporally chaotic dynamical systems. A further advantage of RC is that it reproduces intrinsic dynamical quantities essential for its incorporation into numerical forecasting routines such as the ensemble Kalman filter -- used in numerical weather predic… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

  37. arXiv:2201.05985  [pdf, other

    cs.SI cs.LG stat.AP

    Exposing the Obscured Influence of State-Controlled Media: A Causal Estimation of Influence Between Media Outlets Via Quotation Propagation

    Authors: Joseph Schlessinger, Richard Bennet, Jacob Coakwell, Steven T. Smith, Edward K. Kao

    Abstract: This study quantifies influence between media outlets by applying a novel methodology that uses causal effect estimation on networks and transformer language models. We demonstrate the obscured influence of state-controlled outlets over other outlets, regardless of orientation, by analyzing a large dataset of quotations from over 100 thousand articles published by the most prominent European and R… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  38. arXiv:2201.05193  [pdf, other

    cs.LG math.DS math.NA

    `Next Generation' Reservoir Computing: an Empirical Data-Driven Expression of Dynamical Equations in Time-Stepping Form

    Authors: Tse-Chun Chen, Stephen G. Penny, Timothy A. Smith, Jason A. Platt

    Abstract: Next generation reservoir computing based on nonlinear vector autoregression (NVAR) is applied to emulate simple dynamical system models and compared to numerical integration schemes such as Euler and the $2^\text{nd}$ order Runge-Kutta. It is shown that the NVAR emulator can be interpreted as a data-driven method used to recover the numerical integration scheme that produced the data. It is also… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: 12 pages, 6 figures

  39. arXiv:2112.01718  [pdf, other

    cs.LG cs.AI

    Improving Predictions of Tail-end Labels using Concatenated BioMed-Transformers for Long Medical Documents

    Authors: Vithya Yogarajan, Bernhard Pfahringer, Tony Smith, Jacob Montiel

    Abstract: Multi-label learning predicts a subset of labels from a given label set for an unseen instance while considering label correlations. A known challenge with multi-label classification is the long-tailed distribution of labels. Many studies focus on improving the overall predictions of the model and thus do not prioritise tail-end labels. Improving the tail-end label predictions in multi-label class… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

  40. arXiv:2111.01256  [pdf, other

    cs.LG

    Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems

    Authors: Jimmy T. H. Smith, Scott W. Linderman, David Sussillo

    Abstract: Recurrent neural networks (RNNs) are powerful models for processing time-series data, but it remains challenging to understand how they function. Improving this understanding is of substantial interest to both the machine learning and neuroscience communities. The framework of reverse engineering a trained RNN by linearizing around its fixed points has provided insight, but the approach has signif… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 23 pages, 9 figures

  41. Predicting COVID-19 Patient Shielding: A Comprehensive Study

    Authors: Vithya Yogarajan, Jacob Montiel, Tony Smith, Bernhard Pfahringer

    Abstract: There are many ways machine learning and big data analytics are used in the fight against the COVID-19 pandemic, including predictions, risk management, diagnostics, and prevention. This study focuses on predicting COVID-19 patient shielding -- identifying and protecting patients who are clinically extremely vulnerable from coronavirus. This study focuses on techniques used for the multi-label cla… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: Accepted in AJCAI 2021

    Journal ref: The 2021 Australasian Joint Conference on Artificial Intelligence (AJCAI 2021)

  42. arXiv:2109.12269  [pdf, other

    cs.LG cs.AI math.DS math.OC physics.geo-ph

    Integrating Recurrent Neural Networks with Data Assimilation for Scalable Data-Driven State Estimation

    Authors: Stephen G. Penny, Timothy A. Smith, Tse-Chun Chen, Jason A. Platt, Hsin-Yi Lin, Michael Goodliff, Henry D. I. Abarbanel

    Abstract: Data assimilation (DA) is integrated with machine learning in order to perform entirely data-driven online state estimation. To achieve this, recurrent neural networks (RNNs) are implemented as surrogate models to replace key components of the DA cycle in numerical weather prediction (NWP), including the conventional numerical forecast model, the forecast error covariance matrix, and the tangent l… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: 22 pages, 16 figures

  43. arXiv:2108.02955  [pdf, other

    cs.OH

    Impressions of the GDMC AI Settlement Generation Challenge in Minecraft

    Authors: Christoph Salge, Claus Aranha, Adrian Brightmoore, Sean Butler, Rodrigo Canaan, Michael Cook, Michael Cerny Green, Hagen Fischer, Christian Guckelsberger, Jupiter Hadley, Jean-Baptiste Hervé, Mark R Johnson, Quinn Kybartas, David Mason, Mike Preuss, Tristan Smith, Ruck Thawonmas, Julian Togelius

    Abstract: The GDMC AI settlement generation challenge is a PCG competition about producing an algorithm that can create an "interesting" Minecraft settlement for a given map. This paper contains a collection of written experiences with this competition, by participants, judges, organizers and advisors. We asked people to reflect both on the artifacts themselves, and on the competition in general. The aim of… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

    Comments: 28 pages, 5 figures

  44. arXiv:2106.05582  [pdf, other

    stat.ML cs.LG

    Learning Nonparametric Volterra Kernels with Gaussian Processes

    Authors: Magnus Ross, Michael T. Smith, Mauricio A. Álvarez

    Abstract: This paper introduces a method for the nonparametric Bayesian learning of nonlinear operators, through the use of the Volterra series with kernels represented using Gaussian processes (GPs), which we term the nonparametric Volterra kernels model (NVKM). When the input function to the operator is unobserved and has a GP prior, the NVKM constitutes a powerful method for both single and multiple outp… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 17 pages, 5 figures

  45. arXiv:2105.01179  [pdf, ps, other

    cs.FL

    Degrees of Restriction for Two-Dimensional Automata

    Authors: Taylor J. Smith, Kai Salomaa

    Abstract: A three-way (resp., two-way) two-dimensional automaton has a read-only input head that moves in three (resp., two) directions on a finite array of cells labelled by symbols of the input alphabet. Restricting the input head movement of a two-dimensional automaton results in a model that is weaker in terms of recognition power. In this paper, we introduce the notion of "degrees of restriction" for… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

    MSC Class: 68Q45 (primary); 68Q15 (secondary)

  46. arXiv:2104.06481  [pdf, other

    cs.SI cs.CY

    Political Polarization in Online News Consumption

    Authors: Kiran Garimella, Tim Smith, Rebecca Weiss, Robert West

    Abstract: Political polarization appears to be on the rise, as measured by voting behavior, general affect towards opposing partisans and their parties, and contents posted and consumed online. Research over the years has focused on the role of the Web as a driver of polarization. In order to further our understanding of the factors behind online polarization, in the present work we collect and analyze Web… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted at ICWSM 2021

  47. arXiv:2009.00602  [pdf, ps, other

    cs.FL

    Recognition and Complexity Results for Projection Languages of Two-Dimensional Automata

    Authors: Taylor J. Smith, Kai Salomaa

    Abstract: The row projection (resp., column projection) of a two-dimensional language $L$ is the one-dimensional language consisting of all first rows (resp., first columns) of each two-dimensional word in $L$. The operation of row projection has previously been studied under the name "frontier language", and previous work has focused on one- and two-dimensional language classes. In this paper, we study p… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    MSC Class: 68Q45 (primary); 68Q15; 68Q19 (secondary)

  48. arXiv:2008.11164  [pdf, ps, other

    cs.FL

    Concatenation Operations and Restricted Variants of Two-Dimensional Automata

    Authors: Taylor J. Smith, Kai Salomaa

    Abstract: A two-dimensional automaton operates on arrays of symbols. While a standard (four-way) two-dimensional automaton can move its input head in four directions, restricted two-dimensional automata are only permitted to move their input heads in three or two directions; these models are called three-way and two-way two-dimensional automata, respectively. In two dimensions, we may extend the notion of… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    MSC Class: 68Q45 (primary); 68R15 (secondary)

  49. arXiv:2005.10879  [pdf, other

    cs.SI cs.LG stat.AP stat.ML

    Automatic Detection of Influential Actors in Disinformation Networks

    Authors: Steven T. Smith, Edward K. Kao, Erika D. Mackin, Danelle C. Shah, Olga Simek, Donald B. Rubin

    Abstract: The weaponization of digital communications and social media to conduct disinformation campaigns at immense scale, speed, and reach presents new challenges to identify and counter hostile influence operations (IOs). This paper presents an end-to-end framework to automate detection of disinformation narratives, networks, and influential actors. The framework integrates natural language processing,… ▽ More

    Submitted 7 January, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Proc. Natl. Acad. Sciences U.S.A. Vol. 118, No. 4, e2011216118

  50. arXiv:2004.03666  [pdf, other

    cs.SE eess.SY

    Compositional Formal Analysis Based on Conventional Engineering Models

    Authors: Tyler D. Smith, Ryan Peroutka, Robert Edman

    Abstract: Applications of formal methods for state space exploration have been successfully applied to evaluate robust critical software systems. Formal methods enable discovery of error conditions that conventional testing may miss, and can aid in planning complex system operations. However, broad application of formal methods has been hampered by the effort required to generate formal specifications for r… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    ACM Class: F.0