Zum Hauptinhalt springen

Showing 1–50 of 1,881 results for author: Nathan

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17221  [pdf, other

    cs.LG math.AG

    Geometry of Lightning Self-Attention: Identifiability and Dimension

    Authors: Nathan W. Henry, Giovanni Luca Marchetti, Kathlén Kohn

    Abstract: We consider function spaces defined by self-attention networks without normalization, and theoretically analyze their geometry. Since these networks are polynomial, we rely on tools from algebraic geometry. In particular, we study the identifiability of deep attention by providing a description of the generic fibers of the parametrization for an arbitrary number of layers and, as a consequence, co… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.16201  [pdf, other

    cs.CV cs.LG

    Uni-3DAD: GAN-Inversion Aided Universal 3D Anomaly Detection on Model-free Products

    Authors: Jiayu Liu, Shancong Mou, Nathan Gaw, Yinan Wang

    Abstract: Anomaly detection is a long-standing challenge in manufacturing systems. Traditionally, anomaly detection has relied on human inspectors. However, 3D point clouds have gained attention due to their robustness to environmental factors and their ability to represent geometric data. Existing 3D anomaly detection methods generally fall into two categories. One compares scanned 3D point clouds with des… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.16092  [pdf, ps, other

    cs.DS

    When to Give Up on a Parallel Implementation

    Authors: Nathan S. Sheffield, Alek Westover

    Abstract: In the Serial Parallel Decision Problem (SPDP), introduced by Kuszmaul and Westover [SPAA'24], an algorithm receives a series of tasks online, and must choose for each between a serial implementation and a parallelizable (but less efficient) implementation. Kuszmaul and Westover describe three decision models: (1) \defn{Instantly-committing} schedulers must decide on arrival, irrevocably, which im… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 15 pages

  4. arXiv:2408.13690  [pdf, other

    cs.LG

    Understanding Uncertainty-based Active Learning Under Model Mismatch

    Authors: Amir Hossein Rahmati, Mingzhou Fan, Ruida Zhou, Nathan M. Urban, Byung-Jun Yoon, Xiaoning Qian

    Abstract: Instead of randomly acquiring training data points, Uncertainty-based Active Learning (UAL) operates by querying the label(s) of pivotal samples from an unlabeled pool selected based on the prediction uncertainty, thereby aiming at minimizing the labeling cost for model training. The efficacy of UAL critically depends on the model capacity as well as the adopted uncertainty-based acquisition funct… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  5. arXiv:2408.12065  [pdf, ps, other

    cs.AI

    Transformers As Approximations of Solomonoff Induction

    Authors: Nathan Young, Michael Witbrock

    Abstract: Solomonoff Induction is an optimal-in-the-limit unbounded algorithm for sequence prediction, representing a Bayesian mixture of every computable probability distribution and performing close to optimally in predicting any computable sequence. Being an optimal form of computational sequence prediction, it seems plausible that it may be used as a model against which other methods of sequence predi… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  6. arXiv:2408.12004  [pdf, other

    cs.LG stat.ME stat.ML

    CSPI-MT: Calibrated Safe Policy Improvement with Multiple Testing for Threshold Policies

    Authors: Brian M Cho, Ana-Roxana Pop, Kyra Gan, Sam Corbett-Davies, Israel Nir, Ariel Evnine, Nathan Kallus

    Abstract: When modifying existing policies in high-risk settings, it is often necessary to ensure with high certainty that the newly proposed policy improves upon a baseline, such as the status quo. In this work, we consider the problem of safe policy improvement, where one only adopts a new policy if it is deemed to be better than the specified baseline with at least pre-specified probability. We focus on… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  7. arXiv:2408.10417  [pdf

    cs.AI cs.CL

    Development of an AI Anti-Bullying System Using Large Language Model Key Topic Detection

    Authors: Matthew Tassava, Cameron Kolodjski, Jordan Milbrath, Adorah Bishop, Nathan Flanders, Robbie Fetsch, Danielle Hanson, Jeremy Straub

    Abstract: This paper presents and evaluates work on the development of an artificial intelligence (AI) anti-bullying system. The system is designed to identify coordinated bullying attacks via social media and other mechanisms, characterize them and propose remediation and response activities to them. In particular, a large language model (LLM) is used to populate an enhanced expert system-based network mod… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  8. arXiv:2408.10264  [pdf, other

    cs.LG cs.AI cs.IR

    OPDR: Order-Preserving Dimension Reduction for Semantic Embedding of Multimodal Scientific Data

    Authors: Chengyu Gong, Gefei Shen, Luanzheng Guo, Nathan Tallent, Dongfang Zhao

    Abstract: One of the most common operations in multimodal scientific data management is searching for the $k$ most similar items (or, $k$-nearest neighbors, KNN) from the database after being provided a new item. Although recent advances of multimodal machine learning models offer a \textit{semantic} index, the so-called \textit{embedding vectors} mapped from the original multimodal data, the dimension of t… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  9. arXiv:2408.10085  [pdf, other

    cs.LG

    MASALA: Model-Agnostic Surrogate Explanations by Locality Adaptation

    Authors: Saif Anwar, Nathan Griffiths, Abhir Bhalerao, Thomas Popham

    Abstract: Existing local Explainable AI (XAI) methods, such as LIME, select a region of the input space in the vicinity of a given input instance, for which they approximate the behaviour of a model using a simpler and more interpretable surrogate model. The size of this region is often controlled by a user-defined locality hyperparameter. In this paper, we demonstrate the difficulties associated with defin… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  10. arXiv:2408.09125  [pdf, other

    cs.LG cs.AI

    Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

    Authors: Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar

    Abstract: Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot make environmental interactions during learning. It does not have additional supplementary datasets beyond the expert's dataset nor any information a… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  11. arXiv:2408.09028  [pdf, ps, other

    cs.AI cs.RO

    On the Completeness of Conflict-Based Search: Temporally-Relative Duplicate Pruning

    Authors: Thayne T Walker, Nathan R Sturtevant

    Abstract: Conflict-Based Search (CBS) algorithm for the multi-agent pathfinding (MAPF) problem is that it is incomplete for problems which have no solution; if no mitigating procedure is run in parallel, CBS will run forever when given an unsolvable problem instance. In this work, we introduce Temporally-Relative Duplicate Pruning (TRDP), a technique for duplicate detection and removal in both classic and c… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures, 2 tables

    ACM Class: F.2.2; I.2.8

  12. arXiv:2408.08926  [pdf, other

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

    Authors: Andy K. Zhang, Neil Perry, Riya Dulepet, Eliot Jones, Justin W. Lin, Joey Ji, Celeste Menders, Gashon Hussein, Samantha Liu, Donovan Jasper, Pura Peetathawatchai, Ari Glenn, Vikram Sivashankar, Daniel Zamoshchin, Leo Glikbarg, Derek Askaryar, Mike Yang, Teddy Zhang, Rishi Alluri, Nathan Tran, Rinnara Sangpisit, Polycarpos Yiorkadjis, Kenny Osele, Gautham Raghupathi, Dan Boneh , et al. (2 additional authors not shown)

    Abstract: Language Model (LM) agents for cybersecurity that are capable of autonomously identifying vulnerabilities and executing exploits have the potential to cause real-world impact. Policymakers, model providers, and other researchers in the AI and cybersecurity communities are interested in quantifying the capabilities of such agents to help mitigate cyberrisk and investigate opportunities for penetrat… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 86 pages, 7 figures

  13. arXiv:2408.08323  [pdf, other

    cs.HC

    Exploring Urban Comfort through Novel Wearables and Environmental Surveys

    Authors: Patrick Chwalek, Sailin Zhong, Nathan Perry, Tianqi Liu, Clayton Miller, Hamed Seiied Alavi, Denis Lalanne, Joseph A. Paradiso

    Abstract: This study presents a comprehensive dataset capturing indoor environmental parameters, physiological responses, and subjective perceptions across three global cities. Utilizing wearable sensors, including smart eyeglasses, and a modified Cozie app, environmental and physiological data were collected, along with pre-screening, onboarding, and recurring surveys. Peripheral cues facilitated participa… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Submitted to Nature Scientific Data

  14. arXiv:2408.07050  [pdf, other

    cs.SD cs.CV eess.AS

    PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping

    Authors: Subash Khanal, Eric Xing, Srikumar Sastry, Aayush Dhakal, Zhexiao Xiong, Adeel Ahmad, Nathan Jacobs

    Abstract: A soundscape is defined by the acoustic environment a person perceives at a location. In this work, we propose a framework for mapping soundscapes across the Earth. Since soundscapes involve sound distributions that span varying spatial scales, we represent locations with multi-scale satellite imagery and learn a joint representation among this imagery, audio, and text. To capture the inherent unc… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: Accepted at ACM MM 2024

  15. arXiv:2408.07009  [pdf, other

    cs.CV

    Imagen 3

    Authors: Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis, Siavash Khodadadeh , et al. (227 additional authors not shown)

    Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.

    Submitted 13 August, 2024; originally announced August 2024.

  16. arXiv:2408.05968  [pdf, other

    cs.CR

    Nob-MIAs: Non-biased Membership Inference Attacks Assessment on Large Language Models with Ex-Post Dataset Construction

    Authors: Cédric Eichler, Nathan Champeil, Nicolas Anciaux, Alexandra Bensamoun, Heber Hwang Arcolezi, José Maria De Fuentes

    Abstract: The rise of Large Language Models (LLMs) has triggered legal and ethical concerns, especially regarding the unauthorized use of copyrighted materials in their training datasets. This has led to lawsuits against tech companies accused of using protected content without permission. Membership Inference Attacks (MIAs) aim to detect whether specific documents were used in a given LLM pretraining, but… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  17. arXiv:2408.03874  [pdf, other

    cs.CL

    Personalized Clinical Note Generation from Doctor-Patient Conversations

    Authors: Nathan Brake, Thomas Schaaf

    Abstract: In this work, we present a novel technique to improve the quality of draft clinical notes for physicians. This technique is concentrated on the ability to model implicit physician conversation styles and note preferences. We also introduce a novel technique for the enrollment of new physicians when a limited number of clinical notes paired with conversations are available for that physician, witho… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  18. arXiv:2408.03336  [pdf, other

    cs.NE cs.LG eess.SP

    Few-Shot Transfer Learning for Individualized Braking Intent Detection on Neuromorphic Hardware

    Authors: Nathan Lutes, Venkata Sriram Siddhardh Nadendla, K. Krishnamurthy

    Abstract: Objective: This work explores use of a few-shot transfer learning method to train and implement a convolutional spiking neural network (CSNN) on a BrainChip Akida AKD1000 neuromorphic system-on-chip for developing individual-level, instead of traditionally used group-level, models using electroencephalographic data. The efficacy of the method is studied on an advanced driver assist system related… ▽ More

    Submitted 21 July, 2024; originally announced August 2024.

    Comments: Journal of NeuroEngineering Submission

  19. arXiv:2408.02239  [pdf, ps, other

    cs.CL

    BOTS-LM: Training Large Language Models for Setswana

    Authors: Nathan Brown, Vukosi Marivate

    Abstract: In this work we present BOTS-LM, a series of bilingual language models proficient in both Setswana and English. Leveraging recent advancements in data availability and efficient fine-tuning, BOTS-LM achieves performance similar to models significantly larger than itself while maintaining computational efficiency. Our initial release features an 8 billion parameter generative large language model,… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 7 pages, 3 tables

  20. arXiv:2408.01653  [pdf, other

    cs.CV

    MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas

    Authors: Feng Qiao, Zhexiao Xiong, Xinge Zhu, Yuexin Ma, Qiumeng He, Nathan Jacobs

    Abstract: We introduce Multi-Cylindrical Panoramic Depth Estimation (MCPDepth), a two-stage framework for omnidirectional depth estimation via stereo matching between multiple cylindrical panoramas. MCPDepth uses cylindrical panoramas for initial stereo matching and then fuses the resulting depth maps across views. A circular attention module is employed to overcome the distortion along the vertical axis. M… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  21. arXiv:2407.21467  [pdf

    cs.CV cs.AI

    Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data

    Authors: Mengtian Kang, Yansong Hu, Shuo Gao, Yuanyuan Liu, Hongbei Meng, Xuemeng Li, Xuhang Chen, Hubin Zhao, Jing Fu, Guohua Hu, Wei Wang, Yanning Dai, Arokia Nathan, Peter Smielewski, Ningli Wang, Shiming Li

    Abstract: Childhood myopia constitutes a significant global health concern. It exhibits an escalating prevalence and has the potential to evolve into severe, irreversible conditions that detrimentally impact familial well-being and create substantial economic costs. Contemporary research underscores the importance of precisely predicting myopia progression to enable timely and effective interventions, there… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  22. arXiv:2407.21028  [pdf, other

    q-bio.BM cs.LG

    Antibody DomainBed: Out-of-Distribution Generalization in Therapeutic Protein Design

    Authors: Nataša Tagasovska, Ji Won Park, Matthieu Kirchmeyer, Nathan C. Frey, Andrew Martin Watkins, Aya Abdelsalam Ismail, Arian Rokkum Jamasb, Edith Lee, Tyler Bryson, Stephen Ra, Kyunghyun Cho

    Abstract: Machine learning (ML) has demonstrated significant promise in accelerating drug design. Active ML-guided optimization of therapeutic molecules typically relies on a surrogate model predicting the target property of interest. The model predictions are used to determine which designs to evaluate in the lab, and the model is updated on the new measurements to inform the next cycle of decisions. A key… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  23. arXiv:2407.18421  [pdf, other

    cs.CL cs.LG

    Self-Directed Synthetic Dialogues and Revisions Technical Report

    Authors: Nathan Lambert, Hailey Schoelkopf, Aaron Gokaslan, Luca Soldaini, Valentina Pyatkin, Louis Castricato

    Abstract: Synthetic data has become an important tool in the fine-tuning of language models to follow instructions and solve complex problems. Nevertheless, the majority of open data to date is often lacking multi-turn data and collected on closed models, limiting progress on advancing open fine-tuning methods. We introduce Self Directed Synthetic Dialogues (SDSD), an experimental dataset consisting of guid… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 25 pages, 3 figures, 4 tables

  24. arXiv:2407.17387  [pdf, other

    cs.CL

    PERSONA: A Reproducible Testbed for Pluralistic Alignment

    Authors: Louis Castricato, Nathan Lile, Rafael Rafailov, Jan-Philipp Fränken, Chelsea Finn

    Abstract: The rapid advancement of language models (LMs) necessitates robust alignment with diverse user values. However, current preference optimization approaches often fail to capture the plurality of user opinions, instead reinforcing majority viewpoints and marginalizing minority perspectives. We introduce PERSONA, a reproducible test bed designed to evaluate and improve pluralistic alignment of LMs. W… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  25. arXiv:2407.13729  [pdf, other

    cs.CL

    Baba Is AI: Break the Rules to Beat the Benchmark

    Authors: Nathan Cloos, Meagan Jens, Michelangelo Naim, Yen-Ling Kuo, Ignacio Cases, Andrei Barbu, Christopher J. Cueva

    Abstract: Humans solve problems by following existing rules and procedures, and also by leaps of creativity to redefine those rules and objectives. To probe these abilities, we developed a new benchmark based on the game Baba Is You where an agent manipulates both objects in the environment and rules, represented by movable tiles with words written on them, to reach a specified goal and win the game. We tes… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures

  26. arXiv:2407.13494  [pdf, other

    cs.SE cs.NI

    Streaming Technologies and Serialization Protocols: Empirical Performance Analysis

    Authors: Samuel Jackson, Nathan Cummings, Saiful Khan

    Abstract: Efficiently streaming high-volume data is essential for real-time data analytics, visualization, and AI and machine learning model training. Various streaming technologies and serialization protocols have been developed to meet different streaming needs. Together, they perform differently across various tasks and datasets. Therefore, when developing a streaming system, it can be challenging to mak… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  27. arXiv:2407.11927  [pdf, other

    stat.ML cs.LG stat.AP

    Bayesian Causal Forests for Longitudinal Data: Assessing the Impact of Part-Time Work on Growth in High School Mathematics Achievement

    Authors: Nathan McJames, Ann O'Shea, Andrew Parnell

    Abstract: Modelling growth in student achievement is a significant challenge in the field of education. Understanding how interventions or experiences such as part-time work can influence this growth is also important. Traditional methods like difference-in-differences are effective for estimating causal effects from longitudinal data. Meanwhile, Bayesian non-parametric methods have recently become popular… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 25 pages, 7 figures, 3 tables

  28. arXiv:2407.11283  [pdf, other

    cs.LG

    Novel Approach for Predicting the Air Quality Index of Megacities through Attention-Enhanced Deep Multitask Spatiotemporal Learning

    Authors: Harun Khan, Joseph Tso, Nathan Nguyen, Nivaan Kaushal, Ansh Malhotra, Nayel Rehman

    Abstract: Air pollution remains one of the most formidable environmental threats to human health globally, particularly in urban areas, contributing to nearly 7 million premature deaths annually. Megacities, defined as cities with populations exceeding 10 million, are frequent hotspots of severe pollution, experiencing numerous weeks of dangerously poor air quality due to the concentration of harmful pollut… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 3 tables

  29. arXiv:2407.11251  [pdf, other

    cs.RO

    Autonomous Soil Collection in Environments With Heterogeneous Terrain

    Authors: Andrew Dudash, Beyonce Andrades, Ryan Rubel, Mohammad Goli, Nathan Clark, William Ewald

    Abstract: To autonomously collect soil in uncultivated terrain, robotic arms must distinguish between different amorphous materials and submerge themselves into the correct material. We develop a prototype that collects soil in heterogeneous terrain. If mounted to a mobile robot, it can be used to perform soil collection and analysis without human intervention. Unique among soil sampling robots, we use a ge… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  30. arXiv:2407.10754  [pdf

    cs.CV

    An Autonomous Drone Swarm for Detecting and Tracking Anomalies among Dense Vegetation

    Authors: Rakesh John Amala Arokia Nathan, Sigrid Strand, Daniel Mehrwald, Dmitriy Shutin, Oliver Bimber

    Abstract: Swarms of drones offer an increased sensing aperture, and having them mimic behaviors of natural swarms enhances sampling by adapting the aperture to local conditions. We demonstrate that such an approach makes detecting and tracking heavily occluded targets practically feasible. While object classification applied to conventional aerial images generalizes poorly the randomness of occlusion and is… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  31. arXiv:2407.09672  [pdf, other

    cs.CV

    Mixed-View Panorama Synthesis using Geospatially Guided Diffusion

    Authors: Zhexiao Xiong, Xin Xing, Scott Workman, Subash Khanal, Nathan Jacobs

    Abstract: We introduce the task of mixed-view panorama synthesis, where the goal is to synthesize a novel panorama given a small set of input panoramas and a satellite image of the area. This contrasts with previous work which only uses input panoramas (same-view synthesis), or an input satellite image (cross-view synthesis). We argue that the mixed-view setting is the most natural to support panorama synth… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  32. arXiv:2407.08610  [pdf, other

    cs.SE cs.LG

    Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports

    Authors: Yanfu Yan, Nathan Cooper, Oscar Chaparro, Kevin Moran, Denys Poshyvanyk

    Abstract: Video-based bug reports are increasingly being used to document bugs for programs centered around a graphical user interface (GUI). However, developing automated techniques to manage video-based reports is challenging as it requires identifying and understanding often nuanced visual patterns that capture key information about a reported bug. In this paper, we aim to overcome these challenges by ad… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 13 pages, accepted to 46th International Conference on Software Engineering (ICSE 2024)

  33. arXiv:2407.07521  [pdf, other

    cs.LG cs.AI

    CHILLI: A data context-aware perturbation method for XAI

    Authors: Saif Anwar, Nathan Griffiths, Abhir Bhalerao, Thomas Popham

    Abstract: The trustworthiness of Machine Learning (ML) models can be difficult to assess, but is critical in high-risk or ethically sensitive applications. Many models are treated as a `black-box' where the reasoning or criteria for a final decision is opaque to the user. To address this, some existing Explainable AI (XAI) approaches approximate model behaviour using perturbed data. However, such methods ha… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  34. arXiv:2407.07059  [pdf, other

    q-bio.NC cs.LG

    Differentiable Optimization of Similarity Scores Between Models and Brains

    Authors: Nathan Cloos, Moufan Li, Markus Siegel, Scott L. Brincat, Earl K. Miller, Guangyu Robert Yang, Christopher J. Cueva

    Abstract: What metrics should guide the development of more realistic models of the brain? One proposal is to quantify the similarity between models and brains using methods such as linear regression, Centered Kernel Alignment (CKA), and angular Procrustes distance. To better understand the limitations of these similarity measures we analyze neural activity recorded in five experiments on nonhuman primates,… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 16 pages, 6 figures

  35. arXiv:2407.05622  [pdf, other

    cs.LG cs.DS

    On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries

    Authors: Nirmit Joshi, Theodor Misiakiewicz, Nathan Srebro

    Abstract: The goal of this paper is to investigate the complexity of gradient algorithms when learning sparse functions (juntas). We introduce a type of Statistical Queries ($\mathsf{SQ}$), which we call Differentiable Learning Queries ($\mathsf{DLQ}$), to model gradient queries on a specified loss with respect to an arbitrary model. We provide a tight characterization of the query complexity of… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 43 pages, 1 table, 1 figure

  36. arXiv:2407.05471  [pdf, other

    eess.AS cs.SD

    Fine-Grained and Interpretable Neural Speech Editing

    Authors: Max Morrison, Cameron Churchwell, Nathan Pruyne, Bryan Pardo

    Abstract: Fine-grained editing of speech attributes$\unicode{x2014}$such as prosody (i.e., the pitch, loudness, and phoneme durations), pronunciation, speaker identity, and formants$\unicode{x2014}$is useful for fine-tuning and fixing imperfections in human and AI-generated speech recordings for creation of podcasts, film dialogue, and video game dialogue. Existing speech synthesis systems use representatio… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Interspeech 2024

  37. arXiv:2407.04856  [pdf, other

    cs.LG cs.AI

    Explorative Imitation Learning: A Path Signature Approach for Continuous Environments

    Authors: Nathan Gavenski, Juarez Monteiro, Felipe Meneguzzi, Michael Luck, Odinaldo Rodrigues

    Abstract: Some imitation learning methods combine behavioural cloning with self-supervision to infer actions from state pairs. However, most rely on a large number of expert trajectories to increase generalisation and human intervention to capture key aspects of the problem, such as domain constraints. In this paper, we propose Continuous Imitation Learning from Observation (CILO), a new method augmenting i… ▽ More

    Submitted 22 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted in the 27th European Conference on Artificial Intelligence (ECAI) 2024

  38. arXiv:2407.04657  [pdf, ps, other

    cs.SE

    Teaching Empirical Methods at Eindhoven University of Technology

    Authors: Alexander Serebrenik, Nathan Cassee

    Abstract: In this chapter, we share an experience report of teaching a master course on empirical research methods at Eindhoven University of Technology in the Netherlands. The course is taught for ten weeks to a mix of students from different study programs and combines both practical assignments with a closed-book exam. We discuss the challenges of teaching a course on research methods and explain how we… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  39. arXiv:2407.04467  [pdf, other

    cs.AI cs.CL cs.GT

    Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games

    Authors: Nathan Herr, Fernando Acero, Roberta Raileanu, María Pérez-Ortiz, Zhibin Li

    Abstract: Large Language Models (LLMs) have been increasingly used in real-world settings, yet their strategic abilities remain largely unexplored. Game theory provides a good framework for assessing the decision-making abilities of LLMs in interactions with other agents. Although prior studies have shown that LLMs can solve these tasks with carefully curated prompts, they fail when the problem setting or p… ▽ More

    Submitted 16 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 8 pages (19 with appendix), 6 figures in the main body (4 in the appendix), 4 tables in the main body

  40. arXiv:2407.02274  [pdf, other

    cs.RO

    DextrAH-G: Pixels-to-Action Dexterous Arm-Hand Grasping with Geometric Fabrics

    Authors: Tyler Ga Wei Lum, Martin Matak, Viktor Makoviychuk, Ankur Handa, Arthur Allshire, Tucker Hermans, Nathan D. Ratliff, Karl Van Wyk

    Abstract: A pivotal challenge in robotics is achieving fast, safe, and robust dexterous grasping across a diverse range of objects, an important goal within industrial applications. However, existing methods often have very limited speed, dexterity, and generality, along with limited or no hardware safety guarantees. In this work, we introduce DextrAH-G, a depth-based dexterous grasping policy trained entir… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  41. arXiv:2407.01813  [pdf, ps, other

    math.MG cs.IT math.CO

    Optimal codes in the Stiefel manifold

    Authors: John Jasper, Nathan Mankovich, Dustin G. Mixon

    Abstract: We consider the coding problem in the Stiefel manifold with chordal distance. After considering various low-dimensional instances of this problem, we use Rankin's bounds on spherical codes to prove upper bounds on the minimum distance of a Stiefel code, and then we construct several examples of codes that achieve equality in these bounds.

    Submitted 1 July, 2024; originally announced July 2024.

  42. arXiv:2407.00236  [pdf, other

    cs.LG cs.NE

    Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms

    Authors: Samuel Stanton, Robert Alberstein, Nathan Frey, Andrew Watkins, Kyunghyun Cho

    Abstract: There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  43. arXiv:2406.19283  [pdf, other

    cs.HC

    PhysioLLM: Supporting Personalized Health Insights with Wearables and Large Language Models

    Authors: Cathy Mengying Fang, Valdemar Danry, Nathan Whitmore, Andria Bao, Andrew Hutchison, Cayden Pierce, Pattie Maes

    Abstract: We present PhysioLLM, an interactive system that leverages large language models (LLMs) to provide personalized health understanding and exploration by integrating physiological data from wearables with contextual information. Unlike commercial health apps for wearables, our system offers a comprehensive statistical analysis component that discovers correlations and trends in user data, allowing u… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  44. arXiv:2406.19188  [pdf, other

    cs.LG

    Averaging log-likelihoods in direct alignment

    Authors: Nathan Grinsztajn, Yannis Flet-Berliac, Mohammad Gheshlaghi Azar, Florian Strub, Bill Wu, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Olivier Pietquin, Matthieu Geist

    Abstract: To better align Large Language Models (LLMs) with human judgment, Reinforcement Learning from Human Feedback (RLHF) learns a reward model and then optimizes it using regularized RL. Recently, direct alignment methods were introduced to learn such a fine-tuned model directly from a preference dataset without computing a proxy reward function. These methods are built upon contrastive losses involvin… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  45. arXiv:2406.19185  [pdf, other

    cs.LG

    Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion

    Authors: Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist

    Abstract: Reinforcement Learning (RL) has been used to finetune Large Language Models (LLMs) using a reward model trained from preference data, to better align with human judgment. The recently introduced direct alignment methods, which are often simpler, more stable, and computationally lighter, can more directly achieve this. However, these approaches cannot optimize arbitrary rewards, and the preference-… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  46. arXiv:2406.18670  [pdf, other

    cs.DS cs.DM math.OC

    Generalized Cuts and Grothendieck Covers: a Primal-Dual Approximation Framework Extending the Goemans--Williamson Algorithm

    Authors: Nathan Benedetto Proença, Marcel K. de Carli Silva, Cristiane M. Sato, Levent Tunçel

    Abstract: We provide a primal-dual framework for randomized approximation algorithms utilizing semidefinite programming (SDP) relaxations. Our framework pairs a continuum of APX-complete problems including MaxCut, Max2Sat, MaxDicut, and more generally, Max-Boolean Constraint Satisfaction and MaxQ (maximization of a positive semidefinite quadratic form over the hypercube) with new APX-complete problems which… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  47. arXiv:2406.18495  [pdf, other

    cs.CL

    WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

    Authors: Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri

    Abstract: We introduce WildGuard -- an open, light-weight moderation tool for LLM safety that achieves three goals: (1) identifying malicious intent in user prompts, (2) detecting safety risks of model responses, and (3) determining model refusal rate. Together, WildGuard serves the increasing needs for automatic safety moderation and evaluation of LLM interactions, providing a one-stop tool with enhanced a… ▽ More

    Submitted 9 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: First two authors contributed equally. Third and fourth authors contributed equally

  48. arXiv:2406.17119  [pdf, other

    cs.CE cs.CV cs.LG math.NA

    Accelerating Phase Field Simulations Through a Hybrid Adaptive Fourier Neural Operator with U-Net Backbone

    Authors: Christophe Bonneville, Nathan Bieberdorf, Arun Hegde, Mark Asta, Habib N. Najm, Laurent Capolungo, Cosmin Safta

    Abstract: Prolonged contact between a corrosive liquid and metal alloys can cause progressive dealloying. For such liquid-metal dealloying (LMD) process, phase field models have been developed. However, the governing equations often involve coupled non-linear partial differential equations (PDE), which are challenging to solve numerically. In particular, stiffness in the PDEs requires an extremely small tim… ▽ More

    Submitted 8 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  49. arXiv:2406.17038  [pdf, other

    cs.CL

    modeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models

    Authors: Nathan A. Chi, Teodor Malchev, Riley Kong, Ryan A. Chi, Lucas Huang, Ethan A. Chi, R. Thomas McCoy, Dragomir Radev

    Abstract: We introduce modeLing, a novel benchmark of Linguistics Olympiad-style puzzles which tests few-shot reasoning in AI systems. Solving these puzzles necessitates inferring aspects of a language's grammatical structure from a small number of examples. Such puzzles provide a natural testbed for language models, as they require compositional generalization and few-shot inductive reasoning. Consisting s… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  50. arXiv:2406.16896  [pdf, other

    eess.SP cs.LG

    f-GAN: A frequency-domain-constrained generative adversarial network for PPG to ECG synthesis

    Authors: Nathan C. L. Kong, Dae Lee, Huyen Do, Dae Hoon Park, Cong Xu, Hongda Mao, Jonathan Chung

    Abstract: Electrocardiograms (ECGs) and photoplethysmograms (PPGs) are generally used to monitor an individual's cardiovascular health. In clinical settings, ECGs and fingertip PPGs are the main signals used for assessing cardiovascular health, but the equipment necessary for their collection precludes their use in daily monitoring. Although PPGs obtained from wrist-worn devices are susceptible to noise due… ▽ More

    Submitted 15 May, 2024; originally announced June 2024.