Zum Hauptinhalt springen

Showing 1–50 of 329 results for author: Owen

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.03764  [pdf, other

    cs.CV eess.IV

    Modeling Human Strategy for Flattening Wrinkled Cloth Using Neural Networks

    Authors: Nilay Kant, Ashrut Aryal, Rajiv Ranganathan, Ranjan Mukherjee, Charles Owen

    Abstract: This paper explores a novel approach to model strategies for flattening wrinkled cloth learning from humans. A human participant study was conducted where the participants were presented with various wrinkle types and tasked with flattening the cloth using the fewest actions possible. A camera and Aruco marker were used to capture images of the cloth and finger movements, respectively. The human s… ▽ More

    Submitted 19 August, 2024; originally announced September 2024.

    Comments: 6 Pages

  2. arXiv:2409.02823  [pdf, other

    cs.HC

    Design Contradictions: Help or Hindrance?

    Authors: Aron E. Owen, Jonathan C. Roberts

    Abstract: The need for innovative ideas in data visualisation drives us to explore new creative approaches. Combining two or more creative words, particularly those that contradict each other, can positively impact the creative process, sparking novel ideas and designs. As we move towards AI-driven design, an open question arises: do these design contradictions work positively with AI tools? Currently, the… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  3. arXiv:2409.02036  [pdf, other

    cs.HC

    Towards Metrics for Evaluating Creativity in Visualisation Design

    Authors: Aron E Owen, Jonathan C Roberts

    Abstract: Creativity in visualisation design is essential for designers and data scientists who need to present data in innovative ways. It is often achieved through sketching or drafting low-fidelity prototypes. However, judging this innovation is often difficult. A creative visualisation test would offer a structured approach to enhancing visual thinking and design skills, which are vital across many fiel… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  4. arXiv:2409.01969  [pdf, other

    q-bio.NC cond-mat.dis-nn cs.NE

    Connectivity structure and dynamics of nonlinear recurrent neural networks

    Authors: David G. Clark, Owen Marschall, Alexander van Meegen, Ashok Litwin-Kumar

    Abstract: We develop a theory to analyze how structure in connectivity shapes the high-dimensional, internally generated activity of nonlinear recurrent neural networks. Using two complementary methods -- a path-integral calculation of fluctuations around the saddle point, and a recently introduced two-site cavity approach -- we derive analytic expressions that characterize important features of collective… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 35 pages, 11 figures

  5. arXiv:2409.01283  [pdf, other

    cs.HC

    Towards a Generative AI Design Dialogue

    Authors: Aron E. Owen, Jonathan C. Roberts

    Abstract: Traditional visualisation designers often start with sketches before implementation. With generative AI, these sketches can be turned into AI-generated visualisations using specific prompts. However, guiding AI to create compelling visuals can be challenging. We propose a new design process where designers verbalise their thoughts during work, later converting these narratives into AI prompts. Thi… ▽ More

    Submitted 19 August, 2024; originally announced September 2024.

  6. arXiv:2408.15116  [pdf, other

    cs.AI

    Evaluating Stability of Unreflective Alignment

    Authors: James Lucassen, Mark Henry, Philippa Wright, Owen Yeung

    Abstract: Many theoretical obstacles to AI alignment are consequences of reflective stability - the problem of designing alignment mechanisms that the AI would not disable if given the option. However, problems stemming from reflective stability are not obviously present in current LLMs, leading to disagreement over whether they will need to be solved to enable safe delegation of cognitive labor. In this pa… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  7. arXiv:2408.10439  [pdf, other

    cs.HC cs.GR

    Visual Storytelling: A Methodological Approach to Designing and Implementing a Visualisation Poster

    Authors: Rhiannon Owen, Jonathan Roberts

    Abstract: We present a design study of developing a visualisation poster. Posters can be difficult to create, and the story on a poster is not always clear. Using a case-study approach we propose three important aspects: the poster should have a clear focus (especially a hero visualisation), envisioning its use helps to drive the important aspects, and third the essence (its fundamental concept and guiding… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 5 pages, 1 figure, accepted for publication to the EG UK Computer Graphics & Visual Computing (CGVC) 2024

    ACM Class: I.3.8; K.3.0

  8. arXiv:2408.06507  [pdf

    cs.CV cs.AI

    Benchmarking tree species classification from proximally-sensed laser scanning data: introducing the FOR-species20K dataset

    Authors: Stefano Puliti, Emily R. Lines, Jana Müllerová, Julian Frey, Zoe Schindler, Adrian Straker, Matthew J. Allen, Lukas Winiwarter, Nataliia Rehush, Hristina Hristova, Brent Murray, Kim Calders, Louise Terryn, Nicholas Coops, Bernhard Höfle, Samuli Junttila, Martin Krůček, Grzegorz Krok, Kamil Král, Shaun R. Levick, Linda Luck, Azim Missarov, Martin Mokroš, Harry J. F. Owen, Krzysztof Stereńczak , et al. (8 additional authors not shown)

    Abstract: Proximally-sensed laser scanning offers significant potential for automated forest data capture, but challenges remain in automatically identifying tree species without additional ground data. Deep learning (DL) shows promise for automation, yet progress is slowed by the lack of large, diverse, openly available labeled datasets of single tree point clouds. This has impacted the robustness of DL mo… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  9. arXiv:2408.03319  [pdf, other

    cs.CL cs.AI

    Training LLMs to Recognize Hedges in Spontaneous Narratives

    Authors: Amie J. Paige, Adil Soubki, John Murzaku, Owen Rambow, Susan E. Brennan

    Abstract: Hedges allow speakers to mark utterances as provisional, whether to signal non-prototypicality or "fuzziness", to indicate a lack of commitment to an utterance, to attribute responsibility for a statement to someone else, to invite input from a partner, or to soften critical feedback in the service of face-management needs. Here we focus on hedges in an experimentally parameterized corpus of 63 Ro… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Amie Paige, Adil Soubki, and John Murzaku contributed equally to this study

    ACM Class: I.2.7

    Journal ref: SIGDIAL 2024

  10. arXiv:2408.02798  [pdf, other

    cs.CL cs.AI cs.LG

    Examining Gender and Power on Wikipedia Through Face and Politeness

    Authors: Adil Soubki, Shyne Choi, Owen Rambow

    Abstract: We propose a framework for analyzing discourse by combining two interdependent concepts from sociolinguistic theory: face acts and politeness. While politeness has robust existing tools and data, face acts are less resourced. We introduce a new corpus created by annotating Wikipedia talk pages with face acts and we use this to train a face act tagger. We then employ our framework to study how face… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Journal ref: SIGDIAL 2024

  11. arXiv:2407.18712  [pdf, other

    cs.AI cs.CL cs.LG

    Cluster-norm for Unsupervised Probing of Knowledge

    Authors: Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen, Yeung, Kaarel Hänni

    Abstract: The deployment of language models brings challenges in generating reliable information, especially when these models are fine-tuned using human preferences. To extract encoded knowledge without (potentially) biased human labels, unsupervised probing techniques like Contrast-Consistent Search (CCS) have been developed (Burns et al., 2022). However, salient but unrelated features in a given dataset… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 34 pages, 35 figures

  12. arXiv:2407.16593  [pdf, other

    cs.CL cs.AI

    A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions

    Authors: Giorgos Lysandrou, Roma English Owen, Vanja Popovic, Grant Le Brun, Aryo Pradipta Gema, Beatrice Alex, Elizabeth A. L. Fairley

    Abstract: There exists an invisible barrier between healthcare professionals' perception of a patient's clinical experience and the reality. This barrier may be induced by the environment that hinders patients from sharing their experiences openly with healthcare professionals. As patients are observed to discuss and exchange knowledge more candidly on social media, valuable insights can be leveraged from t… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 14 pages, 4 figures, 5 tables, funded by Talking Medicines Limited

  13. arXiv:2407.11894  [pdf, other

    cs.LG math.NA stat.ML

    Deep Learning without Global Optimization by Random Fourier Neural Networks

    Authors: Owen Davis, Gianluca Geraci, Mohammad Motamed

    Abstract: We introduce a new training algorithm for variety of deep neural networks that utilize random complex exponential activation functions. Our approach employs a Markov Chain Monte Carlo sampling procedure to iteratively train network layers, avoiding global and gradient-based optimization while maintaining error control. It consistently attains the theoretical approximation rate for residual network… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    MSC Class: 65T40; 90C15; 65C05; 65C40; 60J22; 68T07

  14. arXiv:2407.08162  [pdf, other

    cs.CV cs.RO

    Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates

    Authors: Owen Claxton, Connor Malone, Helen Carson, Jason Ford, Gabe Bolton, Iman Shames, Michael Milford

    Abstract: Visual Place Recognition (VPR) systems often have imperfect performance, which affects robot navigation decisions. This research introduces a novel Multi-Layer Perceptron (MLP) integrity monitor for VPR which demonstrates improved performance and generalizability over the previous state-of-the-art SVM approach, removing per-environment training and reducing manual tuning requirements. We test our… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Currently Under Review

  15. arXiv:2407.05206  [pdf, other

    cs.CV cs.HC cs.LG

    Helios: An extremely low power event-based gesture recognition for always-on smart eyewear

    Authors: Prarthana Bhattacharyya, Joshua Mitton, Ryan Page, Owen Morgan, Ben Menzies, Gabriel Homewood, Kemi Jacobs, Paolo Baesso, David Trickett, Chris Mair, Taru Muhonen, Rory Clark, Louis Berridge, Richard Vigars, Iain Wallace

    Abstract: This paper introduces Helios, the first extremely low-power, real-time, event-based hand gesture recognition system designed for all-day on smart eyewear. As augmented reality (AR) evolves, current smart glasses like the Meta Ray-Bans prioritize visual and wearable comfort at the expense of functionality. Existing human-machine interfaces (HMIs) in these devices, such as capacitive touch and voice… ▽ More

    Submitted 26 August, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV-Integrating Computer Vision in Smart Eyewear, 2024. 18 pages, 10 figures. First three authors contributed equally to this paper

  16. arXiv:2407.04915  [pdf, other

    cs.HC

    Safe Generative Chats in a WhatsApp Intelligent Tutoring System

    Authors: Zachary Levonian, Owen Henkel

    Abstract: Large language models (LLMs) are flexible, personalizable, and available, which makes their use within Intelligent Tutoring Systems (ITSs) appealing. However, that flexibility creates risks: inaccuracies, harmful content, and non-curricular material. Ethically deploying LLM-backed ITS systems requires designing safeguards that ensure positive experiences for students. We describe the design of a c… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: EDM 2024 LLM Workshop

  17. arXiv:2407.04153  [pdf, other

    cs.LG cs.AI

    Mixture of A Million Experts

    Authors: Xu Owen He

    Abstract: The feedforward (FFW) layers in standard transformer architectures incur a linear increase in computational costs and activation memory as the hidden layer width grows. Sparse mixture-of-experts (MoE) architectures have emerged as a viable approach to address this issue by decoupling model size from computational cost. The recent discovery of the fine-grained MoE scaling law shows that higher gran… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  18. arXiv:2406.15649   

    cs.CV

    Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe

    Authors: Sandeep Singh Sengar, Abhishek Kumar, Owen Singh

    Abstract: This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic m… ▽ More

    Submitted 13 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: There is an error in this work. BY mistake in Section 3.3, the angle is calculated wrongly

  19. arXiv:2406.15646  [pdf, other

    cs.CV

    VigilEye -- Artificial Intelligence-based Real-time Driver Drowsiness Detection

    Authors: Sandeep Singh Sengar, Aswin Kumar, Owen Singh

    Abstract: This study presents a novel driver drowsiness detection system that combines deep learning techniques with the OpenCV framework. The system utilises facial landmarks extracted from the driver's face as input to Convolutional Neural Networks trained to recognise drowsiness patterns. The integration of OpenCV enables real-time video processing, making the system suitable for practical implementation… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  20. arXiv:2406.15317  [pdf, other

    math.CO cs.DM math.MG

    Diverse beam search to find densest-known planar unit distance graphs

    Authors: Peter Engel, Owen Hammond-Lee, Yiheng Su, Dániel Varga, Pál Zsámboki

    Abstract: This paper addresses the problem of determining the maximum number of edges in a unit distance graph (UDG) of $n$ vertices using computer search. An unsolved problem of Paul Erdős asks the maximum number of edges $u(n)$ a UDG of $n$ vertices can have. Those UDGs that attain $u(n)$ are called "maximally dense." In this paper, we seek to demonstrate a computer algorithm to generate dense UDGs for ve… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  21. arXiv:2406.13961  [pdf, other

    cs.LG cs.RO

    Equivariant Offline Reinforcement Learning

    Authors: Arsh Tangri, Ondrej Biza, Dian Wang, David Klee, Owen Howell, Robert Platt

    Abstract: Sample efficiency is critical when applying learning-based methods to robotic manipulation due to the high cost of collecting expert demonstrations and the challenges of on-robot policy learning through online Reinforcement Learning (RL). Offline RL addresses this issue by enabling policy learning from an offline dataset collected using any behavioral policy, regardless of its quality. However, re… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  22. arXiv:2406.12800  [pdf, other

    cs.CR

    Supporting Human Raters with the Detection of Harmful Content using Large Language Models

    Authors: Kurt Thomas, Patrick Gage Kelley, David Tao, Sarah Meiklejohn, Owen Vallis, Shunwen Tan, Blaž Bratanič, Felipe Tiengo Ferreira, Vijay Kumar Eranti, Elie Bursztein

    Abstract: In this paper, we explore the feasibility of leveraging large language models (LLMs) to automate or otherwise assist human raters with identifying harmful content including hate speech, harassment, violent extremism, and election misinformation. Using a dataset of 50,000 comments, we demonstrate that LLMs can achieve 90% accuracy when compared to human verdicts. We explore how to best leverage the… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  23. arXiv:2406.12131  [pdf, other

    cs.CL

    Gram2Vec: An Interpretable Document Vectorizer

    Authors: Peter Zeng, Eric Sclafani, Owen Rambow

    Abstract: We present Gram2Vec, a grammatical style embedding algorithm that embeds documents into a higher dimensional space by extracting the normalized relative frequencies of grammatical features present in the text. Compared to neural approaches, Gram2Vec offers inherent interpretability based on how the feature vectors are generated. In our demo, we present a way to visualize a mapping of authors to do… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  24. arXiv:2406.10786  [pdf, other

    cs.AI cs.CL

    Evaluating LLMs with Multiple Problems at once: A New Paradigm for Probing LLM Capabilities

    Authors: Zhengxiang Wang, Jordan Kodner, Owen Rambow

    Abstract: Current LLM evaluation predominantly performs evaluation with prompts comprising single problems. We propose multi-problem evaluation as an additional approach to study the multiple problem handling capabilities of LLMs. We present a systematic study in this regard by comprehensively examining 7 LLMs on 4 related types of tasks constructed from 6 classification benchmarks. The 4 task types include… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 20 pages, 15 figures, 9 tables

  25. arXiv:2406.07466  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Multimodal Belief Prediction

    Authors: John Murzaku, Adil Soubki, Owen Rambow

    Abstract: Recognizing a speaker's level of commitment to a belief is a difficult task; humans do not only interpret the meaning of the words in context, but also understand cues from intonation and other aspects of the audio signal. Many papers and corpora in the NLP community have approached the belief prediction task using text-only approaches. We are the first to frame and present results on the multimod… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: John Murzaku and Adil Soubki contributed equally to this work

    Journal ref: Interspeech 2024

  26. arXiv:2406.07263  [pdf, other

    cs.LG q-bio.QM stat.ML

    Active learning for affinity prediction of antibodies

    Authors: Alexandra Gessner, Sebastian W. Ober, Owen Vickery, Dino Oglić, Talip Uçar

    Abstract: The primary objective of most lead optimization campaigns is to enhance the binding affinity of ligands. For large molecules such as antibodies, identifying mutations that enhance antibody affinity is particularly challenging due to the combinatorial explosion of potential mutations. When the structure of the antibody-antigen complex is available, relative binding free energy (RBFE) methods can of… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  27. arXiv:2406.06576  [pdf, other

    cs.CL cs.AI cs.LG

    OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step

    Authors: Owen Dugan, Donato Manuel Jimenez Beneto, Charlotte Loh, Zhuo Chen, Rumen Dangovski, Marin Soljačić

    Abstract: Despite significant advancements in text generation and reasoning, Large Language Models (LLMs) still face challenges in accurately performing complex arithmetic operations. Language model systems often enable LLMs to generate code for arithmetic operations to achieve accurate calculations. However, this approach compromises speed and security, and fine-tuning risks the language model losing prior… ▽ More

    Submitted 2 September, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  28. arXiv:2406.04109  [pdf, other

    cs.CL

    Intention and Face in Dialog

    Authors: Adil Soubki, Owen Rambow

    Abstract: The notion of face described by Brown and Levinson (1987) has been studied in great detail, but a critical aspect of the framework, that which focuses on how intentions mediate the planning of turns which impose upon face, has received far less attention. We present an analysis of three computational systems trained for classifying both intention and politeness, focusing on how the former influenc… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: May 2024. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 9143-9153, Torino, Italia. ELRA and ICCL

  29. arXiv:2406.03476  [pdf, other

    cs.LG cs.CL

    Does your data spark joy? Performance gains from domain upsampling at the end of training

    Authors: Cody Blakeney, Mansheej Paul, Brett W. Larsen, Sean Owen, Jonathan Frankle

    Abstract: Pretraining datasets for large language models (LLMs) have grown to trillions of tokens composed of large amounts of CommonCrawl (CC) web scrape along with smaller, domain-specific datasets. It is expensive to understand the impact of these domain-specific datasets on model capabilities as training at large FLOP scales is required to reveal significant changes to difficult and emergent benchmarks.… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: The first three authors contributed equally

  30. arXiv:2406.00132  [pdf, other

    cs.LG quant-ph

    QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation

    Authors: Zhuo Chen, Rumen Dangovski, Charlotte Loh, Owen Dugan, Di Luo, Marin Soljačić

    Abstract: We propose Quantum-informed Tensor Adaptation (QuanTA), a novel, easy-to-implement, fine-tuning method with no inference overhead for large-scale pre-trained language models. By leveraging quantum-inspired methods derived from quantum circuit structures, QuanTA enables efficient high-rank fine-tuning, surpassing the limitations of Low-Rank Adaptation (LoRA)--low-rank approximation may fail for com… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  31. arXiv:2405.21015  [pdf, other

    cs.CY

    The rising costs of training frontier AI models

    Authors: Ben Cottier, Robi Rahman, Loredana Fattorini, Nestor Maslej, David Owen

    Abstract: The costs of training frontier AI models have grown dramatically in recent years, but there is limited public data on the magnitude and growth of these expenses. This paper develops a detailed cost model to address this gap, estimating training costs using three approaches that account for hardware, energy, cloud rental, and staff expenses. The analysis reveals that the amortized cost to train the… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  32. arXiv:2405.18600  [pdf, other

    cs.RO cs.AR eess.SY

    OpenConvoy: Universal Platform for Real-World Testing of Cooperative Driving Systems

    Authors: Owen Burns, Hossein Maghsoumi, Yaser Fallah, Israel Charles

    Abstract: Cooperative driving, enabled by communication between automated vehicle systems, promises significant benefits to fuel efficiency, road capacity, and safety over single-vehicle driver assistance systems such as adaptive cruise control (ACC). However, the responsible development and implementation of these algorithms poses substantial challenges due to the need for extensive real-world testing. We… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 7 pages, 8 figures

  33. arXiv:2405.17813  [pdf, other

    cs.IR

    The Impacts of Data, Ordering, and Intrinsic Dimensionality on Recall in Hierarchical Navigable Small Worlds

    Authors: Owen Pendrigh Elliott, Jesse Clark

    Abstract: Vector search systems, pivotal in AI applications, often rely on the Hierarchical Navigable Small Worlds (HNSW) algorithm. However, the behaviour of HNSW under real-world scenarios using vectors generated with deep learning models remains under-explored. Existing Approximate Nearest Neighbours (ANN) benchmarks and research typically has an over-reliance on simplistic datasets like MNIST or SIFT1M… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 15 pages, 2 figures

  34. arXiv:2405.15537  [pdf, ps, other

    cs.CR cs.AR cs.ET

    Do Not Trust Power Management: Challenges and Hints for Securing Future Trusted Execution Environments

    Authors: Owen Le Gonidec, Maria Méndez Real, Guillaume Bouffard, Jean-Christophe Prévotet

    Abstract: Over the past few years, several research groups have introduced innovative hardware designs for Trusted Execution Environments (TEEs), aiming to secure applications against potentially compromised privileged software, including the kernel. Since 2017, Tang et al. introduced a new class of software-enabled hardware attacks, which leverages energy management mechanisms. These attacks aim at bypassi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  35. arXiv:2405.13640  [pdf, other

    cs.CL cs.AI cs.LG

    Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

    Authors: Ying Ma, Owen Burns, Mingqiu Wang, Gang Li, Nan Du, Laurent El Shafey, Liqiang Wang, Izhak Shafran, Hagen Soltau

    Abstract: Reinforcement learning (RL) is an effective method of finding reasoning pathways in incomplete knowledge graphs (KGs). To overcome the challenges of a large action space, a self-supervised pre-training method is proposed to warm up the policy network before the RL training stage. To alleviate the distributional mismatch issue in general self-supervised RL (SSRL), in our supervised learning (SL) st… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 11 figures

  36. arXiv:2405.06727  [pdf, other

    stat.ML cs.LG

    Approximation Error and Complexity Bounds for ReLU Networks on Low-Regular Function Spaces

    Authors: Owen Davis, Gianluca Geraci, Mohammad Motamed

    Abstract: In this work, we consider the approximation of a large class of bounded functions, with minimal regularity assumptions, by ReLU neural networks. We show that the approximation error can be bounded from above by a quantity proportional to the uniform norm of the target function and inversely proportional to the product of network width and depth. We inherit this approximation error bound from Fouri… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    MSC Class: 41A25; 41A30; 41A46; 68T07

  37. arXiv:2405.05594  [pdf, other

    cs.AI

    Expected Work Search: Combining Win Rate and Proof Size Estimation

    Authors: Owen Randall, Martin Müller, Ting Han Wei, Ryan Hayward

    Abstract: We propose Expected Work Search (EWS), a new game solving algorithm. EWS combines win rate estimation, as used in Monte Carlo Tree Search, with proof size estimation, as used in Proof Number Search. The search efficiency of EWS stems from minimizing a novel notion of Expected Work, which predicts the expected computation required to solve a position. EWS outperforms traditional solving algorithms… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  38. arXiv:2405.04288  [pdf, other

    eess.IV cs.CV cs.LG

    BetterNet: An Efficient CNN Architecture with Residual Learning and Attention for Precision Polyp Segmentation

    Authors: Owen Singh, Sandeep Singh Sengar

    Abstract: Colorectal cancer contributes significantly to cancer-related mortality. Timely identification and elimination of polyps through colonoscopy screening is crucial in order to decrease mortality rates. Accurately detecting polyps in colonoscopy images is difficult because of the differences in characteristics such as size, shape, texture, and similarity to surrounding tissues. Current deep-learning… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  39. arXiv:2405.02985  [pdf

    cs.CL cs.AI

    Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability to Mark Short Answer Questions in K-12 Education

    Authors: Owen Henkel, Adam Boxer, Libby Hills, Bill Roberts

    Abstract: This paper presents reports on a series of experiments with a novel dataset evaluating how well Large Language Models (LLMs) can mark (i.e. grade) open text responses to short answer questions, Specifically, we explore how well different combinations of GPT version and prompt engineering strategies performed at marking real student answers to short answer across different domain areas (Science and… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  40. arXiv:2404.16767  [pdf, other

    cs.LG cs.CL cs.CV

    REBEL: Reinforcement Learning via Regressing Relative Rewards

    Authors: Zhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun

    Abstract: While originally developed for continuous control problems, Proximal Policy Optimization (PPO) has emerged as the work-horse of a variety of reinforcement learning (RL) applications, including the fine-tuning of generative models. Unfortunately, PPO requires multiple heuristics to enable stable convergence (e.g. value networks, clipping), and is notorious for its sensitivity to the precise impleme… ▽ More

    Submitted 1 September, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: New experimental results on general chat

  41. arXiv:2404.11525  [pdf, other

    cs.CV eess.IV

    JointViT: Modeling Oxygen Saturation Levels with Joint Supervision on Long-Tailed OCTA

    Authors: Zeyu Zhang, Xuyin Qi, Mingxi Chen, Guangxi Li, Ryan Pham, Ayub Qassim, Ella Berry, Zhibin Liao, Owen Siggs, Robert Mclaughlin, Jamie Craig, Minh-Son To

    Abstract: The oxygen saturation level in the blood (SaO2) is crucial for health, particularly in relation to sleep-related breathing disorders. However, continuous monitoring of SaO2 is time-consuming and highly variable depending on patients' conditions. Recently, optical coherence tomography angiography (OCTA) has shown promising development in rapidly and effectively screening eye-related lesions, offeri… ▽ More

    Submitted 28 July, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted to MIUA 2024 Oral

  42. arXiv:2404.08495  [pdf, other

    cs.LG cs.AI cs.CL

    Dataset Reset Policy Optimization for RLHF

    Authors: Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

    Abstract: Reinforcement Learning (RL) from Human Preference-based feedback is a popular paradigm for fine-tuning generative models, which has produced impressive models such as GPT-4 and Claude3 Opus. This framework often consists of two steps: learning a reward model from an offline preference dataset followed by running online RL to optimize the learned reward model. In this work, leveraging the idea of r… ▽ More

    Submitted 16 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 28 pages, 6 tables, 3 Figures, 3 Algorithms

  43. arXiv:2404.04837  [pdf, other

    cs.LO cs.PL

    GATlab: Modeling and Programming with Generalized Algebraic Theories

    Authors: Owen Lynch, Kris Brown, James Fairbanks, Evan Patterson

    Abstract: Categories and categorical structures are increasingly recognized as useful abstractions for modeling in science and engineering. To uniformly implement category-theoretic mathematical models in software, we introduce GATlab, a domain-specific language for algebraic specification embedded in a technical programming language. GATlab is based on generalized algebraic theories (GATs), a logical syste… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: 14 pages plus references and appendix. To appear at MFPS 2024

  44. arXiv:2404.03673  [pdf, other

    cs.CV cs.AI cs.LG

    RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

    Authors: Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun

    Abstract: Reinforcement learning (RL) has improved guided image generation with diffusion models by directly optimizing rewards that capture image quality, aesthetics, and instruction following capabilities. However, the resulting generative policies inherit the same iterative sampling process of diffusion models that causes slow generation. To overcome this limitation, consistency models proposed learning… ▽ More

    Submitted 22 June, 2024; v1 submitted 25 March, 2024; originally announced April 2024.

    Comments: 18 pages, 9 figures, 1 table

  45. arXiv:2403.19851  [pdf, other

    cs.CL cs.CR cs.LG stat.ML

    Localizing Paragraph Memorization in Language Models

    Authors: Niklas Stoehr, Mitchell Gordon, Chiyuan Zhang, Owen Lewis

    Abstract: Can we localize the weights and mechanisms used by a language model to memorize and recite entire paragraphs of its training data? In this paper, we show that while memorization is spread across multiple layers and model components, gradients of memorized paragraphs have a distinguishable spatial pattern, being larger in lower model layers than gradients of non-memorized examples. Moreover, the me… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  46. arXiv:2403.14100  [pdf, other

    cs.AI

    Causal knowledge engineering: A case study from COVID-19

    Authors: Steven Mascaro, Yue Wu, Ross Pearson, Owen Woodberry, Jessica Ramsay, Tom Snelling, Ann E. Nicholson

    Abstract: COVID-19 appeared abruptly in early 2020, requiring a rapid response amid a context of great uncertainty. Good quality data and knowledge was initially lacking, and many early models had to be developed with causal assumptions and estimations built in to supplement limited data, often with no reliable approach for identifying, validating and documenting these causal assumptions. Our team embarked… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 22 pages (plus 19 pages in appendices), 9 figures, submitted for review

  47. arXiv:2403.09362  [pdf, other

    cs.CL

    Komodo: A Linguistic Expedition into Indonesia's Regional Languages

    Authors: Louis Owen, Vishesh Tripathi, Abhay Kumar, Biddwan Ahmed

    Abstract: The recent breakthroughs in Large Language Models (LLMs) have mostly focused on languages with easily available and sufficient resources, such as English. However, there remains a significant gap for languages that lack sufficient linguistic resources in the public domain. Our work introduces Komodo-7B, 7-billion-parameter Large Language Models designed to address this gap by seamlessly operating… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 30 Pages, 8 Figures, 4 Tables

  48. Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs

    Authors: Herbert Owen, Dominik Ernst, Thomas Gruber, Oriol Lemkuhl, Guillaume Houzeaux, Lucas Gasparino, Gerhard Wellein

    Abstract: This paper addresses the challenge of providing portable and highly efficient code structures for CPU and GPU architectures. We choose the assembly of the right-hand term in the incompressible flow module of the High-Performance Computational Mechanics code Alya, which is one of the two CFD codes in the Unified European Benchmark Suite. Starting from an efficient CPU-code and a related OpenACC-por… ▽ More

    Submitted 22 January, 2024; originally announced March 2024.

  49. arXiv:2403.08127  [pdf

    cs.DB physics.data-an stat.OT

    Guidelines for the Creation of Analysis Ready Data

    Authors: Harriette Phillips, Aiden Price, Owen Forbes, Claire Boulange, Kerrie Mengersen, Marketa Reeves, Rebecca Glauert

    Abstract: Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform te… ▽ More

    Submitted 29 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 49 pages, 3 figures, 3 tables, and 5 appendices

  50. arXiv:2403.05812  [pdf, other

    cs.CL cs.AI

    Algorithmic progress in language models

    Authors: Anson Ho, Tamay Besiroglu, Ege Erdil, David Owen, Robi Rahman, Zifan Carl Guo, David Atkinson, Neil Thompson, Jaime Sevilla

    Abstract: We investigate the rate at which algorithms for pre-training language models have improved since the advent of deep learning. Using a dataset of over 200 language model evaluations on Wikitext and Penn Treebank spanning 2012-2023, we find that the compute required to reach a set performance threshold has halved approximately every 8 months, with a 95% confidence interval of around 5 to 14 months,… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.