Zum Hauptinhalt springen

Showing 1–50 of 94 results for author: Jones, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08853  [pdf, other

    cs.HC cs.CL

    GPT-4 is judged more human than humans in displaced and inverted Turing tests

    Authors: Ishika Rathi, Sydney Taylor, Benjamin K. Bergen, Cameron R. Jones

    Abstract: Everyday AI detection requires differentiating between people and AI in informal, online conversations. In many cases, people will not interact directly with AI systems but instead read conversations between AI systems and other people. We measured how well people and large language models can discriminate using two modified versions of the Turing test: inverted and displaced. GPT-3.5, GPT-4, and… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.00761  [pdf, other

    cs.LG cs.CE

    Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models

    Authors: Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E. Jones, Nikolaos Bouklas

    Abstract: Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_0$ sparsification prior to Stein variational gradient descent ($L_0$+SVGD) is a more robust and efficient means of uncertainty quantificatio… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 30 pages, 11 figures

  3. arXiv:2406.14737  [pdf, other

    cs.CL

    Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?

    Authors: Zhiqiang Pi, Annapurna Vadaparty, Benjamin K. Bergen, Cameron R. Jones

    Abstract: Recent empirical results have sparked a debate about whether or not Large Language Models (LLMs) are capable of Theory of Mind (ToM). While some have found LLMs to be successful on ToM evaluations such as the False Belief task (Kosinski, 2023), others have argued that LLMs solve these tasks by exploiting spurious correlations -- not representing beliefs -- since they fail on trivial alterations to… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.03273  [pdf, other

    cs.CV

    VWise: A novel benchmark for evaluating scene classification for vehicular applications

    Authors: Pedro Azevedo, Emanuella Araújo, Gabriel Pierre, Willams de Lima Costa, João Marcelo Teixeira, Valter Ferreira, Roberto Jones, Veronica Teichrieb

    Abstract: Current datasets for vehicular applications are mostly collected in North America or Europe. Models trained or evaluated on these datasets might suffer from geographical bias when deployed in other regions. Specifically, for scene classification, a highway in a Latin American country differs drastically from an Autobahn, for example, both in design and maintenance levels. We propose VWise, a novel… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2406.02383  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Learning to Edit Visual Programs with Self-Supervision

    Authors: R. Kenny Jones, Renhao Zhang, Aditya Ganeshan, Daniel Ritchie

    Abstract: We design a system that learns how to edit visual programs. Our edit network consumes a complete input program and a visual target. From this input, we task our network with predicting a local edit operation that could be applied to the input program to improve its similarity to the target. In order to apply this scheme for domains that lack program annotations, we develop a self-supervised learni… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.20319  [pdf, other

    cs.CV cs.AI cs.GR cs.HC cs.SC

    ParSEL: Parameterized Shape Editing with Language

    Authors: Aditya Ganeshan, Ryan Y. Huang, Xianghao Xu, R. Kenny Jones, Daniel Ritchie

    Abstract: The ability to edit 3D assets from natural language presents a compelling paradigm to aid in the democratization of 3D content creation. However, while natural language is often effective at communicating general intent, it is poorly suited for specifying precise manipulation. To address this gap, we introduce ParSEL, a system that enables controllable editing of high-quality 3D assets from natura… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  7. arXiv:2405.08007  [pdf, other

    cs.HC cs.AI

    People cannot distinguish GPT-4 from a human in a Turing test

    Authors: Cameron R. Jones, Benjamin K. Bergen

    Abstract: We evaluated 3 systems (ELIZA, GPT-3.5 and GPT-4) in a randomized, controlled, and preregistered Turing test. Human participants had a 5 minute conversation with either a human or an AI, and judged whether or not they thought their interlocutor was human. GPT-4 was judged to be a human 54% of the time, outperforming ELIZA (22%) but lagging behind actual humans (67%). The results provide the first… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 23 pages, 13 figures

  8. arXiv:2404.17584  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Equivariant graph convolutional neural networks for the representation of homogenized anisotropic microstructural mechanical response

    Authors: Ravi Patel, Cosmin Safta, Reese E. Jones

    Abstract: Composite materials with different microstructural material symmetries are common in engineering applications where grain structure, alloying and particle/fiber packing are optimized via controlled manufacturing. In fact these microstructural tunings can be done throughout a part to achieve functional gradation and optimization at a structural level. To predict the performance of particular micros… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 23 pages, 10 figures

  9. arXiv:2403.16895  [pdf

    cs.HC cs.AI

    "It is there, and you need it, so why do you not use it?" Achieving better adoption of AI systems by domain experts, in the case study of natural science research

    Authors: Auste Simkute, Ewa Luger, Michael Evans, Rhianne Jones

    Abstract: Artificial Intelligence (AI) is becoming ubiquitous in domains such as medicine and natural science research. However, when AI systems are implemented in practice, domain experts often refuse them. Low acceptance hinders effective human-AI collaboration, even when it is essential for progress. In natural science research, scientists' ineffective use of AI-enabled systems can impede them from analy… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  10. arXiv:2403.15476  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Learning to Infer Generative Template Programs for Visual Concepts

    Authors: R. Kenny Jones, Siddhartha Chaudhuri, Daniel Ritchie

    Abstract: People grasp flexible visual concepts from a few examples. We explore a neurosymbolic system that learns how to infer programs that capture visual concepts in a domain-general fashion. We introduce Template Programs: programmatic expressions from a domain-specific language that specify structural and parametric patterns common to an input concept. Our framework supports multiple concept-related ta… ▽ More

    Submitted 9 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: ICML 2024; Project page: https://rkjones4.github.io/template.html

  11. arXiv:2403.09675  [pdf, other

    cs.CV cs.GR

    Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases

    Authors: Rio Aguina-Kang, Maxim Gumin, Do Heon Han, Stewart Morris, Seung Jean Yoo, Aditya Ganeshan, R. Kenny Jones, Qiuhong Anna Wei, Kailiang Fu, Daniel Ritchie

    Abstract: We present a system for generating indoor scenes in response to text prompts. The prompts are not limited to a fixed vocabulary of scene descriptions, and the objects in generated scenes are not restricted to a fixed set of object categories -- we call this setting indoor scene generation. Unlike most prior work on indoor scene generation, our system does not require a large training dataset of ex… ▽ More

    Submitted 4 February, 2024; originally announced March 2024.

    Comments: See ancillary files for link to supplemental material

  12. Evaluating Versal AI Engines for option price discovery in market risk analysis

    Authors: Mark Klaisoongnoen, Nick Brown, Tim Dykes, Jessica R. Jones, Utz-Uwe Haus

    Abstract: Whilst Field-Programmable Gate Arrays (FPGAs) have been popular in accelerating high-frequency financial workload for many years, their application in quantitative finance, the utilisation of mathematical models to analyse financial markets and securities, is less mature. Nevertheless, recent work has demonstrated the benefits that FPGAs can deliver to quantitative workloads, and in this paper, we… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Author accepted version of paper accepted to the 32nd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays

  13. arXiv:2402.11179  [pdf, other

    cs.LG math.ST physics.comp-ph

    Uncertainty Quantification of Graph Convolution Neural Network Models of Evolving Processes

    Authors: Jeremiah Hauth, Cosmin Safta, Xun Huan, Ravi G. Patel, Reese E. Jones

    Abstract: The application of neural network models to scientific machine learning tasks has proliferated in recent years. In particular, neural network models have proved to be adept at modeling processes with spatial-temporal complexity. Nevertheless, these highly parameterized models have garnered skepticism in their ability to produce outputs with quantified error bounds over the regimes of interest. Hen… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 27 pages, 20 figures

  14. arXiv:2312.04933  [pdf, other

    quant-ph cs.DC

    A Hybrid Classical-Quantum HPC Workload

    Authors: Aniello Esposito, Sebastien Cabaniols, Jessica R. Jones, David Brayford

    Abstract: A strategy for the orchestration of hybrid classical-quantum workloads on supercomputers featuring quantum devices is proposed. The method makes use of heterogeneous job launches with Slurm to interleave classical and quantum computation, thereby reducing idle time of the quantum components. To better understand the possible shortcomings and bottlenecks of such a workload, an example application i… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 5 pages, 7 listings, 4 figures. Presented at WIHPQC 2023

    ACM Class: B.m; B.8.1

  15. arXiv:2312.04648  [pdf, other

    stat.ML cs.LG

    Enhancing Polynomial Chaos Expansion Based Surrogate Modeling using a Novel Probabilistic Transfer Learning Strategy

    Authors: Wyatt Bridgman, Uma Balakrishnan, Reese Jones, Jiefu Chen, Xuqing Wu, Cosmin Safta, Yueqin Huang, Mohammad Khalil

    Abstract: In the field of surrogate modeling, polynomial chaos expansion (PCE) allows practitioners to construct inexpensive yet accurate surrogates to be used in place of the expensive forward model simulations. For black-box simulations, non-intrusive PCE allows the construction of these surrogates using a set of simulation response evaluations. In this context, the PCE coefficients can be obtained using… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  16. arXiv:2310.20216  [pdf, other

    cs.AI cs.CL

    Does GPT-4 pass the Turing test?

    Authors: Cameron R. Jones, Benjamin K. Bergen

    Abstract: We evaluated GPT-4 in a public online Turing test. The best-performing GPT-4 prompt passed in 49.7% of games, outperforming ELIZA (22%) and GPT-3.5 (20%), but falling short of the baseline set by human participants (66%). Participants' decisions were based mainly on linguistic style (35%) and socioemotional traits (27%), supporting the idea that intelligence, narrowly conceived, is not sufficient… ▽ More

    Submitted 20 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 28 pages, 21 figures

  17. arXiv:2310.10831  [pdf, other

    math.NA cs.LG math.DS

    Accurate Data-Driven Surrogates of Dynamical Systems for Forward Propagation of Uncertainty

    Authors: Saibal De, Reese E. Jones, Hemanth Kolla

    Abstract: Stochastic collocation (SC) is a well-known non-intrusive method of constructing surrogate models for uncertainty quantification. In dynamical systems, SC is especially suited for full-field uncertainty propagation that characterizes the distributions of the high-dimensional primary solution fields of a model with stochastic input parameters. However, due to the highly nonlinear nature of the para… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  18. arXiv:2310.03652  [pdf, other

    cs.CE cs.LG

    Extreme sparsification of physics-augmented neural networks for interpretable model discovery in mechanics

    Authors: Jan N. Fuhg, Reese E. Jones, Nikolaos Bouklas

    Abstract: Data-driven constitutive modeling with neural networks has received increased interest in recent years due to its ability to easily incorporate physical and mechanistic constraints and to overcome the challenging and time-consuming task of formulating phenomenological constitutive laws that can accurately capture the observed material response. However, even though neural network-based constitutiv… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 34 pages, 19 Figures

    MSC Class: 74B20 (Primary); 74C05 (Secondary)

  19. arXiv:2309.14972  [pdf, other

    cs.CV cs.AI cs.GR

    Improving Unsupervised Visual Program Inference with Code Rewriting Families

    Authors: Aditya Ganeshan, R. Kenny Jones, Daniel Ritchie

    Abstract: Programs offer compactness and structure that makes them an attractive representation for visual data. We explore how code rewriting can be used to improve systems for inferring programs from visual data. We first propose Sparse Intermittent Rewrite Injection (SIRI), a framework for unsupervised bootstrapped learning. SIRI sparsely applies code rewrite operations over a dataset of training program… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted at ICCV 23 (oral). Website: https://bardofcodes.github.io/coref/

  20. Visualizing Comparisons of Bills of Materials

    Authors: Rebecca Jones, Lucas Tate

    Abstract: Data analysis often involves the comparison of complex objects. With the ever increasing amounts and complexity of data, the demand for systems to help with these comparisons is also growing. Increasingly, information visualization tools support such comparisons explicitly, beyond simply allowing a viewer to examine each object individually. In this paper, we argue that the design of information v… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Journal ref: 2023 IEEE Symposium on Visualization for Cyber Security (VizSec), Melbourne, Australia, 2023, pp. 12-16

  21. arXiv:2309.10656  [pdf, other

    cs.LG

    A spectrum of physics-informed Gaussian processes for regression in engineering

    Authors: Elizabeth J Cross, Timothy J Rogers, Daniel J Pitchforth, Samuel J Gibson, Matthew R Jones

    Abstract: Despite the growing availability of sensing and data in general, we remain unable to fully characterise many in-service engineering systems and structures from a purely data-driven approach. The vast data and resources available to capture human activity are unmatched in our engineered world, and, even in cases where data could be referred to as ``big,'' they will rarely hold information across op… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  22. arXiv:2309.04976  [pdf, other

    cs.LG cs.AI eess.SY

    AVARS -- Alleviating Unexpected Urban Road Traffic Congestion using UAVs

    Authors: Jiaying Guo, Michael R. Jones, Soufiene Djahel, Shen Wang

    Abstract: Reducing unexpected urban traffic congestion caused by en-route events (e.g., road closures, car crashes, etc.) often requires fast and accurate reactions to choose the best-fit traffic signals. Traditional traffic light control systems, such as SCATS and SCOOT, are not efficient as their traffic data provided by induction loops has a low update frequency (i.e., longer than 1 minute). Moreover, th… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  23. arXiv:2308.13274  [pdf, other

    cs.DC

    Fortran High-Level Synthesis: Reducing the barriers to accelerating HPC codes on FPGAs

    Authors: Gabriel Rodriguez-Canal, Nick Brown, Tim Dykes, Jessica R. Jones, Utz-Uwe Haus

    Abstract: In recent years the use of FPGAs to accelerate scientific applications has grown, with numerous applications demonstrating the benefit of FPGAs for high performance workloads. However, whilst High Level Synthesis (HLS) has significantly lowered the barrier to entry in programming FPGAs by enabling programmers to use C++, a major challenge is that most often these codes are not originally written i… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Author accepted version to appear in 33rd International Conference on Field-Programmable Logic and Applications

  24. arXiv:2308.11080  [pdf, other

    cond-mat.soft cs.LG

    Stress representations for tensor basis neural networks: alternative formulations to Finger-Rivlin-Ericksen

    Authors: Jan N. Fuhg, Nikolaos Bouklas, Reese E. Jones

    Abstract: Data-driven constitutive modeling frameworks based on neural networks and classical representation theorems have recently gained considerable attention due to their ability to easily incorporate constitutive constraints and their excellent generalization performance. In these models, the stress prediction follows from a linear combination of invariant-dependent coefficient functions and known tens… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 32 pages, 20 figures, 4 appendices

  25. arXiv:2307.06424  [pdf, other

    stat.ME cs.CE stat.CO stat.ML

    Robust scalable initialization for Bayesian variational inference with multi-modal Laplace approximations

    Authors: Wyatt Bridgman, Reese Jones, Mohammad Khalil

    Abstract: For predictive modeling relying on Bayesian inversion, fully independent, or ``mean-field'', Gaussian distributions are often used as approximate probability density functions in variational inference since the number of variational parameters is twice the number of unknown model parameters. The resulting diagonal covariance structure coupled with unimodal behavior can be too restrictive when deal… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  26. arXiv:2306.12346  [pdf, ps, other

    cs.DC cs.ET

    A Practical Overview of Quantum Computing: Is Exascale Possible?

    Authors: James H. Davenport, Jessica R. Jones, Matthew Thomason

    Abstract: Despite numerous advances in the field and a seemingly ever-increasing amount of investment, we are still some years away from seeing a production quantum computer in action. However, it is possible to make some educated guesses about the operational difficulties and challenges that may be encountered in practice. We can be reasonably confident that the early machines will be hybrid, with the quan… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 9 pages, 0 figures

    ACM Class: B.m; B.8.1

  27. arXiv:2305.08657  [pdf, other

    stat.ML cs.LG stat.AP

    Encoding Domain Expertise into Multilevel Models for Source Location

    Authors: Lawrence A. Bull, Matthew R. Jones, Elizabeth J. Cross, Andrew Duncan, Mark Girolami

    Abstract: Data from populations of systems are prevalent in many industrial applications. Machines and infrastructure are increasingly instrumented with sensing systems, emitting streams of telemetry data with complex interdependencies. In practice, data-centric monitoring procedures tend to consider these assets (and respective models) as distinct -- operating in isolation and associated with independent d… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  28. arXiv:2305.05661  [pdf, other

    cs.GR cs.AI cs.CV cs.LG cs.PL

    ShapeCoder: Discovering Abstractions for Visual Programs from Unstructured Primitives

    Authors: R. Kenny Jones, Paul Guerrero, Niloy J. Mitra, Daniel Ritchie

    Abstract: Programs are an increasingly popular representation for visual data, exposing compact, interpretable structure that supports manipulation. Visual programs are usually written in domain-specific languages (DSLs). Finding "good" programs, that only expose meaningful degrees of freedom, requires access to a DSL with a "good" library of functions, both of which are typically authored by domain experts… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023

  29. arXiv:2304.10320  [pdf, other

    cs.GR

    Neurosymbolic Models for Computer Graphics

    Authors: Daniel Ritchie, Paul Guerrero, R. Kenny Jones, Niloy J. Mitra, Adriana Schulz, Karl D. D. Willis, Jiajun Wu

    Abstract: Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters, stochastic variations, high-quality outputs, compact representation, and more. But they also have some limitations, such as the difficulty of authoring a procedural… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Eurographics 2023 State-of-the-art report (STAR)

  30. Modular machine learning-based elastoplasticity: generalization in the context of limited data

    Authors: Jan N. Fuhg, Craig M. Hamel, Kyle Johnson, Reese Jones, Nikolaos Bouklas

    Abstract: The development of accurate constitutive models for materials that undergo path-dependent processes continues to be a complex challenge in computational solid mechanics. Challenges arise both in considering the appropriate model assumptions and from the viewpoint of data availability, verification, and validation. Recently, data-driven modeling approaches have been proposed that aim to establish s… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: 36 pages, 25 figures

  31. arXiv:2210.02631  [pdf, other

    cs.LG stat.ML

    Data-driven Approaches to Surrogate Machine Learning Model Development

    Authors: H. Rhys Jones, Tingting Mu, Andrei C. Popescu, Yusuf Sulehman

    Abstract: We demonstrate the adaption of three established methods to the field of surrogate machine learning model development. These methods are data augmentation, custom loss functions and transfer learning. Each of these methods have seen widespread use in the field of machine learning, however, here we apply them specifically to surrogate machine learning model development. The machine learning model t… ▽ More

    Submitted 3 November, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 16 pages, 13 figures

  32. arXiv:2210.00854  [pdf, other

    cs.LG

    Deep learning and multi-level featurization of graph representations of microstructural data

    Authors: Reese Jones, Cosmin Safta, Ari Frankel

    Abstract: Many material response functions depend strongly on microstructure, such as inhomogeneities in phase or orientation. Homogenization presents the task of predicting the mean response of a sample of the microstructure to external loading for use in subgrid models and structure-property explorations. Although many microstructural fields have obvious segmentations, learning directly from the graph ind… ▽ More

    Submitted 29 September, 2022; originally announced October 2022.

    Comments: 27 pages, 17 figures

  33. arXiv:2209.15579  [pdf, other

    cs.LG

    Physically Meaningful Uncertainty Quantification in Probabilistic Wind Turbine Power Curve Models as a Damage Sensitive Feature

    Authors: J. H. Mclean, M. R. Jones, B. J. O'Connell, A. E Maguire, T. J. Rogers

    Abstract: A wind turbines' power curve is easily accessible damage sensitive data, and as such is a key part of structural health monitoring in wind turbines. Power curve models can be constructed in a number of ways, but the authors argue that probabilistic methods carry inherent benefits in this use case, such as uncertainty quantification and allowing uncertainty propagation analysis. Many probabilistic… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  34. arXiv:2209.13126  [pdf, other

    cs.LG

    Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter

    Authors: Ruben Villarreal, Nikolaos N. Vlassis, Nhon N. Phan, Tommie A. Catanach, Reese E. Jones, Nathaniel A. Trask, Sharlotte L. B. Kramer, WaiChing Sun

    Abstract: Experimental data is costly to obtain, which makes it difficult to calibrate complex models. For many models an experimental design that produces the best calibration given a limited experimental budget is not obvious. This paper introduces a deep reinforcement learning (RL) algorithm for design of experiments that maximizes the information gain measured by Kullback-Leibler (KL) divergence obtaine… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 40 pages, 20 figures

  35. Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research

    Authors: Ekaterina Garmash, Edgar Tanaka, Ann Clifton, Joana Correia, Sharmistha Jat, Winstead Zhu, Rosie Jones, Jussi Karlgren

    Abstract: In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes. We give an overview of how the data was sampled, descriptive statistics over the collection, as well as information about the distribution over Brazilian and Portuguese dialects. We give results from experiments on multi-lingual summarization, showing that summarizing podcast transcri… ▽ More

    Submitted 13 December, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 12 pages, 1 figure

    Journal ref: Volume 14163 of Lecture Notes in Computer Science, pages 48-59, Springer, 2023

  36. arXiv:2207.12504  [pdf, other

    cs.CL

    Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free

    Authors: M. Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones

    Abstract: Podcasts are conversational in nature and speaker changes are frequent -- requiring speaker diarization for content understanding. We propose an unsupervised technique for speaker diarization without relying on language-specific components. The algorithm is overlap-aware and does not require information about the number of speakers. Our approach shows 79% improvement on purity scores (34% on F-sco… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Published at Interspeech 2022

  37. arXiv:2207.05680  [pdf, other

    cs.MM cs.AI cs.CL cs.SD eess.AS

    The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

    Authors: Shahrzad Naseri, Sravana Reddy, Joana Correia, Jussi Karlgren, Rosie Jones

    Abstract: In this work, we study the association between song lyrics and mood through a data-driven analysis. Our data set consists of nearly one million songs, with song-mood associations derived from user playlists on the Spotify streaming platform. We take advantage of state-of-the-art natural language processing models based on transformers to learn the association between the lyrics and moods. We find… ▽ More

    Submitted 31 May, 2022; originally announced July 2022.

  38. arXiv:2207.01336  [pdf, ps, other

    cs.NI cs.LG

    Spectral Power Profile Optimization of Field-Deployed WDM Network by Remote Link Modeling

    Authors: Rasmus T. Jones, Kyle R. H. Bottrill, Natsupa Taengnoi, Periklis Petropoulos, Metodi P. Yankov

    Abstract: A digital twin model of a multi-node WDM network is obtained from a single access point. The model is used to predict and optimize the transmit power profile for each link in the network and up to 2.2~dB of margin improvements are obtained w.r.t. unoptimized transmission.

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: accepted, European Conference on Optical Communications, ECOC 2022

  39. Physics-informed machine learning for Structural Health Monitoring

    Authors: Elizabeth J Cross, Samuel J Gibson, Matthew R Jones, Daniel J Pitchforth, Sikai Zhang, Timothy J Rogers

    Abstract: The use of machine learning in Structural Health Monitoring is becoming more common, as many of the inherent tasks (such as regression and classification) in developing condition-based assessment fall naturally into its remit. This chapter introduces the concept of physics-informed machine learning, where one adapts ML algorithms to account for the physical insight an engineer will often have of t… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  40. arXiv:2206.03480  [pdf, other

    cs.CV cs.GR cs.LG

    SHRED: 3D Shape Region Decomposition with Learned Local Operations

    Authors: R. Kenny Jones, Aalia Habib, Daniel Ritchie

    Abstract: We present SHRED, a method for 3D SHape REgion Decomposition. SHRED takes a 3D point cloud as input and uses learned local operations to produce a segmentation that approximates fine-grained part instances. We endow SHRED with three decomposition operations: splitting regions, fixing the boundaries between regions, and merging regions together. Modules are trained independently and locally, allowi… ▽ More

    Submitted 3 October, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: SIGGRAPH ASIA 2022

  41. arXiv:2206.01495  [pdf, other

    cs.LG cs.SD eess.AS

    Constraining Gaussian processes for physics-informed acoustic emission mapping

    Authors: Matthew R Jones, Timothy J Rogers, Elizabeth J Cross

    Abstract: The automated localisation of damage in structures is a challenging but critical ingredient in the path towards predictive or condition-based maintenance of high value structures. The use of acoustic emission time of arrival mapping is a promising approach to this challenge, but is severely hindered by the need to collect a dense set of artificial acoustic emission measurements across the structur… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  42. Unikernel Linux (UKL)

    Authors: Ali Raza, Thomas Unger, Matthew Boyd, Eric Munson, Parul Sohal, Ulrich Drepper, Richard Jones, Daniel Bristot de Oliveira, Larry Woodman, Renato Mancuso, Jonathan Appavoo, Orran Krieger

    Abstract: This paper presents Unikernel Linux (UKL), a path toward integrating unikernel optimization techniques in Linux, a general purpose operating system. UKL adds a configuration option to Linux allowing for a single, optimized process to link with the kernel directly, and run at supervisor privilege. This UKL process does not require application source code modification, only a re-link with our, sligh… ▽ More

    Submitted 22 June, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Added more results in the evaluation section. Improved overall writing and added diagrams to explain the architecture

    Journal ref: Proceedings of the Eighteenth European Conference on Computer Systems (EuroSys 23), May 2023, Pages 590 - 605

  43. arXiv:2204.00570  [pdf, other

    cs.LG cs.CV

    Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation

    Authors: Kendrick Shen, Robbie Jones, Ananya Kumar, Sang Michael Xie, Jeff Z. HaoChen, Tengyu Ma, Percy Liang

    Abstract: We consider unsupervised domain adaptation (UDA), where labeled data from a source domain (e.g., photographs) and unlabeled data from a target domain (e.g., sketches) are used to learn a classifier for the target domain. Conventional UDA methods (e.g., domain adversarial training) learn domain-invariant features to improve generalization to the target domain. In this paper, we show that contrastiv… ▽ More

    Submitted 1 December, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: ICML 2022 (Long Talk)

  44. arXiv:2203.13948  [pdf

    cs.CV

    AI-augmented histopathologic review using image analysis to optimize DNA yield and tumor purity from FFPE slides

    Authors: Bolesław L. Osinski, Aïcha BenTaieb, Irvin Ho, Ryan D. Jones, Rohan P. Joshi, Andrew Westley, Michael Carlson, Caleb Willis, Luke Schleicher, Brett M. Mahon, Martin C. Stumpe

    Abstract: To achieve minimum DNA input and tumor purity requirements for next-generation sequencing (NGS), pathologists visually estimate macrodissection and slide count decisions. Misestimation may cause tissue waste and increased laboratory costs. We developed an AI-augmented smart pathology review system (SmartPath) to empower pathologists with quantitative metrics for determining tissue extraction param… ▽ More

    Submitted 7 April, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  45. arXiv:2202.10054  [pdf, other

    cs.LG cs.CV

    Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution

    Authors: Ananya Kumar, Aditi Raghunathan, Robbie Jones, Tengyu Ma, Percy Liang

    Abstract: When transferring a pretrained model to a downstream task, two popular methods are full fine-tuning (updating all the model parameters) and linear probing (updating only the last linear layer -- the "head"). It is well known that fine-tuning leads to better accuracy in-distribution (ID). However, in this paper, we find that fine-tuning can achieve worse accuracy than linear probing out-of-distribu… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: ICLR (Oral) 2022

  46. arXiv:2202.00078  [pdf, other

    physics.app-ph cond-mat.mtrl-sci cs.LG

    A heteroencoder architecture for prediction of failure locations in porous metals using variational inference

    Authors: Wyatt Bridgman, Xiaoxuan Zhang, Greg Teichert, Mohammad Khalil, Krishna Garikipati, Reese Jones

    Abstract: In this work we employ an encoder-decoder convolutional neural network to predict the failure locations of porous metal tension specimens based only on their initial porosities. The process we model is complex, with a progression from initial void nucleation, to saturation, and ultimately failure. The objective of predicting failure locations presents an extreme case of class imbalance since most… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 40 pages, 12 figures

  47. arXiv:2112.07022  [pdf, other

    cs.GR cs.CV cs.LG

    Learning Body-Aware 3D Shape Generative Models

    Authors: Bryce Blinn, Alexander Ding, R. Kenny Jones, Manolis Savva, Srinath Sridhar, Daniel Ritchie

    Abstract: The shape of many objects in the built environment is dictated by their relationships to the human body: how will a person interact with this object? Existing data-driven generative models of 3D shapes produce plausible objects but do not reason about the relationship of those objects to the human body. In this paper, we learn body-aware generative models of 3D shapes. Specifically, we train gener… ▽ More

    Submitted 20 January, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: 11 pages, 8 figures

  48. Podcast Metadata and Content: Episode Relevance andAttractiveness in Ad Hoc Search

    Authors: Ben Carterette, Rosie Jones, Gareth F. Jones, Maria Eskevich, Sravana Reddy, Ann Clifton, Yongze Yu, Jussi Karlgren, Ian Soboroff

    Abstract: Rapidly growing online podcast archives contain diverse content on a wide range of topics. These archives form an important resource for entertainment and professional use, but their value can only be realized if users can rapidly and reliably locate content of interest. Search for relevant content can be based on metadata provided by content creators, but also on transcripts of the spoken content… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

  49. arXiv:2107.00090  [pdf, other

    cs.LG

    Mesh-based graph convolutional neural networks for modeling materials with microstructure

    Authors: Ari Frankel, Cosmin Safta, Coleman Alleman, Reese Jones

    Abstract: Predicting the evolution of a representative sample of a material with microstructure is a fundamental problem in homogenization. In this work we propose a graph convolutional neural network that utilizes the discretized representation of the initial microstructure directly, without segmentation or clustering. Compared to feature-based and pixel-based convolutional neural network models, the propo… ▽ More

    Submitted 29 November, 2021; v1 submitted 3 June, 2021; originally announced July 2021.

    Comments: 45 pages, 19 figures

  50. arXiv:2106.12026  [pdf, other

    cs.CV cs.AI cs.LG

    The Neurally-Guided Shape Parser: Grammar-based Labeling of 3D Shape Regions with Approximate Inference

    Authors: R. Kenny Jones, Aalia Habib, Rana Hanocka, Daniel Ritchie

    Abstract: We propose the Neurally-Guided Shape Parser (NGSP), a method that learns how to assign fine-grained semantic labels to regions of a 3D shape. NGSP solves this problem via MAP inference, modeling the posterior probability of a label assignment conditioned on an input shape with a learned likelihood function. To make this search tractable, NGSP employs a neural guide network that learns to approxima… ▽ More

    Submitted 22 March, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: CVPR 2022; https://github.com/rkjones4/NGSP