Showing 1–50 of 375 results for author: Reid

Searching in archive cs.
  1. arXiv:2408.15898  [pdf, other]

    cs.LG cs.AI

    Airfoil Diffusion: Denoising Diffusion Model For Conditional Airfoil Generation

    Authors: Reid Graves, Amir Barati Farimani

    Abstract: The design of aerodynamic shapes, such as airfoils, has traditionally required significant computational resources and relied on predefined design parameters, which limit the potential for novel shape synthesis. In this work, we introduce a data-driven methodology for airfoil generation using a diffusion model. Trained on a dataset of preexisting airfoils, our model can generate an arbitrary numbe…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 12 Pages, 6 figures

  2. arXiv:2408.14227  [pdf, other]

    cs.CV

    TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation

    Authors: Anh-Dzung Doan, Vu Minh Hieu Phan, Surabhi Gupta, Markus Wagner, Tat-Jun Chin, Ian Reid

    Abstract: Infrared imaging offers resilience against changing lighting conditions by capturing object temperatures. Yet, in a few scenarios, its lack of visual details compared to daytime visible images poses a significant challenge for human and machine interpretation. This paper proposes a novel diffusion method, dubbed Temporally Consistent Patch Diffusion Models (TC-PDM), for infrared-to-visible video tr…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Technical report

  3. arXiv:2408.12655  [pdf, other]

    cs.LG cs.HC

    Improving Radiography Machine Learning Workflows via Metadata Management for Training Data Selection

    Authors: Mirabel Reid, Christine Sweeney, Oleg Korobkin

    Abstract: Most machine learning models require many iterations of hyper-parameter tuning, feature engineering, and debugging to produce effective results. As machine learning models become more complicated, this pipeline becomes more difficult to manage effectively. In the physical sciences, there is an ever-increasing pool of metadata that is generated by the scientific research cycle. Tracking this metada…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 14 pages, 9 figures

  4. arXiv:2408.06463  [pdf, other]

    cs.CR

    Statistical Quality Comparison of the Bitstrings Generated by a Physical Unclonable Function across Xilinx, Altera and Microsemi Devices

    Authors: Jenilee Jao, Kristi Hoffman, Cheryl Reid, Ryan Thomson, Michael Thompson, Jim Plusquellic

    Abstract: Entropy or randomness represents a foundational security property in security-related operations, such as key generation. Key generation in turn is central to security protocols such as authentication and encryption. Physical unclonable functions (PUF) are hardware-based primitives that can serve as key generation engines in modern microelectronic devices and applications. PUFs derive entropy from…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 15 pages, 22 figures, IEEE journal

  5. arXiv:2408.00118  [pdf, other]

    cs.CL cs.AI

    Gemma 2: Improving Open Language Models at a Practical Size

    Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman, et al. (172 additional authors not shown)

    Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al…

    Submitted 2 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  6. arXiv:2407.10061  [pdf, other]

    cs.CV

    InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation

    Authors: Zeyu Zhang, Akide Liu, Qi Chen, Feng Chen, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang

    Abstract: Text-to-motion generation holds potential for film, gaming, and robotics, yet current methods often prioritize short motion generation, making it challenging to produce long motion sequences effectively: (1) Current methods struggle to handle long motion sequences as a single input due to prohibitively high computational cost; (2) Breaking down the generation of long motion sequences into shorter…

    Submitted 13 July, 2024; originally announced July 2024.

  7. arXiv:2407.07171  [pdf, other]

    cs.CV

    ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation

    Authors: Yuyuan Liu, Yuanhong Chen, Hu Wang, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: The costly and time-consuming annotation process to produce large training sets for modelling semantic LiDAR segmentation methods has motivated the development of semi-supervised learning (SSL) methods. However, such SSL approaches often concentrate on employing consistency learning only for individual LiDAR representations. This narrow focus results in limited perturbations that generally fail to…

    Submitted 19 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 27 pages (15 pages main paper and 12 pages supplementary with references), ECCV 2024 accepted

  8. arXiv:2407.05607  [pdf, other]

    cs.CV

    Weakly Supervised Test-Time Domain Adaptation for Object Detection

    Authors: Anh-Dzung Doan, Bach Long Nguyen, Terry Lim, Madhuka Jayawardhana, Surabhi Gupta, Christophe Guettier, Ian Reid, Markus Wagner, Tat-Jun Chin

    Abstract: Prior to deployment, an object detector is trained on a dataset compiled from a previous data collection campaign. However, the environment in which the object detector is deployed will invariably evolve, particularly in outdoor settings where changes in lighting, weather and seasons will significantly affect the appearance of the scene and target objects. It is almost impossible for all potential…

    Submitted 8 July, 2024; originally announced July 2024.

  9. arXiv:2406.14722  [pdf, other]

    cs.AI

    Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms

    Authors: Mirabel Reid, Santosh S. Vempala

    Abstract: As Large Language Models (LLMs) perform (and sometimes excel at) more and more complex cognitive tasks, a natural question is whether AI really understands. The study of understanding in LLMs is in its infancy, and the community has yet to incorporate well-trodden research in philosophy, psychology, and education. We initiate this, specifically focusing on understanding algorithms, and propose a h…

    Submitted 20 August, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 13 pages, 8 figures

    ACM Class: I.2.m; F.1.1

  10. arXiv:2406.11850  [pdf, other]

    cs.CY cs.AI

    Closed-loop Teaching via Demonstrations to Improve Policy Transparency

    Authors: Michael S. Lee, Reid Simmons, Henny Admoni

    Abstract: Demonstrations are a powerful way of increasing the transparency of AI policies. Though informative demonstrations may be selected a priori through the machine teaching paradigm, student learning may deviate from the preselected curriculum in situ. This paper thus explores augmenting a curriculum with a closed-loop teaching framework inspired by principles from the education literature, such as th…

    Submitted 1 April, 2024; originally announced June 2024.

    Comments: Supplementary material available at https://drive.google.com/file/d/1f_BDk3JpY6DvqlvgKtnQZ8zdfO3XAn3p/view?usp=drive_link

  11. arXiv:2406.11363  [pdf, other]

    cs.SE

    A Preliminary Study on Self-Contained Libraries in the NPM Ecosystem

    Authors: Pongchai Jaisri, Brittany Reid, Raula Gaikovina Kula

    Abstract: The widespread use of libraries within modern software ecosystems creates complex networks of dependencies. These dependencies are prone to breakage, obsolescence, or redundancy, potentially leading to cascading issues in dependent libraries. One mitigation strategy involves reducing dependencies; libraries with zero dependencies become self-contained. This paper explores the characteristics of self-…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 5 figures. Accepted at the 22nd IEEE/ACIS International Conference on Software Engineering, Management and Applications (SERA 2024)

    ACM Class: D.2.7

  12. arXiv:2406.10724  [pdf, other]

    eess.IV cs.CV cs.LG

    Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft

    Authors: Ian Vyse, Rishit Dagli, Dav Vrat Chadha, John P. Ma, Hector Chen, Isha Ruparelia, Prithvi Seran, Matthew Xie, Eesa Aamer, Aidan Armstrong, Naveen Black, Ben Borstein, Kevin Caldwell, Orrin Dahanaggamaarachchi, Joe Dai, Abeer Fatima, Stephanie Lu, Maxime Michet, Anoushka Paul, Carrie Ann Po, Shivesh Prakash, Noa Prosser, Riddhiman Roy, Mirai Shinjo, Iliya Shofman, et al. (4 additional authors not shown)

    Abstract: Satellite remote sensing missions have gained popularity over the past fifteen years due to their ability to cover large swaths of land at regular intervals, making them ideal for monitoring environmental trends. The FINCH mission, a 3U+ CubeSat equipped with a hyperspectral camera, aims to monitor crop residue cover in agricultural fields. Although hyperspectral imaging captures both spectral and…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear in 38th Annual Small Satellite Conference

  13. arXiv:2406.07767  [pdf, other]

    cs.RO cs.LG

    Conformalized Teleoperation: Confidently Mapping Human Inputs to High-Dimensional Robot Actions

    Authors: Michelle Zhao, Reid Simmons, Henny Admoni, Andrea Bajcsy

    Abstract: Assistive robotic arms often have more degrees-of-freedom than a human teleoperator can control with a low-dimensional input, like a joystick. To overcome this challenge, existing approaches use data-driven methods to learn a mapping from low-dimensional human inputs to high-dimensional robot actions. However, determining if such a black-box mapping can confidently infer a user's intended high-dim…

    Submitted 10 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  14. arXiv:2406.06581  [pdf, other]

    cs.CL cs.AI cs.LG

    Set-Based Prompting: Provably Solving the Language Model Order Dependency Problem

    Authors: Reid McIlroy-Young, Katrina Brown, Conlan Olson, Linjun Zhang, Cynthia Dwork

    Abstract: The development of generative language models that can create long and coherent textual outputs via autoregression has led to a proliferation of uses and a corresponding sweep of analyses as researchers work to determine the limitations of this new paradigm. Unlike humans, these 'Large Language Models' (LLMs) are highly sensitive to small changes in their inputs, leading to unwanted inconsistency…

    Submitted 12 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 29 pages, 27 figures, code https://github.com/reidmcy/set-based-prompting

  15. arXiv:2405.16541  [pdf, other]

    stat.ML cs.LG

    Variance-Reducing Couplings for Random Features: Perspectives from Optimal Transport

    Authors: Isaac Reid, Stratis Markou, Krzysztof Choromanski, Richard E. Turner, Adrian Weller

    Abstract: Random features (RFs) are a popular technique to scale up kernel methods in machine learning, replacing exact kernel evaluations with stochastic Monte Carlo estimates. They underpin models as diverse as efficient transformers (by approximating attention) to sparse spectrum Gaussian processes (by approximating the covariance function). Efficiency can be further improved by speeding up the convergen…

    Submitted 26 May, 2024; originally announced May 2024.

  16. arXiv:2405.15893  [pdf, other]

    cs.SI

    Quantifying Influencer Effects on Affective Polarization

    Authors: Rezaur Rashid, Joshua Melton, Ouldouz Ghorbani, Siddharth Krishnan, Shannon Reid, Gabriel Terejanu

    Abstract: In an era where digital platforms increasingly mediate public discourse, grasping the complexities and nuances in affective polarization--especially as influenced by key figures on social media--has never been more vital. This study delves into the intricate web of interactions on Twitter, now rebranded as 'X', to unravel how influencer-led conversations catalyze shifts in public sentiment, laying…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  17. arXiv:2405.10255  [pdf, other]

    cs.CV cs.RO

    When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

    Authors: Xianzheng Ma, Yash Bhalgat, Brandon Smart, Shuai Chen, Xinghui Li, Jian Ding, Jindong Gu, Dave Zhenyu Chen, Songyou Peng, Jia-Wang Bian, Philip H Torr, Marc Pollefeys, Matthias Nießner, Ian D Reid, Angel X. Chang, Iro Laina, Victor Adrian Prisacariu

    Abstract: As large language models (LLMs) evolve, their integration with 3D spatial data (3D-LLMs) has seen rapid progress, offering unprecedented capabilities for understanding and interacting with physical spaces. This survey provides a comprehensive overview of the methodologies enabling LLMs to process, understand, and generate 3D data. Highlighting the unique advantages of LLMs, such as in-context lear…

    Submitted 16 May, 2024; originally announced May 2024.

  18. arXiv:2405.07560  [pdf]

    cs.LG

    Coding historical causes of death data with Large Language Models

    Authors: Bjørn Pedersen, Maisha Islam, Doris Tove Kristoffersen, Lars Ailo Bongo, Eilidh Garrett, Alice Reid, Hilde Sommerseth

    Abstract: This paper investigates the feasibility of using pre-trained generative Large Language Models (LLMs) to automate the assignment of ICD-10 codes to historical causes of death. Due to the complex narratives often found in historical causes of death, this task has traditionally been manually performed by coding experts. We evaluate the ability of GPT-3.5, GPT-4, and Llama 2 LLMs to accurately assign…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 18 pages, 1 figure in main text, 3 figures in appendix

  19. arXiv:2405.06085  [pdf, ps, other]

    cs.DC cs.OS

    Zero-consistency root emulation for unprivileged container image build

    Authors: Reid Priedhorsky, Michael Jennings, Megan Phinney

    Abstract: Do Linux distribution package managers need the privileged operations they request to actually happen? Apparently not, at least for building container images for HPC applications. We use this observation to implement a root emulation mode using a Linux seccomp filter that intercepts some privileged system calls, does nothing, and returns success to the calling program. This approach provides no co…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 7 pages, 2 figures

    Report number: LA-UR 24-24056

  20. arXiv:2405.05792  [pdf, other]

    cs.RO cs.AI cs.CV cs.HC cs.LG

    RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation

    Authors: Sourav Garg, Krishan Rana, Mehdi Hosseinzadeh, Lachlan Mares, Niko Sünderhauf, Feras Dayoub, Ian Reid

    Abstract: Mapping is crucial for spatial reasoning, planning and robot navigation. Existing approaches range from metric, which require precise geometry-based optimization, to purely topological, where image-as-node based graphs lack explicit object-level reasoning and interconnectivity. In this paper, we propose a novel topological representation of an environment based on "image segments", which are seman…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Published at ICRA 2024; 9 pages, 8 figures

  21. arXiv:2405.05066  [pdf, other]

    cs.AI cs.CY cs.LG

    Designing Skill-Compatible AI: Methodologies and Frameworks in Chess

    Authors: Karim Hamade, Reid McIlroy-Young, Siddhartha Sen, Jon Kleinberg, Ashton Anderson

    Abstract: Powerful artificial intelligence systems are often used in settings where they must interact with agents that are computationally much weaker, for example when they work alongside humans or operate in complex environments where some tasks are handled by algorithms, heuristics, or other entities of varying computational power. For AI agents to successfully interact in these settings, however, achie…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 18 pages, 5 figures, 15 tables, Published In The Twelfth International Conference on Learning Representations, ICLR 2024

  22. arXiv:2405.04380  [pdf, other]

    math.OC cs.CE math.NA

    Preserving Nonlinear Constraints in Variational Flow Filtering Data Assimilation

    Authors: Amit N. Subrahmanya, Andrey A. Popov, Reid J. Gomillion, Adrian Sandu

    Abstract: Data assimilation aims to estimate the states of a dynamical system by optimally combining sparse and noisy observations of the physical system with uncertain forecasts produced by a computational model. The states of many dynamical systems of interest obey nonlinear physical constraints, and the corresponding dynamics is confined to a certain sub-manifold of the state space. Standard data assimil…

    Submitted 7 May, 2024; originally announced May 2024.

    Report number: CSL-TR-24-1

    MSC Class: 65C05; 62F15; 62F30; 35R30

  23. arXiv:2404.16188  [pdf, other]

    cs.LG cs.AI stat.ML

    Pearls from Pebbles: Improved Confidence Functions for Auto-labeling

    Authors: Harit Vishwakarma, Reid Chen, Sui Jiet Tay, Satya Sai Srinath Namburi, Frederic Sala, Ramya Korlakai Vinayak

    Abstract: Auto-labeling is an important family of techniques that produce labeled training sets with minimum manual labeling. A prominent variant, threshold-based auto-labeling (TBAL), works by finding a threshold on a model's confidence scores above which it can accurately label unlabeled data points. However, many models are known to produce overconfident scores, leading to poor TBAL performance. While a…

    Submitted 24 April, 2024; originally announced April 2024.

  24. arXiv:2404.15472  [pdf, other]

    cs.RO

    Understanding Robot Minds: Leveraging Machine Teaching for Transparent Human-Robot Collaboration Across Diverse Groups

    Authors: Suresh Kumaar Jayaraman, Reid Simmons, Aaron Steinfeld, Henny Admoni

    Abstract: In this work, we aim to improve transparency and efficacy in human-robot collaboration by developing machine teaching algorithms suitable for groups with varied learning capabilities. While previous approaches focused on tailored approaches for teaching individuals, our method teaches teams with various compositions of diverse learners using team belief representations to address personalization c…

    Submitted 23 April, 2024; originally announced April 2024.

  25. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  26. arXiv:2404.10228  [pdf, other]

    cs.LG cs.CL cs.SI

    Two-Stage Stance Labeling: User-Hashtag Heuristics with Graph Neural Networks

    Authors: Joshua Melton, Shannon Reid, Gabriel Terejanu, Siddharth Krishnan

    Abstract: The high volume and rapid evolution of content on social media present major challenges for studying the stance of social media users. In this work, we develop a two-stage stance labeling method that utilizes the user-hashtag bipartite graph and the user-user interaction graph. In the first stage, a simple and efficient heuristic for stance labeling uses the user-hashtag bipartite graph to iterati…

    Submitted 17 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  27. arXiv:2404.10179  [pdf, other]

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi, et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio…

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  28. arXiv:2404.05578  [pdf, other]

    cs.CV

    Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

    Authors: Mahsa Ehsanpour, Ian Reid, Hamid Rezatofighi

    Abstract: For a complete comprehension of multi-person scenes, it is essential to go beyond basic tasks like detection and tracking. Higher-level tasks, such as understanding the interactions and social activities among individuals, are also crucial. Progress towards models that can fully understand scenes involving multiple people is hindered by a lack of sufficient annotated data for such high-level tasks…

    Submitted 8 April, 2024; originally announced April 2024.

  29. arXiv:2404.04268  [pdf]

    cs.IR cs.AI cs.CY cs.SI

    The Use of Generative Search Engines for Knowledge Work and Complex Tasks

    Authors: Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang

    Abstract: Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine.…

    Submitted 19 March, 2024; originally announced April 2024.

    Comments: 32 pages, 3 figures, 4 tables

    ACM Class: J.4

  30. arXiv:2404.01686  [pdf, other]

    cs.CV

    JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

    Authors: Duy-Tho Le, Chenhui Gou, Stavya Datta, Hengcan Shi, Ian Reid, Jianfei Cai, Hamid Rezatofighi

    Abstract: Autonomous robot systems have attracted increasing research attention in recent years, where environment understanding is a crucial step for robot navigation, human-robot interaction, and decision. Real-world robot systems usually collect visual data from multiple sensors and are required to recognize numerous objects and their movements in complex human-crowded settings. Traditional benchmarks, w…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  31. arXiv:2403.15095  [pdf]

    physics.geo-ph cs.LG

    End-to-End Mineral Exploration with Artificial Intelligence and Ambient Noise Tomography

    Authors: Jack Muir, Gerrit Olivier, Anthony Reid

    Abstract: This paper presents an innovative end-to-end workflow for mineral exploration, integrating ambient noise tomography (ANT) and artificial intelligence (AI) to enhance the discovery and delineation of mineral resources essential for the global transition to a low carbon economy. We focus on copper as a critical element, required in significant quantities for renewable energy solutions. We show the b…

    Submitted 22 March, 2024; originally announced March 2024.

  32. arXiv:2403.12388  [pdf, other]

    cs.IR cs.AI

    Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

    Authors: Ying-Chun Lin, Jennifer Neville, Jack W. Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan

    Abstract: Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur…

    Submitted 8 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  33. arXiv:2403.12173  [pdf, other]

    cs.CL cs.AI cs.IR

    TnT-LLM: Text Mining at Scale with Large Language Models

    Authors: Mengting Wan, Tara Safavi, Sujay Kumar Jauhar, Yujin Kim, Scott Counts, Jennifer Neville, Siddharth Suri, Chirag Shah, Ryen W White, Longqi Yang, Reid Andersen, Georg Buscher, Dhruv Joshi, Nagu Rangan

    Abstract: Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. Thi…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 9 pages main content, 8 pages references and appendix

  34. arXiv:2403.09212  [pdf, other]

    cs.CV

    PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

    Authors: Jiajun Deng, Sha Zhang, Feras Dayoub, Wanli Ouyang, Yanyong Zhang, Ian Reid

    Abstract: In this work, we present PoIFusion, a simple yet effective multi-modal 3D object detection framework to fuse the information of RGB images and LiDAR point clouds at the point of interest (abbreviated as PoI). Technically, our PoIFusion follows the paradigm of query-based object detection, formulating object queries as dynamic 3D boxes. The PoIs are adaptively generated from each query box on the f…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: NIL

  35. arXiv:2403.08733  [pdf, other]

    cs.CV

    GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

    Authors: Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu

    Abstract: We propose GaussCtrl, a text-driven method to edit a 3D scene reconstructed by the 3D Gaussian Splatting (3DGS). Our method first renders a collection of images by using the 3DGS and edits them by using a pre-trained 2D diffusion model (ControlNet) based on the input prompt, which is then used to optimise the 3D model. Our key contribution is multi-view consistent editing, which enables editin…

    Submitted 14 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: ECCV2024, Project Website: https://gaussctrl.active.vision/

  36. arXiv:2403.08295  [pdf, other]

    cs.CL cs.AI

    Gemma: Open Models Based on Gemini Research and Technology

    Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, et al. (83 additional authors not shown)

    Abstract: This work introduces Gemma, a family of lightweight, state-of-the-art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge…

    Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  37. arXiv:2403.07487  [pdf, other]

    cs.CV

    Motion Mamba: Efficient and Long Sequence Motion Generation

    Authors: Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang

    Abstract: Human motion generation stands as a significant pursuit in generative computer vision, while achieving long-sequence and efficient motion generation remains challenging. Recent advancements in state space models (SSMs), notably Mamba, have showcased considerable promise in long sequence modeling with an efficient hardware-aware design, which appears to be a promising direction to build motion gene…

    Submitted 3 August, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCV 2024

  38. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  39. arXiv:2403.04893  [pdf, other]

    cs.AI

    A Safe Harbor for AI Evaluation and Red Teaming

    Authors: Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng-Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson

    Abstract: Independent evaluation and red teaming are critical for identifying the risks posed by generative AI systems. However, the terms of service and enforcement strategies used by prominent AI companies to deter model misuse have disincentives on good faith safety evaluations. This causes some researchers to fear that conducting such research or releasing their findings will result in account suspensio…

    Submitted 7 March, 2024; originally announced March 2024.

  40. arXiv:2403.00810  [pdf, other]

    cs.AI cs.CL

    Bootstrapping Cognitive Agents with a Large Language Model

    Authors: Feiyu Zhu, Reid Simmons

    Abstract: Large language models contain noisy general knowledge of the world, yet are hard to train or fine-tune. On the other hand, cognitive architectures have excellent interpretability and are flexible to update but require a lot of manual work to instantiate. In this work, we combine the best of both worlds: bootstrapping a cognitive-based model with the noisy knowledge encoded in large language models.…

    Submitted 24 February, 2024; originally announced March 2024.

  41. arXiv:2402.09984  [pdf, other]

    cs.LG cs.AI

    Symmetry-Breaking Augmentations for Ad Hoc Teamwork

    Authors: Ravi Hammond, Dustin Craggs, Mingyu Guo, Jakob Foerster, Ian Reid

    Abstract: In many collaborative settings, artificial intelligence (AI) agents must be able to adapt to new teammates that use unknown or previously unobserved strategies. While often simple for humans, this can be challenging for AI agents. For example, if an AI agent learns to drive alongside others (a training set) that only drive on one side of the road, it may struggle to adapt this experience to coordi…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Currently in review for ICML 2024. 16 pages (including references and appendix), 9 figures, 11 tables

  42. arXiv:2402.04494  [pdf, other]

    cs.LG cs.AI stat.ML

    Grandmaster-Level Chess Without Search

    Authors: Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid, Tim Genewein

    Abstract: The recent breakthrough successes in machine learning are mainly attributed to scale: namely large-scale attention-based architectures and datasets of unprecedented scale. This paper investigates the impact of training at scale for chess. Unlike traditional chess engines that rely on complex heuristics, explicit search, or a combination of both, we train a 270M parameter transformer model with sup…

    Submitted 6 February, 2024; originally announced February 2024.

  43. arXiv:2402.03214  [pdf, other]

    cs.CV cs.AI cs.LG

    Organic or Diffused: Can We Distinguish Human Art from AI-generated Images?

    Authors: Anna Yoo Jeong Ha, Josephine Passananti, Ronik Bhaskar, Shawn Shan, Reid Southen, Haitao Zheng, Ben Y. Zhao

    Abstract: The advent of generative AI images has completely disrupted the art world. Distinguishing AI generated images from human art is a challenging problem whose impact is growing over time. A failure to address this problem allows bad actors to defraud individuals paying a premium for human art and companies whose stated policies forbid AI imagery. It is also critical for content owners to establish co…

    Submitted 2 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  44. arXiv:2401.15834  [pdf, other]

    cs.CV cs.AI

    Few and Fewer: Learning Better from Few Examples Using Fewer Base Classes

    Authors: Raphael Lafargue, Yassir Bendou, Bastien Pasdeloup, Jean-Philippe Diguet, Ian Reid, Vincent Gripon, Jack Valmadre

    Abstract: When training data is scarce, it is common to make use of a feature extractor that has been pre-trained on a large base dataset, either by fine-tuning its parameters on the "target" dataset or by directly adopting its representation as features for a simple classifier. Fine-tuning is ineffective for few-shot learning, since the target dataset contains only a handful of examples. However, directl…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 9.5 pages + bibliography and supplementary material

    MSC Class: 68T

    ACM Class: I.2; I.4; I.5

  45. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  46. arXiv:2312.06707  [pdf, other]

    cs.CY

    Exploring Public's Perception of Safety and Video Surveillance Technology: A Survey Approach

    Authors: Babak Rahimi Ardabili, Armin Danesh Pazho, Ghazal Alinezhad Noghre, Vinit Katariya, Gordon Hull, Shannon Reid, Hamed Tabkhi

    Abstract: Addressing public safety effectively requires incorporating diverse stakeholder perspectives, particularly those of the community, which are often underrepresented compared to other stakeholders. This study presents a comprehensive analysis of the community's general public safety concerns, their view of existing surveillance technologies, and their perception of AI-driven solutions for enhancing…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 12 pages, 4 figures

  47. arXiv:2312.04092  [pdf]

    cs.DL

    Data stewardship: case studies from North-American, Dutch, and Finnish universities

    Authors: Antti M. Rousi, Reid I. Boehm, Yan Wang

    Abstract: Purpose - As national legislation, federated national services, institutional policies and institutional research service organizations may differ, data stewardship transpires differently in higher education institutions across the world. This work seeks to elaborate the picture of different data stewardship programs running in different institutional arrangements and research environments. Design…

    Submitted 21 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  48. arXiv:2311.11955  [pdf, other]

    cs.RO

    Multi-Agent Strategy Explanations for Human-Robot Collaboration

    Authors: Ravi Pandya, Michelle Zhao, Changliu Liu, Reid Simmons, Henny Admoni

    Abstract: As robots are deployed in human spaces, it is important that they are able to coordinate their actions with the people around them. Part of such coordination involves ensuring that people have a good understanding of how a robot will act in the environment. This can be achieved through explanations of the robot's policy. Much prior work in explainable AI and RL focuses on generating explanations f…

    Submitted 1 July, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: International Conference on Robotics and Automation (ICRA) 2024

  49. arXiv:2311.10730  [pdf]

    cs.CY cs.SE

    ItsSQL: Intelligent Tutoring System for SQL

    Authors: Sören Aguirre Reid, Frank Kammer, Johannes Kunz, Timon Pellekoorne, Markus Siepermann, Jonas Wölfer

    Abstract: SQL is a central component of any database course. Despite the small number of SQL commands, students struggle to practice the concepts. To overcome this challenge, we developed an intelligent tutoring system (ITS) to guide the learning process with a small effort by the lecturer. Other systems often give only basic feedback (correct or incorrect) or require hundreds of instance specific rules def…

    Submitted 14 October, 2023; originally announced November 2023.

    ACM Class: D.2.0

  50. arXiv:2311.10728  [pdf]

    cs.CY

    Improving Feedback from Automated Reviews of Student Spreadsheets

    Authors: Sören Aguirre Reid, Frank Kammer, Jonas-Ian Kuche, Pia-Doreen Ritzke, Markus Siepermann, Max Stephan, Armin Wagenknecht

    Abstract: Spreadsheets are one of the most widely used tools for end users. As a result, spreadsheets such as Excel are now included in many curricula. However, digital solutions for assessing spreadsheet assignments are still scarce in the teaching context. Therefore, we have developed an Intelligent Tutoring System (ITS) to review students' Excel submissions and provide individualized feedback automatical…

    Submitted 14 October, 2023; originally announced November 2023.

    ACM Class: D.2.0