Zum Hauptinhalt springen

Showing 1–50 of 69 results for author: Anderson, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16131  [pdf, other

    cs.CL

    Evaluating Computational Representations of Character: An Austen Character Similarity Benchmark

    Authors: Funing Yang, Carolyn Jane Anderson

    Abstract: Several systems have been developed to extract information about characters to aid computational analysis of English literature. We propose character similarity grouping as a holistic evaluation task for these pipelines. We present AustenAlike, a benchmark suite of character similarities in Jane Austen's novels. Our benchmark draws on three notions of character similarity: a structurally defined n… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2408.05894  [pdf, other

    cs.CV cs.CL

    GlyphPattern: An Abstract Pattern Recognition for Vision-Language Models

    Authors: Zixuan Wu, Yoolim Kim, Carolyn Jane Anderson

    Abstract: Vision-Language Models (VLMs) building upon the foundation of powerful large language models have made rapid progress in reasoning across visual and textual data. While VLMs perform well on vision tasks that they are trained on, our results highlight key challenges in abstract pattern recognition. We present GlyphPattern, a 954 item dataset that pairs 318 human-written descriptions of visual patte… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  3. arXiv:2407.21691  [pdf, other

    cs.CV

    Explainable Artificial Intelligence for Quantifying Interfering and High-Risk Behaviors in Autism Spectrum Disorder in a Real-World Classroom Environment Using Privacy-Preserving Video Analysis

    Authors: Barun Das, Conor Anderson, Tania Villavicencio, Johanna Lantz, Jenny Foster, Theresa Hamlin, Ali Bahrami Rad, Gari D. Clifford, Hyeokhyen Kwon

    Abstract: Rapid identification and accurate documentation of interfering and high-risk behaviors in ASD, such as aggression, self-injury, disruption, and restricted repetitive behaviors, are important in daily classroom environments for tracking intervention effectiveness and allocating appropriate resources to manage care needs. However, having a staff dedicated solely to observing is costly and uncommon i… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.01757  [pdf, other

    astro-ph.EP astro-ph.IM cs.MA physics.ao-ph physics.geo-ph

    Distributed Instruments for Planetary Surface Science: Scientific Opportunities and Technology Feasibility

    Authors: Federico Rossi, Robert C. Anderson, Saptarshi Bandyopadhyay, Erik Brandon, Ashish Goel, Joshua Vander Hook, Michael Mischna, Michaela Villarreal, Mark Wronkiewicz

    Abstract: In this paper, we assess the scientific promise and technology feasibility of distributed instruments for planetary science. A distributed instrument is an instrument designed to collect spatially and temporally correlated data from multiple networked, geographically distributed point sensors. Distributed instruments are ubiquitous in Earth science, where they are routinely employed for weather an… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2406.16955  [pdf, other

    eess.SP cs.CV cs.LG

    SRViT: Vision Transformers for Estimating Radar Reflectivity from Satellite Observations at Scale

    Authors: Jason Stock, Kyle Hilburn, Imme Ebert-Uphoff, Charles Anderson

    Abstract: We introduce a transformer-based neural network to generate high-resolution (3km) synthetic radar reflectivity fields at scale from geostationary satellite imagery. This work aims to enhance short-term convective-scale forecasts of high-impact weather events and aid in data assimilation for numerical weather prediction over the United States. Compared to convolutional approaches, which have limite… ▽ More

    Submitted 28 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Published as a workshop paper at "Machine Learning for Earth System Modeling", ICML 2024; added acknowledgements and github link

  6. arXiv:2404.18774  [pdf

    cond-mat.supr-con cs.AI

    Self-training superconducting neuromorphic circuits using reinforcement learning rules

    Authors: M. L. Schneider, E. M. Jué, M. R. Pufall, K. Segall, C. W. Anderson

    Abstract: Reinforcement learning algorithms are used in a wide range of applications, from gaming and robotics to autonomous vehicles. In this paper we describe a set of reinforcement learning-based local weight update rules and their implementation in superconducting hardware. Using SPICE circuit simulations, we implement a small-scale neural network with a learning time of order one nanosecond. This netwo… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 15 pages, 6 figures

  7. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  8. arXiv:2402.01969  [pdf, other

    cs.LG eess.SP

    Simulation-Enhanced Data Augmentation for Machine Learning Pathloss Prediction

    Authors: Ahmed P. Mohamed, Byunghyun Lee, Yaguang Zhang, Max Hollingsworth, C. Robert Anderson, James V. Krogmeier, David J. Love

    Abstract: Machine learning (ML) offers a promising solution to pathloss prediction. However, its effectiveness can be degraded by the limited availability of data. To alleviate these challenges, this paper introduces a novel simulation-enhanced data augmentation method for ML pathloss prediction. Our method integrates synthetic data generated from a cellular coverage simulator and independently collected re… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 6 pages, 5 figures, Accepted at ICC 2024

  9. How Beginning Programmers and Code LLMs (Mis)read Each Other

    Authors: Sydney Nguyen, Hannah McLean Babe, Yangtian Zi, Arjun Guha, Carolyn Jane Anderson, Molly Q Feldman

    Abstract: Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluat… ▽ More

    Submitted 7 July, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Published in CHI 2024

  10. Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images

    Authors: Carol Anderson, Phil Crone

    Abstract: Text segmentation, the task of dividing a document into sections, is often a prerequisite for performing additional natural language processing tasks. Existing text segmentation methods have typically been developed and tested using clean, narrative-style text with segments containing distinct topics. Here we consider a challenging text segmentation task: dividing newspaper marriage announcement l… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 8 pages, 4 figures

    ACM Class: I.2.7; I.7.5

    Journal ref: 2020 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, 2021, pp. 5543-5550

  11. arXiv:2312.12450  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

    Authors: Federico Cassano, Luisa Li, Akul Sethi, Noah Shinn, Abby Brennan-Jones, Jacob Ginesin, Edward Berman, George Chakhnashvili, Anton Lozhkov, Carolyn Jane Anderson, Arjun Guha

    Abstract: A significant amount of research is focused on developing and evaluating large language models for a variety of code synthesis tasks. These include synthesizing code from natural language, synthesizing tests from code, and synthesizing explanations of code. In contrast, the behavior of instructional code editing with LLMs is understudied. These are tasks in which the model is provided a block of c… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  12. arXiv:2310.13304  [pdf, other

    cs.HC

    "Living Within Four Walls": Exploring Emotional and Social Dynamics in Mobile Usage During Home Confinement

    Authors: Nan Gao, Sam Nolan, Kaixin Ji, Shakila Khan Rumi, Judith Simone Heinisch, Christoph Anderson, Klaus David, Flora D. Salim

    Abstract: Home confinement, a situation experienced by individuals for reasons ranging from medical quarantines, rehabilitation needs, disability accommodations, and remote working, is a common yet impactful aspect of modern life. While essential in various scenarios, confinement within the home environment can profoundly influence psychological well-being and digital device usage. In this study, we delve i… ▽ More

    Submitted 8 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  13. Autonomous Systems' Safety Cases for use in UK Nuclear Environments

    Authors: Christopher R. Anderson, Louise A. Dennis

    Abstract: An overview of the process to develop a safety case for an autonomous robot deployment on a nuclear site in the UK is described and a safety case for a hypothetical robot incorporating AI is presented. This forms a first step towards a deployment, showing what is possible now and what may be possible with development of tools. It forms the basis for further discussion between nuclear site licensee… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: In Proceedings AREA 2023, arXiv:2310.00333

    Journal ref: EPTCS 391, 2023, pp. 83-88

  14. A Large Language Model Approach to Educational Survey Feedback Analysis

    Authors: Michael J. Parker, Caitlin Anderson, Claire Stone, YeaRim Oh

    Abstract: This paper assesses the potential for the large language models (LLMs) GPT-4 and GPT-3.5 to aid in deriving insight from education feedback surveys. Exploration of LLM use cases in education has focused on teaching and learning, with less exploration of capabilities in education feedback analysis. Survey analysis in education involves goals such as finding gaps in curricula or evaluating teachers,… ▽ More

    Submitted 26 June, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Journal ref: Int J Artif Intell Educ (2024)

  15. Julia as a unifying end-to-end workflow language on the Frontier exascale system

    Authors: William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter

    Abstract: We evaluate Julia as a single language and ecosystem paradigm powered by LLVM to develop workflow components for high-performance computing. We run a Gray-Scott, 2-variable diffusion-reaction application using a memory-bound, 7-point stencil kernel on Frontier, the US Department of Energy's first exascale supercomputer. We evaluate the performance, scaling, and trade-offs of (i) the computational… ▽ More

    Submitted 27 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 11 pages, 8 figures, accepted at the 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23), IEEE/ACM The International Conference for High Performance Computing, Networking, Storage, and Analysis, SC23

  16. arXiv:2308.09895  [pdf, other

    cs.PL cs.LG

    Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

    Authors: Federico Cassano, John Gouwar, Francesca Lucchetti, Claire Schlesinger, Anders Freeman, Carolyn Jane Anderson, Molly Q Feldman, Michael Greenberg, Abhinav Jangda, Arjun Guha

    Abstract: Over the past few years, Large Language Models of Code (Code LLMs) have started to have a significant impact on programming practice. Code LLMs are also emerging as building blocks for research in programming languages and software engineering. However, Code LLMs produce impressive results on programming languages that are well represented in their training data (e.g., Java, Python, or JavaScript)… ▽ More

    Submitted 10 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  17. arXiv:2307.08692  [pdf, other

    eess.SY cs.LG

    A Multiobjective Reinforcement Learning Framework for Microgrid Energy Management

    Authors: M. Vivienne Liu, Patrick M. Reed, David Gold, Garret Quist, C. Lindsay Anderson

    Abstract: The emergence of microgrids (MGs) has provided a promising solution for decarbonizing and decentralizing the power grid, mitigating the challenges posed by climate change. However, MG operations often involve considering multiple objectives that represent the interests of different stakeholders, leading to potentially complex conflicts. To tackle this issue, we propose a novel multi-objective rein… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: This work will be submitted to the IEEE Transactions on Smart Grid for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  18. arXiv:2306.12255  [pdf, other

    cs.CL

    Solving and Generating NPR Sunday Puzzles with Large Language Models

    Authors: Jingmiao Zhao, Carolyn Jane Anderson

    Abstract: We explore the ability of large language models to solve and generate puzzles from the NPR Sunday Puzzle game show using PUZZLEQA, a dataset comprising 15 years of on-air puzzles. We evaluate four large language models using PUZZLEQA, in both multiple choice and free response formats, and explore two prompt engineering techniques to improve free response performance: chain-of-thought reasoning and… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: To appear in the Proceedings of the 14th International Conference on Computational Creativity (ICCC)

  19. arXiv:2306.04556  [pdf, other

    cs.LG cs.HC cs.SE

    StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code

    Authors: Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson

    Abstract: Code LLMs are being rapidly deployed and there is evidence that they can make professional programmers more productive. Current benchmarks for code generation measure whether models generate correct programs given an expert prompt. In this paper, we present a new benchmark containing multiple prompts per problem, written by a specific population of non-expert prompters: beginning programmers. Stud… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  20. Keep It Simple: Fault Tolerance Evaluation of Federated Learning with Unreliable Clients

    Authors: Victoria Huang, Shaleeza Sohail, Michael Mayo, Tania Lorido Botran, Mark Rodrigues, Chris Anderson, Melanie Ooi

    Abstract: Federated learning (FL), as an emerging artificial intelligence (AI) approach, enables decentralized model training across multiple devices without exposing their local training data. FL has been increasingly gaining popularity in both academia and industry. While research works have been proposed to improve the fault tolerance of FL, the real impact of unreliable devices (e.g., dropping out, misc… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  21. arXiv:2305.06161  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu , et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  22. arXiv:2302.08584  [pdf, other

    eess.SP cs.RO eess.SY

    Propagation Measurements and Analyses at 28 GHz via an Autonomous Beam-Steering Platform

    Authors: Bharath Keshavamurthy, Yaguang Zhang, Christopher R. Anderson, Nicolo Michelusi, James V. Krogmeier, David J. Love

    Abstract: This paper details the design of an autonomous alignment and tracking platform to mechanically steer directional horn antennas in a sliding correlator channel sounder setup for 28 GHz V2X propagation modeling. A pan-and-tilt subsystem facilitates uninhibited rotational mobility along the yaw and pitch axes, driven by open-loop servo units and orchestrated via inertial motion controllers. A geo-pos… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 6 pages, 18 figures, 2 tables; Accepted at IEEE International Conference on Communications (ICC) 2023: Paper #1570867736

    Report number: ICC Paper #1570867736

  23. arXiv:2301.03988  [pdf, other

    cs.SE cs.AI cs.LG

    SantaCoder: don't reach for the stars!

    Authors: Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo , et al. (16 additional authors not shown)

    Abstract: The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigat… ▽ More

    Submitted 24 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  24. arXiv:2212.04478  [pdf, other

    physics.ao-ph cs.LG

    An Interpretable Model of Climate Change Using Correlative Learning

    Authors: Charles Anderson, Jason Stock

    Abstract: Determining changes in global temperature and precipitation that may indicate climate change is complicated by annual variations. One approach for finding potential climate change indicators is to train a model that predicts the year from annual means of global temperatures and precipitations. Such data is available from the CMIP6 ensemble of simulations. Here a two-hidden-layer neural network tra… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 Workshop - Tackling Climate Change with Machine Learning, 4 page limit w/ appendix

  25. arXiv:2210.12185  [pdf, other

    cs.CV cs.AI cs.LG

    Attention-Based Scattering Network for Satellite Imagery

    Authors: Jason Stock, Chuck Anderson

    Abstract: Multi-channel satellite imagery, from stacked spectral bands or spatiotemporal data, have meaningful representations for various atmospheric properties. Combining these features in an effective manner to create a performant and trustworthy model is of utmost importance to forecasters. Neural networks show promise, yet suffer from unintuitive computations, fusion of high-level features, and may be… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 Workshop - Tackling Climate Change with Machine Learning, 4 page limit w/ appendix

  26. arXiv:2208.08227  [pdf, other

    cs.LG cs.PL

    MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

    Authors: Federico Cassano, John Gouwar, Daniel Nguyen, Sydney Nguyen, Luna Phipps-Costin, Donald Pinckney, Ming-Ho Yee, Yangtian Zi, Carolyn Jane Anderson, Molly Q Feldman, Arjun Guha, Michael Greenberg, Abhinav Jangda

    Abstract: Large language models have demonstrated the ability to generate both natural language and programming language text. Such models open up the possibility of multi-language code generation: could code generation models generalize knowledge from one language to another? Although contemporary code generation models can generate semantically correct Python code, little is known about their abilities wi… ▽ More

    Submitted 19 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  27. arXiv:2207.03405  [pdf, other

    cs.HC

    Investigating the Effects of Mood & Usage Behaviour on Notification Response Time

    Authors: Judith S. Heinisch, Nan Gao, Christoph Anderson, Shohreh Deldari, Klaus David, Flora Salim

    Abstract: Notifications are one of the most prevailing mechanisms on smartphones and personal computers to convey timely and important information. Despite these benefits, smartphone notifications demand individuals' attention and can cause stress and frustration when delivered at inopportune timings. This paper investigates the effect of individuals' smartphone usage behavior and mood on notification respo… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  28. arXiv:2205.06351  [pdf, other

    cs.LG

    Interpretable Climate Change Modeling With Progressive Cascade Networks

    Authors: Charles Anderson, Jason Stock, David Anderson

    Abstract: Typical deep learning approaches to modeling high-dimensional data often result in complex models that do not easily reveal a new understanding of the data. Research in the deep learning field is very actively pursuing new methods to interpret deep neural networks and to reduce their complexity. An approach is described here that starts with linear models and incrementally adds complexity only as… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

  29. arXiv:2205.03355  [pdf, other

    cs.LG

    Trainable Wavelet Neural Network for Non-Stationary Signals

    Authors: Jason Stock, Chuck Anderson

    Abstract: This work introduces a wavelet neural network to learn a filter-bank specialized to fit non-stationary signals and improve interpretability and performance for digital signal processing. The network uses a wavelet transform as the first layer of a neural network where the convolution is a parameterized function of the complex Morlet wavelet. Experimental results, on both simplified data and atmosp… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: AI for Earth and Space Science Workshop at the International Conference on Learning Representations (ICLR), April, 2022

  30. arXiv:2201.10511  [pdf, other

    eess.IV cs.CV cs.LG

    Initial Investigations Towards Non-invasive Monitoring of Chronic Wound Healing Using Deep Learning and Ultrasound Imaging

    Authors: Maja Schlereth, Daniel Stromer, Yash Mantri, Jason Tsujimoto, Katharina Breininger, Andreas Maier, Caesar Anderson, Pranav S. Garimella, Jesse V. Jokerst

    Abstract: Chronic wounds including diabetic and arterial/venous insufficiency injuries have become a major burden for healthcare systems worldwide. Demographic changes suggest that wound care will play an even bigger role in the coming decades. Predicting and monitoring response to therapy in wound care is currently largely based on visual inspection with little information on the underlying tissue. Thus, t… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 6 pages, 2 figures, accepted by BVM conference proceedings 2022

  31. Statistical detection of format dialects using the weighted Dowker complex

    Authors: Michael Robinson, Letitia W. Li, Cory Anderson, Steve Huntsman

    Abstract: This paper provides an experimentally validated, probabilistic model of file behavior when consumed by a set of pre-existing parsers. File behavior is measured by way of a standardized set of Boolean "messages" produced as the files are read. By thresholding the posterior probability that a file exhibiting a particular set of messages is from a particular dialect, our model yields a practical clas… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 15 pages, 11 figures, 5 tables

    MSC Class: 62P30; 55U10 ACM Class: D.3.4

  32. arXiv:2111.15641  [pdf, ps, other

    cs.CL

    Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

    Authors: Carol Anderson, Bo Liu, Anas Abidin, Hoo-Chang Shin, Virginia Adams

    Abstract: Social media posts contain potentially valuable information about medical conditions and health-related behavior. Biocreative VII Task 3 focuses on mining this information by recognizing mentions of medications and dietary supplements in tweets. We approach this task by fine tuning multiple BERT-style language models to perform token-level classification, and combining them into ensembles to gener… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: Submission to the BioCreative VII challenge - Track-3

  33. arXiv:2111.15622  [pdf, other

    cs.CL

    Chemical Identification and Indexing in PubMed Articles via BERT and Text-to-Text Approaches

    Authors: Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin

    Abstract: The Biocreative VII Track-2 challenge consists of named entity recognition, entity-linking (or entity-normalization), and topic indexing tasks -- with entities and topics limited to chemicals for this challenge. Named entity recognition is a well-established problem and we achieve our best performance with BERT-based BioMegatron models. We extend our BERT-based approach to the entity linking task.… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: Submission to the BioCreative VII challenge - Track-2

  34. arXiv:2111.15617  [pdf, other

    cs.CL

    Text Mining Drug/Chemical-Protein Interactions using an Ensemble of BERT and T5 Based Models

    Authors: Virginia Adams, Hoo-Chang Shin, Carol Anderson, Bo Liu, Anas Abidin

    Abstract: In Track-1 of the BioCreative VII Challenge participants are asked to identify interactions between drugs/chemicals and proteins. In-context named entity annotations for each drug/chemical and protein are provided and one of fourteen different interactions must be automatically predicted. For this relation extraction task, we attempt both a BERT-based sentence classification approach, and a more n… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: Submission to the BioCreative VII challenge, Track-1

  35. arXiv:2110.07106  [pdf, other

    eess.SP cs.RO eess.SY

    A Robotic Antenna Alignment and Tracking System for Millimeter Wave Propagation Modeling

    Authors: Bharath Keshavamurthy, Yaguang Zhang, Christopher R. Anderson, Nicolo Michelusi, James V. Krogmeier, David J. Love

    Abstract: In this paper, we discuss the design of a sliding-correlator channel sounder for 28 GHz propagation modeling on the NSF POWDER testbed in Salt Lake City, UT. Beam-alignment is mechanically achieved via a fully autonomous robotic antenna tracking platform, designed using commercial off-the-shelf components. Equipped with an Apache Zookeeper/Kafka managed fault-tolerant publish-subscribe framework,… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: Submitted to -- and yet to be presented (and archived) -- in the proceedings of the 2022 USNC-URSI National Radio Science Meeting (NRSM)

    Report number: Paper Number: 1182

  36. arXiv:2110.03091  [pdf, other

    cs.CV

    Improving Fractal Pre-training

    Authors: Connor Anderson, Ryan Farrell

    Abstract: The deep neural networks used in modern computer vision systems require enormous image datasets to train them. These carefully-curated datasets typically have a million or more images, across a thousand or more distinct categories. The process of creating and curating such a dataset is a monumental undertaking, demanding extensive effort and labelling expense and necessitating careful navigation o… ▽ More

    Submitted 17 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted to WACV 2022. 15 pages, 15 figures. Added note about error, removed erroneous result

  37. arXiv:2109.05049  [pdf, other

    cs.PL

    Solver-based Gradual Type Migration

    Authors: Luna Phipps-Costin, Carolyn Jane Anderson, Michael Greenberg, Arjun Guha

    Abstract: Gradually typed languages allow programmers to mix statically and dynamically typed code, enabling them to incrementally reap the benefits of static typing as they add type annotations to their code. However, this type migration process is typically a manual effort with limited tool support. This paper examines the problem of \emph{automated type migration}: given a dynamic program, infer addition… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  38. Fair Comparison: Quantifying Variance in Resultsfor Fine-grained Visual Categorization

    Authors: Matthew Gwilliam, Adam Teuscher, Connor Anderson, Ryan Farrell

    Abstract: For the task of image classification, researchers work arduously to develop the next state-of-the-art (SOTA) model, each bench-marking their own performance against that of their predecessors and of their peers. Unfortunately, the metric used most frequently to describe a model's performance, average categorization accuracy, is often used in isolation. As the number of classes increases, such as i… ▽ More

    Submitted 7 September, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: Accepted at WACV 2021; 8 pages text, 2 pages bib, 12 figures

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), January, 2021, pages 3309-3318

  39. Towards Social Role-Based Interruptibility Management

    Authors: Christoph Anderson, Judith Simone Heinisch, Shohreh Deldari, Flora D. Salim, Sandra Ohly, Klaus David, Veljko Pejovic

    Abstract: Pervasive and ubiquitous computing facilitates immediate access to information in the sense of always-on. Information such as news, messages, or reminders can significantly enhance our daily routines but are rendered useless or disturbing when not being aligned with our intrinsic interruptibility preferences. Attention management systems use machine learning to identify short-term opportune moment… ▽ More

    Submitted 18 December, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 10 pages, 6 figures, submitted on December 2022, to appear in IEEE Pervasive Computing, Special Issue - Human-Centered AI

  40. Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning & HPC Workloads

    Authors: Evangelos Georganas, Dhiraj Kalamkar, Sasikanth Avancha, Menachem Adelman, Deepti Aggarwal, Cristina Anderson, Alexander Breuer, Jeremy Bruestle, Narendra Chaudhary, Abhisek Kundu, Denise Kutnick, Frank Laub, Vasimuddin Md, Sanchit Misra, Ramanarayan Mohanty, Hans Pabst, Brian Retford, Barukh Ziv, Alexander Heinecke

    Abstract: During the past decade, novel Deep Learning (DL) algorithms, workloads and hardware have been developed to tackle a wide range of problems. Despite the advances in workload and hardware ecosystems, the programming methodology of DL systems is stagnant. DL workloads leverage either highly-optimized, yet platform-specific and inflexible kernels from DL libraries, or in the case of novel operators, r… ▽ More

    Submitted 30 November, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

  41. arXiv:2103.16673  [pdf, other

    cs.RO

    A Kinematic Model for Trajectory Prediction in General Highway Scenarios

    Authors: Cyrus Anderson, Ram Vasudevan, Matthew Johnson-Roberson

    Abstract: Highway driving invariably combines high speeds with the need to interact closely with other drivers. Prediction methods enable autonomous vehicles (AVs) to anticipate drivers' future trajectories and plan accordingly. Kinematic methods for prediction have traditionally ignored the presence of other drivers, or made predictions only for a limited set of scenarios. Data-driven approaches fill this… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 8 pages, 4 figures, 1 table

  42. arXiv:2103.10520  [pdf, ps, other

    cs.DB

    Optimally Summarizing Data by Small Fact Sets for Concise Answers to Voice Queries

    Authors: Immanuel Trummer, Connor Anderson

    Abstract: Our goal is to find combinations of facts that optimally summarize data sets. We consider this problem in the context of voice query interfaces for simple, exploratory data analysis. Here, the system answers voice queries with a short summary of relevant data. Finding optimal voice data summaries is computationally expensive. Prior work in this domain has exploited sampling and incremental process… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  43. arXiv:2007.12353  [pdf, other

    cs.HC cs.CY

    Exploring the Impact of COVID-19 Lockdown on Social Roles and Emotions while Working from Home

    Authors: Sam Nolan, Shakila Khan Rumi, Christoph Anderson, Klaus David, Flora D. Salim

    Abstract: In the opening months of 2020, COVID-19 changed the way for which people work, forcing more people to work from home. This research investigates the impact of COVID-19 on five researchers' work and private roles, happiness, and mobile and desktop activity patterns. Desktop and smartphone application usage were gathered before and during COVID-19. Individuals' roles and happiness were captured thro… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: 9 pages, Accepted at The New Future of Work Symposium, Microsoft, 2020

  44. arXiv:2006.13190  [pdf, other

    cs.CV

    Facing the Hard Problems in FGVC

    Authors: Connor Anderson, Matt Gwilliam, Adam Teuscher, Andrew Merrill, Ryan Farrell

    Abstract: In fine-grained visual categorization (FGVC), there is a near-singular focus in pursuit of attaining state-of-the-art (SOTA) accuracy. This work carefully analyzes the performance of recent SOTA methods, quantitatively, but more importantly, qualitatively. We show that these models universally struggle with certain "hard" images, while also making complementary mistakes. We underscore the importan… ▽ More

    Submitted 24 June, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 17 pages, 6 figures, 2 tables; fixed typo, minor adjustment to format, added equations

  45. arXiv:2006.00962  [pdf, other

    cs.RO

    Off The Beaten Sidewalk: Pedestrian Prediction In Shared Spaces For Autonomous Vehicles

    Authors: Cyrus Anderson, Ram Vasudevan, Matthew Johnson-Roberson

    Abstract: Pedestrians and drivers interact closely in a wide range of environments. Autonomous vehicles (AVs) correspondingly face the need to predict pedestrians' future trajectories in these same environments. Traditional model-based prediction methods have been limited to making predictions in highly structured scenes with signalized intersections, marked crosswalks, or curbs. Deep learning methods have… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 8 pages, 4 figures, 2 tables

  46. arXiv:1912.04252  [pdf, other

    physics.app-ph cs.LG

    Automated Classification of Helium Ingress in Irradiated X-750

    Authors: Chris Anderson, Jacob Klein, Heygaan Rajakumar, Colin Judge, Laurent K Beland

    Abstract: Imaging nanoscale features using transmission electron microscopy is key to predicting and assessing the mechanical behavior of structural materials in nuclear reactors. Analyzing these micrographs is often a tedious and labour intensive manual process. It is a prime candidate for automation. Here, a region-based convolutional neural network is adapted to detect helium bubbles in micrographs of ne… ▽ More

    Submitted 28 May, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

  47. arXiv:1910.02052  [pdf, other

    eess.SP cs.AI cs.LG

    AI Assisted Annotator using Reinforcement Learning

    Authors: V. Ratna Saripalli, Gopal Avinash, Dibyajyoti Pati, Michael Potter, Charles W. Anderson

    Abstract: Healthcare data suffers from both noise and lack of ground truth. The cost of data increases as it is cleaned and annotated in healthcare. Unlike other data sets, medical data annotation, which is critical to accurate ground truth, requires medical domain expertise for a better patient outcome. In this work, we report on the use of reinforcement learning to mimic the decision making process of ann… ▽ More

    Submitted 11 June, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 10 pages

  48. arXiv:1909.05227  [pdf, other

    cs.RO eess.SP

    On-Demand Trajectory Predictions for Interaction Aware Highway Driving

    Authors: Cyrus Anderson, Ram Vasudevan, Matthew Johnson-Roberson

    Abstract: Highway driving places significant demands on human drivers and autonomous vehicles (AVs) alike due to high speeds and the complex interactions in dense traffic. Merging onto the highway poses additional challenges by limiting the amount of time available for decision-making. Predicting others' trajectories accurately and quickly is crucial to safely executing maneuvers. Many existing prediction m… ▽ More

    Submitted 2 March, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: 8 pages, 4 figures, 3 tables

  49. The Impact of Private and Work-Related Smartphone Usage on Interruptibility

    Authors: Christoph Anderson, Judith Simone Heinisch, Sandra Ohly, Klaus David, Veljko Pejovic

    Abstract: In the last decade, the effects of interruptions through mobile notifications have been extensively researched in the field of Human-Computer Interaction. Breakpoints in tasks and activities, cognitive load, and personality traits have all been shown to correlate with individuals' interruptibility. However, concepts that explain interruptibility in a broader sense are needed to provide a holistic… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: 6 pages, 3 figures

    Journal ref: Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2019 International Symposium on Wearable Computers (UbiComp/ISWC '19 Adjunct)

  50. Stochastic Sampling Simulation for Pedestrian Trajectory Prediction

    Authors: Cyrus Anderson, Xiaoxiao Du, Ram Vasudevan, Matthew Johnson-Roberson

    Abstract: Urban environments pose a significant challenge for autonomous vehicles (AVs) as they must safely navigate while in close proximity to many pedestrians. It is crucial for the AV to correctly understand and predict the future trajectories of pedestrians to avoid collision and plan a safe path. Deep neural networks (DNNs) have shown promising results in accurately predicting pedestrian trajectories,… ▽ More

    Submitted 5 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures and 2 tables