Zum Hauptinhalt springen

Showing 1–33 of 33 results for author: More, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01741  [pdf, other

    cs.CR cs.AI cs.AR cs.LG

    PVF (Parameter Vulnerability Factor): A Scalable Metric for Understanding AI Vulnerability Against SDCs in Model Parameters

    Authors: Xun Jiao, Fred Lin, Harish D. Dixit, Joel Coburn, Abhinav Pandey, Han Wang, Venkat Ramesh, Jianyu Huang, Wang Xu, Daniel Moore, Sriram Sankar

    Abstract: Reliability of AI systems is a fundamental concern for the successful deployment and widespread adoption of AI technologies. Unfortunately, the escalating complexity and heterogeneity of AI hardware systems make them increasingly susceptible to hardware faults, e.g., silent data corruptions (SDC), that can potentially corrupt model parameters. When this occurs during AI inference/servicing, it can… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2403.13177  [pdf, other

    cs.RO

    User-customizable Shared Control for Robot Teleoperation via Virtual Reality

    Authors: Rui Luo, Mark Zolotas, Drake Moore, Taskin Padir

    Abstract: Shared control can ease and enhance a human operator's ability to teleoperate robots, particularly for intricate tasks demanding fine control over multiple degrees of freedom. However, the arbitration process dictating how much autonomous assistance to administer in shared control can confuse novice operators and impede their understanding of the robot's behavior. To overcome these adverse side-ef… ▽ More

    Submitted 14 August, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted at IROS 2024

  3. arXiv:2401.10688  [pdf, ps, other

    cs.IT cs.AR

    Unraveling codes: fast, robust, beyond-bound error correction for DRAM

    Authors: Mike Hamburg, Eric Linstadt, Danny Moore, Thomas Vogelsang

    Abstract: Generalized Reed-Solomon (RS) codes are a common choice for efficient, reliable error correction in memory and communications systems. These codes add $2t$ extra parity symbols to a block of memory, and can efficiently and reliably correct up to $t$ symbol errors in that block. Decoding is possible beyond this bound, but it is imperfectly reliable and often computationally expensive. Beyond-bound… ▽ More

    Submitted 27 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Changes vs first arxiv version: wordsmithing, typo corrections and citation fixes

  4. arXiv:2312.13410  [pdf, other

    cs.RO cs.HC

    Shared Affordance-awareness via Augmented Reality for Proactive Assistance in Human-robot Collaboration

    Authors: Drake Moore, Mark Zolotas, Taskin Padir

    Abstract: Enabling humans and robots to collaborate effectively requires purposeful communication and an understanding of each other's affordances. Prior work in human-robot collaboration has incorporated knowledge of human affordances, i.e., their action possibilities in the current context, into autonomous robot decision-making. This "affordance awareness" is especially promising for service robots that n… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  5. arXiv:2309.12814  [pdf, other

    cs.CV

    Domain Adaptive Few-Shot Open-Set Learning

    Authors: Debabrata Pal, Deeptej More, Sai Bhargav, Dipesh Tamboli, Vaneet Aggarwal, Biplab Banerjee

    Abstract: Few-shot learning has made impressive strides in addressing the crucial challenges of recognizing unknown samples from novel classes in target query sets and managing visual shifts between domains. However, existing techniques fall short when it comes to identifying target outliers under domain shifts by learning to reject pseudo-outliers from the source domain, resulting in an incomplete solution… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Journal ref: ICCV 2023

  6. arXiv:2307.10244  [pdf, other

    cs.IR cs.LG

    Evaluating and Enhancing Robustness of Deep Recommendation Systems Against Hardware Errors

    Authors: Dongning Ma, Xun Jiao, Fred Lin, Mengshi Zhang, Alban Desmaison, Thomas Sellinger, Daniel Moore, Sriram Sankar

    Abstract: Deep recommendation systems (DRS) heavily depend on specialized HPC hardware and accelerators to optimize energy, efficiency, and recommendation quality. Despite the growing number of hardware errors observed in large-scale fleet systems where DRS are deployed, the robustness of DRS has been largely overlooked. This paper presents the first systematic study of DRS robustness against hardware error… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  7. arXiv:2303.06311  [pdf, other

    hep-ex cs.LG physics.ins-det

    Generative Adversarial Networks for Scintillation Signal Simulation in EXO-200

    Authors: S. Li, I. Ostrovskiy, Z. Li, L. Yang, S. Al Kharusi, G. Anton, I. Badhrees, P. S. Barbeau, D. Beck, V. Belov, T. Bhatta, M. Breidenbach, T. Brunner, G. F. Cao, W. R. Cen, C. Chambers, B. Cleveland, M. Coon, A. Craycraft, T. Daniels, L. Darroch, S. J. Daugherty, J. Davis, S. Delaquis, A. Der Mesrobian-Kabakian , et al. (65 additional authors not shown)

    Abstract: Generative Adversarial Networks trained on samples of simulated or actual events have been proposed as a way of generating large simulated datasets at a reduced computational cost. In this work, a novel approach to perform the simulation of photodetector signals from the time projection chamber of the EXO-200 experiment is demonstrated. The method is based on a Wasserstein Generative Adversarial N… ▽ More

    Submitted 8 May, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: As accepted by JINST

    Journal ref: JINST 18 P06005 2023

  8. arXiv:2212.03475  [pdf, other

    cs.LG

    PyGFI: Analyzing and Enhancing Robustness of Graph Neural Networks Against Hardware Errors

    Authors: Ruixuan Wang, Fred Lin, Daniel Moore, Sriram Sankar, Xun Jiao

    Abstract: Graph neural networks (GNNs) have recently emerged as a promising learning paradigm in learning graph-structured data and have demonstrated wide success across various domains such as recommendation systems, social networks, and electronic design automation (EDA). Like other deep learning (DL) methods, GNNs are being deployed in sophisticated modern hardware systems, as well as dedicated accelerat… ▽ More

    Submitted 24 April, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  9. arXiv:2211.05617  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Debiasing Methods for Fairer Neural Models in Vision and Language Research: A Survey

    Authors: Otávio Parraga, Martin D. More, Christian M. Oliveira, Nathan S. Gavenski, Lucas S. Kupssinskü, Adilson Medronha, Luis V. Moura, Gabriel S. Simões, Rodrigo C. Barros

    Abstract: Despite being responsible for state-of-the-art results in several computer vision and natural language processing tasks, neural networks have faced harsh criticism due to some of their current shortcomings. One of them is that neural networks are correlation machines prone to model biases within the data instead of focusing on actual useful causal relationships. This problem is particularly seriou… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Submitted to ACM Computing Surveys - Special Issue on Trustworthy AI

  10. arXiv:2205.01931  [pdf, other

    cs.CV cs.LG

    Mapping the landscape of histomorphological cancer phenotypes using self-supervised learning on unlabeled, unannotated pathology slides

    Authors: Adalberto Claudio Quiros, Nicolas Coudray, Anna Yeaton, Xinyu Yang, Bojing Liu, Hortense Le, Luis Chiriboga, Afreen Karimkhan, Navneet Narula, David A. Moore, Christopher Y. Park, Harvey Pass, Andre L. Moreira, John Le Quesne, Aristotelis Tsirigos, Ke Yuan

    Abstract: Definitive cancer diagnosis and management depend upon the extraction of information from microscopy images by pathologists. These images contain complex information requiring time-consuming expert human interpretation that is prone to human bias. Supervised deep learning approaches have proven powerful for classification tasks, but they are inherently limited by the cost and quality of annotation… ▽ More

    Submitted 1 September, 2023; v1 submitted 4 May, 2022; originally announced May 2022.

  11. arXiv:2110.06021  [pdf, other

    stat.ML cs.LG

    Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modeling

    Authors: Gianluigi Silvestri, Emily Fertig, Dave Moore, Luca Ambrogioni

    Abstract: Normalizing flows have shown great success as general-purpose density estimators. However, many real world applications require the use of domain-specific knowledge, which normalizing flows cannot readily incorporate. We propose embedded-model flows (EMF), which alternate general-purpose transformations with structured layers that embed domain-specific inductive biases. These layers are automatica… ▽ More

    Submitted 15 March, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  12. arXiv:2002.04205  [pdf, other

    cs.LG cs.CV stat.ML

    Fine-grained Uncertainty Modeling in Neural Networks

    Authors: Rahul Soni, Naresh Shah, Jimmy D. Moore

    Abstract: Existing uncertainty modeling approaches try to detect an out-of-distribution point from the in-distribution dataset. We extend this argument to detect finer-grained uncertainty that distinguishes between (a). certain points, (b). uncertain points but within the data distribution, and (c). out-of-distribution points. Our method corrects overconfident NN decisions, detects outlier points and learns… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  13. arXiv:2002.03549  [pdf, other

    cs.LG cs.CV stat.ML

    Adversarial TCAV -- Robust and Effective Interpretation of Intermediate Layers in Neural Networks

    Authors: Rahul Soni, Naresh Shah, Chua Tat Seng, Jimmy D. Moore

    Abstract: Interpreting neural network decisions and the information learned in intermediate layers is still a challenge due to the opaque internal state and shared non-linear interactions. Although (Kim et al, 2017) proposed to interpret intermediate layers by quantifying its ability to distinguish a user-defined concept (from random examples), the questions of robustness (variation against the choice of ra… ▽ More

    Submitted 26 February, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  14. arXiv:2002.01184  [pdf, ps, other

    stat.CO cs.PL stat.ML

    tfp.mcmc: Modern Markov Chain Monte Carlo Tools Built for Modern Hardware

    Authors: Junpeng Lao, Christopher Suter, Ian Langmore, Cyril Chimisov, Ashish Saxena, Pavel Sountsov, Dave Moore, Rif A. Saurous, Matthew D. Hoffman, Joshua V. Dillon

    Abstract: Markov chain Monte Carlo (MCMC) is widely regarded as one of the most important algorithms of the 20th century. Its guarantees of asymptotic convergence, stability, and estimator-variance bounds using only unnormalized probability functions make it indispensable to probabilistic programming. In this paper, we introduce the TensorFlow Probability MCMC toolkit, and discuss some of the considerations… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: Based on extended abstract submitted to PROBPROG 2020

  15. arXiv:2002.00643  [pdf, other

    stat.ML cs.LG

    Automatic structured variational inference

    Authors: Luca Ambrogioni, Kate Lin, Emily Fertig, Sharad Vikram, Max Hinne, Dave Moore, Marcel van Gerven

    Abstract: Stochastic variational inference offers an attractive option as a default method for differentiable probabilistic programming. However, the performance of the variational approach depends on the choice of an appropriate variational family. Here, we introduce automatic structured variational inference (ASVI), a fully automated method for constructing structured variational families, inspired by the… ▽ More

    Submitted 10 February, 2021; v1 submitted 3 February, 2020; originally announced February 2020.

  16. arXiv:2001.11819  [pdf, ps, other

    cs.PL cs.LG stat.CO stat.ML

    Joint Distributions for TensorFlow Probability

    Authors: Dan Piponi, Dave Moore, Joshua V. Dillon

    Abstract: A central tenet of probabilistic programming is that a model is specified exactly once in a canonical representation which is usable by inference algorithms. We describe JointDistributions, a family of declarative representations of directed graphical models in TensorFlow Probability.

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: Based on extended abstract submitted to PROBPROG 2020

  17. arXiv:1911.00473  [pdf, other

    cs.CL cs.AI cs.LG

    BERT Goes to Law School: Quantifying the Competitive Advantage of Access to Large Legal Corpora in Contract Understanding

    Authors: Emad Elwany, Dave Moore, Gaurav Oberoi

    Abstract: Fine-tuning language models, such as BERT, on domain specific corpora has proven to be valuable in domains like scientific papers and biomedical text. In this paper, we show that fine-tuning BERT on legal documents similarly provides valuable improvements on NLP tasks in the legal domain. Demonstrating this outcome is significant for analyzing commercial agreements, because obtaining large legal c… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  18. arXiv:1907.04649  [pdf

    cs.AI physics.bio-ph

    Quantifying the pathways to life using assembly spaces

    Authors: Stuart M. Marshall, Douglas Moore, Alastair R. G. Murray, Sara I. Walker, Leroy Cronin

    Abstract: We have developed the concept of pathway assembly to explore the amount of extrinsic information required to build an object. To quantify this information in an agnostic way, we present a method to determine the amount of pathway assembly information contained within such an object by deconstructing the object into its irreducible parts, and then evaluating the minimum number of steps to reconstru… ▽ More

    Submitted 9 August, 2019; v1 submitted 6 July, 2019; originally announced July 2019.

    Comments: manuscript with 10 figures and supplementary data

  19. arXiv:1906.03028  [pdf, other

    stat.ML cs.LG cs.PL

    Automatic Reparameterisation of Probabilistic Programs

    Authors: Maria I. Gorinova, Dave Moore, Matthew D. Hoffman

    Abstract: Probabilistic programming has emerged as a powerful paradigm in statistics, applied science, and machine learning: by decoupling modelling from inference, it promises to allow modellers to directly reason about the processes generating data. However, the performance of inference algorithms can be dramatically affected by the parameterisation used to express a model, requiring users to transform th… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  20. arXiv:1811.06150  [pdf, ps, other

    cs.PL cs.LG stat.CO

    Effect Handling for Composable Program Transformations in Edward2

    Authors: Dave Moore, Maria I. Gorinova

    Abstract: Algebraic effects and handlers have emerged in the programming languages community as a convenient, modular abstraction for controlling computational effects. They have found several applications including concurrent programming, meta programming, and more recently, probabilistic programming, as part of Pyro's Poutines library. We investigate the use of effect handlers as a lightweight abstraction… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  21. arXiv:1811.02091  [pdf, other

    stat.ML cs.LG cs.PL

    Simple, Distributed, and Accelerated Probabilistic Programming

    Authors: Dustin Tran, Matthew Hoffman, Dave Moore, Christopher Suter, Srinivas Vasudevan, Alexey Radul, Matthew Johnson, Rif A. Saurous

    Abstract: We describe a simple, low-level approach for embedding probabilistic programming in a deep learning ecosystem. In particular, we distill probabilistic programming down to a single abstraction---the random variable. Our lightweight implementation in TensorFlow enables numerous applications: a model-parallel variational auto-encoder (VAE) with 2nd-generation tensor processing units (TPUv2s); a data-… ▽ More

    Submitted 28 November, 2018; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Appears in Neural Information Processing Systems, 2018. Code available at http://bit.ly/2JpFipt

  22. arXiv:1808.07269  [pdf, other

    hep-ex cs.CV physics.data-an physics.ins-det

    A Deep Neural Network for Pixel-Level Electromagnetic Particle Identification in the MicroBooNE Liquid Argon Time Projection Chamber

    Authors: MicroBooNE collaboration, C. Adams, M. Alrashed, R. An, J. Anthony, J. Asaadi, A. Ashkenazi, M. Auger, S. Balasubramanian, B. Baller, C. Barnes, G. Barr, M. Bass, F. Bay, A. Bhat, K. Bhattacharya, M. Bishai, A. Blake, T. Bolton, L. Camilleri, D. Caratelli, I. Caro Terrazas, R. Carr, R. Castillo Fernandez, F. Cavanna , et al. (148 additional authors not shown)

    Abstract: We have developed a convolutional neural network (CNN) that can make a pixel-level prediction of objects in image data recorded by a liquid argon time projection chamber (LArTPC) for the first time. We describe the network design, training techniques, and software tools developed to train this network. The goal of this work is to develop a complete deep neural network based data reconstruction cha… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. D 99, 092001 (2019)

  23. arXiv:1807.01466  [pdf, other

    cs.CL

    Polarity and Intensity: the Two Aspects of Sentiment Analysis

    Authors: Leimin Tian, Catherine Lai, Johanna D. Moore

    Abstract: Current multimodal sentiment analysis frames sentiment score prediction as a general Machine Learning task. However, what the sentiment score actually represents has often been overlooked. As a measurement of opinions and affective states, a sentiment score generally consists of two aspects: polarity and intensity. We decompose sentiment scores into these two aspects and study how they are conveye… ▽ More

    Submitted 4 July, 2018; originally announced July 2018.

    Comments: Published at the First Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML) of ACL 2018

  24. arXiv:1801.04601  [pdf

    cs.DC

    PACER: Peripheral Activity Completion Estimation and Recognition

    Authors: Daniel Moore, Alexander Dean

    Abstract: Embedded peripheral devices such as memories, sensors and communications interfaces are used to perform a function external to a host microcontroller. The device manufacturer typically specifies worst-case current consumption and latency estimates for each of these peripheral actions. Peripheral Activity Completion, Estimation and Recognition (PACER) is introduced as a suite of algorithms that can… ▽ More

    Submitted 14 January, 2018; originally announced January 2018.

    Comments: 8 pages, 12 figures, Presented at HIP3ES, 2018

    Report number: HIP3ES/2018/3

  25. arXiv:1711.10604  [pdf, ps, other

    cs.LG cs.AI cs.PL stat.ML

    TensorFlow Distributions

    Authors: Joshua V. Dillon, Ian Langmore, Dustin Tran, Eugene Brevdo, Srinivas Vasudevan, Dave Moore, Brian Patton, Alex Alemi, Matt Hoffman, Rif A. Saurous

    Abstract: The TensorFlow Distributions library implements a vision of probability theory adapted to the modern deep-learning paradigm of end-to-end differentiable computation. Building on two basic abstractions, it offers flexible building blocks for probabilistic computation. Distributions provide fast, numerically stable methods for generating samples and computing statistics, e.g., log density. Bijectors… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

  26. arXiv:1708.06040  [pdf, other

    cs.AI cs.LG stat.ML

    Meta-Learning MCMC Proposals

    Authors: Tongzhou Wang, Yi Wu, David A. Moore, Stuart J. Russell

    Abstract: Effective implementations of sampling-based probabilistic inference often require manually constructed, model-specific proposals. Inspired by recent progresses in meta-learning for training learning agents that can generalize to unseen environments, we propose a meta-learning approach to building effective and generalizable MCMC proposals. We parametrize the proposal as a neural network to provide… ▽ More

    Submitted 1 January, 2019; v1 submitted 20 August, 2017; originally announced August 2017.

    Comments: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada

  27. arXiv:1703.00561  [pdf, other

    cs.LG physics.geo-ph

    Signal-based Bayesian Seismic Monitoring

    Authors: David A. Moore, Stuart J. Russell

    Abstract: Detecting weak seismic events from noisy sensors is a difficult perceptual task. We formulate this task as Bayesian inference and propose a generative model of seismic events and signals across a network of spatially distributed stations. Our system, SIGVISA, is the first to directly model seismic waveforms, allowing it to incorporate a rich representation of the physics underlying the signal gene… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

    Comments: Appearing at AISTATS 2017

  28. arXiv:1511.00054  [pdf, other

    cs.LG stat.ML

    Gaussian Process Random Fields

    Authors: David A. Moore, Stuart J. Russell

    Abstract: Gaussian processes have been successful in both supervised and unsupervised machine learning tasks, but their computational complexity has constrained practical applications. We introduce a new approximation for large-scale Gaussian processes, the Gaussian Process Random Field (GPRF), in which local GPs are coupled via pairwise potentials. The GPRF likelihood is a simple, tractable, and paralleliz… ▽ More

    Submitted 30 October, 2015; originally announced November 2015.

    Comments: Advances in Neural Information Processing Systems (NIPS), 2015

  29. arXiv:cs/9812011   

    cs.OS cs.DC

    A nested transaction mechanism for LOCUS

    Authors: Erik T. Mueller, Johanna D. Moore, Gerald J. Popek

    Abstract: A working implementation of nested transactions has been produced for LOCUS, an integrated distributed operating system which provides a high degree of network transparency. Several aspects of our mechanism are novel. First, the mechanism allows a transaction to access objects directly without regard to the location of the object. Second, processes running on behalf of a single transaction may b… ▽ More

    Submitted 10 December, 1998; originally announced December 1998.

    Comments: 17 pages. Appears in: Proceedings of the Ninth ACM Symposium on Operating Systems Principles (pp. 71-87). Operating Systems Review. Vol. 17, No. 5. New York: Association for Computing Machinery. 1983

    ACM Class: H.2.4

  30. An Empirical Investigation of Proposals in Collaborative Dialogues

    Authors: Barbara Di Eugenio, Pamela W. Jordan, Johanna D. Moore, Richmond H. Thomason

    Abstract: We describe a corpus-based investigation of proposals in dialogue. First, we describe our DRI compliant coding scheme and report our inter-coder reliability results. Next, we test several hypotheses about what constitutes a well-formed proposal.

    Submitted 25 June, 1998; originally announced June 1998.

    Comments: 5 pages, colacl.sty, formulas.sty

    Journal ref: Proceedings of COLING-ACL 1998

  31. Learning Features that Predict Cue Usage

    Authors: Barbara Di Eugenio, Johanna D. Moore, Massimo Paolucci

    Abstract: Our goal is to identify the features that predict the occurrence and placement of discourse cues in tutorial explanations in order to aid in the automatic generation of explanations. Previous attempts to devise rules for text generation were based on intuition or small numbers of constructed examples. We apply a machine learning program, C4.5, to induce decision trees for cue occurrence and plac… ▽ More

    Submitted 21 October, 1997; originally announced October 1997.

    Comments: 10 pages, 2 Postscript figures, uses aclap.sty, psfig.tex

    Journal ref: Proceedings of ACL/EACL97, Madrid, 1997

  32. arXiv:cmp-lg/9406020  [pdf, ps

    cs.CL

    DPOCL: A Principled Approach to Discourse Planning

    Authors: R. Michael Young, Johanna D. Moore

    Abstract: Research in discourse processing has identified two representational requirements for discourse planning systems. First, discourse plans must adequately represent the intentional structure of the utterances they produce in order to enable a computational discourse agent to respond effectively to communicative failures \cite{MooreParisCL}. Second, discourse plans must represent the informational… ▽ More

    Submitted 10 June, 1994; originally announced June 1994.

    Journal ref: proceedings of the Seventh International Workshop on Natural Langauge Generation, Kennebunkport, ME, June, 1994

  33. Towards a Principled Representation of Discourse Plans

    Authors: R. Michael Young, Johanna D. Moore, Martha E. Pollack

    Abstract: We argue that discourse plans must capture the intended causal and decompositional relations between communicative actions. We present a planning algorithm, DPOCL, that builds plan structures that properly capture these relations, and show how these structures are used to solve the problems that plagued previous discourse planners, and allow a system to participate effectively and flexibly in an… ▽ More

    Submitted 1 June, 1994; originally announced June 1994.

    Comments: requires cogsci94.sty, psfig.sty

    Report number: ISP Technical Report# 94-2

    Journal ref: To appear in Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society, Atlanta, Ga, August, 1994