Showing 1–36 of 36 results for author: Weber, L

Searching in archive cs.
  1. arXiv:2406.06441  [pdf, other]

    cs.CL cs.AI

    Interpretability of Language Models via Task Spaces

    Authors: Lucas Weber, Jaap Jumelet, Elia Bruni, Dieuwke Hupkes

    Abstract: The usual way to interpret language models (LMs) is to test their performance on different benchmarks and subsequently infer their internal processes. In this paper, we present an alternative approach, concentrating on the quality of LM processing, with a focus on their language abilities. To this end, we construct 'linguistic task spaces' -- representations of an LM's language conceptualisation -…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: To be published at ACL 2024 (main)

  2. arXiv:2406.04766  [pdf, other]

    cs.LG math.OC stat.ML

    Reinforcement Learning and Regret Bounds for Admission Control

    Authors: Lucas Weber, Ana Bušić, Jiamin Zhu

    Abstract: The expected regret of any reinforcement learning algorithm is lower bounded by $\Omega\left(\sqrt{DXAT}\right)$ for undiscounted returns, where $D$ is the diameter of the Markov decision process, $X$ the size of the state space, $A$ the size of the action space and $T$ the number of time steps. However, this lower bound is general. A smaller regret can be obtained by taking into account some specific…

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.17202  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Efficient multi-prompt evaluation of LLMs

    Authors: Felipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin

    Abstract: Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates, which may not fully capture the LLMs' abilities and can affect the reproducibility of results on leaderboards. Many recent works empirically verify prompt sensitivity and advocate for changes in LLM evaluation. In this paper, we consider the problem of estimating the performance distribution across many prompt va…

    Submitted 7 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.02383  [pdf, other]

    stat.ML cs.AI cs.CV cs.LG

    A Fresh Look at Sanity Checks for Saliency Maps

    Authors: Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina Höhne

    Abstract: The Model Parameter Randomisation Test (MPRT) is highly recognised in the eXplainable Artificial Intelligence (XAI) community due to its fundamental evaluative criterion: explanations should be sensitive to the parameters of the model they seek to explain. However, recent studies have raised several methodological concerns for the empirical interpretation of MPRT. In response, we propose two modif…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.06465

  5. arXiv:2403.07137  [pdf, other]

    eess.IV cs.CV cs.LG

    Exploring Cluster Analysis in Nelore Cattle Visual Score Attribution

    Authors: Alexandre de Oliveira Bezerra, Rodrigo Goncalves Mateus, Vanessa Ap. de Moraes Weber, Fabricio de Lima Weber, Yasmin Alves de Arruda, Rodrigo da Costa Gomes, Gabriel Toshio Hirokawa Higa, Hemerson Pistori

    Abstract: Assessing the biotype of cattle through human visual inspection is a very common and important practice in precision cattle breeding. This paper presents the results of a correlation analysis between scores produced by humans for Nelore cattle and a variety of measurements that can be derived from images or other instruments. It also presents a study using the k-means algorithm to generate new way…

    Submitted 11 March, 2024; originally announced March 2024.

  6. arXiv:2402.14992  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    tinyBenchmarks: evaluating LLMs with fewer examples

    Authors: Felipe Maia Polo, Lucas Weber, Leshem Choshen, Yuekai Sun, Gongjun Xu, Mikhail Yurochkin

    Abstract: The versatility of large language models (LLMs) led to the creation of diverse benchmarks that thoroughly test a variety of language models' abilities. These benchmarks consist of tens of thousands of examples making evaluation of LLMs very expensive. In this paper, we investigate strategies to reduce the number of evaluations needed to assess the performance of an LLM on several key benchmarks. F…

    Submitted 26 May, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning (ICML)

  7. arXiv:2401.06465  [pdf, other]

    cs.AI cs.LG stat.ME

    Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test

    Authors: Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina MC Höhne

    Abstract: The Model Parameter Randomisation Test (MPRT) is widely acknowledged in the eXplainable Artificial Intelligence (XAI) community for its well-motivated evaluative principle: that the explanation function should be sensitive to changes in the parameters of the model function. However, recent works have identified several methodological caveats for the empirical interpretation of MPRT. To address the…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 19 pages, 12 figures, NeurIPS XAIA 2023

  8. arXiv:2312.04945  [pdf, other]

    cs.CL cs.AI cs.LG

    The ICL Consistency Test

    Authors: Lucas Weber, Elia Bruni, Dieuwke Hupkes

    Abstract: Just like the previous generation of task-tuned models, large language models (LLMs) that are adapted to tasks via prompt-based methods like in-context-learning (ICL) perform well in some setups but not in others. This lack of consistency in prompt-based learning hints at a lack of robust generalisation. We here introduce the ICL consistency test -- a contribution to the GenBench collaborative ben…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted as non-archival submission to the GenBench Workshop 2023. arXiv admin note: substantial text overlap with arXiv:2310.13486

  9. arXiv:2310.13486  [pdf, other]

    cs.CL cs.AI

    Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning

    Authors: Lucas Weber, Elia Bruni, Dieuwke Hupkes

    Abstract: Finding the best way of adapting pre-trained language models to a task is a big challenge in current NLP. Just like the previous generation of task-tuned models (TT), models that are adapted to tasks via in-context-learning (ICL) are robust in some setups but not in others. Here, we present a detailed analysis of which design choices cause instabilities and inconsistencies in LLM predictions. Firs…

    Submitted 20 October, 2023; originally announced October 2023.

  10. arXiv:2310.05442  [pdf, other]

    cs.CL

    Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

    Authors: Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber, Barbara Plank

    Abstract: Language understanding is a multi-faceted cognitive capability, which the Natural Language Processing (NLP) community has striven to model computationally for decades. Traditionally, facets of linguistic intelligence have been compartmentalized into tasks with specialized model architectures and corresponding evaluation protocols. With the advent of large language models (LLMs) the community has w…

    Submitted 23 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Main Conference), camera-ready

  11. arXiv:2308.12202  [pdf, other]

    cs.LG cs.CL

    Curriculum Learning with Adam: The Devil Is in the Wrong Details

    Authors: Lucas Weber, Jaap Jumelet, Paul Michel, Elia Bruni, Dieuwke Hupkes

    Abstract: Curriculum learning (CL) posits that machine learning models -- similar to humans -- may learn more efficiently from data that match their current learning progress. However, CL methods are still poorly understood and, in particular for natural language processing (NLP), have achieved only limited success. In this paper, we explore why. Starting from an attempt to replicate and extend a number of…

    Submitted 23 August, 2023; originally announced August 2023.

  12. arXiv:2308.12053  [pdf, other]

    cs.LG cs.AI cs.NE

    Layer-wise Feedback Propagation

    Authors: Leander Weber, Jim Berend, Alexander Binder, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: In this paper, we present Layer-wise Feedback Propagation (LFP), a novel training approach for neural-network-like predictors that utilizes explainability, specifically Layer-wise Relevance Propagation (LRP), to assign rewards to individual connections based on their respective contributions to solving a given task. This differs from traditional gradient descent, which updates parameters towards an…

    Submitted 23 August, 2023; originally announced August 2023.

    MSC Class: 68T05

  13. arXiv:2305.20045  [pdf, other]

    cs.CL cs.LG

    ActiveAED: A Human in the Loop Improves Annotation Error Detection

    Authors: Leon Weber, Barbara Plank

    Abstract: Manually annotated datasets are crucial for training and evaluating Natural Language Processing models. However, recent work has discovered that even widely-used benchmark datasets contain a substantial number of erroneous annotations. This problem has been addressed with Annotation Error Detection (AED) models, which can flag such errors for human re-annotation. However, even though many of these…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  14. arXiv:2305.10937  [pdf, ps, other]

    cs.NE q-bio.NC

    The generalized Hierarchical Gaussian Filter

    Authors: Lilian Aline Weber, Peter Thestrup Waade, Nicolas Legrand, Anna Hedvig Møller, Klaas Enno Stephan, Christoph Mathys

    Abstract: Hierarchical Bayesian models of perception and learning feature prominently in contemporary cognitive neuroscience where, for example, they inform computational concepts of mental disorders. This includes predictive coding and hierarchical Gaussian filtering (HGF), which differ in the nature of hierarchical representations. Predictive coding assumes that higher levels in a given hierarchy influenc…

    Submitted 18 May, 2023; originally announced May 2023.

  15. arXiv:2303.03915  [pdf, other]

    cs.CL cs.AI

    The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

    Authors: Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro Von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Šaško, Quentin Lhoest, Angelina McMillan-Major, Gerard Dupont, Stella Biderman, Anna Rogers, Loubna Ben allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, et al. (29 additional authors not shown)

    Abstract: As language models grow ever larger, the need for large-scale high-quality text datasets has never been more pressing, especially in multilingual settings. The BigScience workshop, a 1-year international and multidisciplinary initiative, was formed with the goal of researching and training large language models as a values-driven undertaking, putting issues of ethics, harm, and governance in the f…

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: NeurIPS 2022, Datasets and Benchmarks Track

    ACM Class: I.2.7

  16. arXiv:2211.12486  [pdf, other]

    cs.LG cs.CV

    Shortcomings of Top-Down Randomization-Based Sanity Checks for Evaluations of Deep Neural Network Explanations

    Authors: Alexander Binder, Leander Weber, Sebastian Lapuschkin, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek

    Abstract: While the evaluation of explanations is an important step towards trustworthy models, it needs to be done carefully, and the employed metrics need to be well-understood. Specifically model randomization testing is often overestimated and regarded as a sole criterion for selecting or discarding certain explanation methods. To address shortcomings of this test, we start by observing an experimental…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 23 pages

  17. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  18. arXiv:2206.15076  [pdf, other]

    cs.CL

    BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

    Authors: Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman, et al. (18 additional authors not shown)

    Abstract: Training and evaluating language models increasingly requires the construction of meta-datasets -- diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful i…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Datasets and Benchmarks Track

  19. arXiv:2205.01929  [pdf, other]

    cs.LG

    Explain to Not Forget: Defending Against Catastrophic Forgetting with XAI

    Authors: Sami Ede, Serop Baghdadlian, Leander Weber, An Nguyen, Dario Zanca, Wojciech Samek, Sebastian Lapuschkin

    Abstract: The ability to continuously process and retain new information like we do naturally as humans is a feat that is highly sought after when training neural networks. Unfortunately, the traditional optimization algorithms often require large amounts of data available during training time and updates wrt. new data are difficult after the training process has been completed. In fact, when new data or ta…

    Submitted 22 June, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 14 pages including appendix, 5 figures, 2 tables, 1 algorithm listing. v2 update increases figure readability, updates Fig 5 caption, adds our collaborators Dario and An as co-authors. v3 brings the preprint in line with the final version accepted for peer-reviewed publication at CD-MAKE 2022. v4 metadata update.

  20. arXiv:2203.08008  [pdf, other]

    cs.LG

    Beyond Explaining: Opportunities and Challenges of XAI-Based Model Improvement

    Authors: Leander Weber, Sebastian Lapuschkin, Alexander Binder, Wojciech Samek

    Abstract: Explainable Artificial Intelligence (XAI) is an emerging research field bringing transparency to highly complex and opaque machine learning (ML) models. Despite the development of a multitude of methods to explain the decisions of black-box classifiers in recent years, these tools are seldom used beyond visualization purposes. Only recently, researchers have started to employ explanations in pra…

    Submitted 15 March, 2022; originally announced March 2022.

  21. arXiv:2202.06861  [pdf, other]

    cs.LG

    Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond

    Authors: Anna Hedström, Leander Weber, Dilyara Bareeva, Daniel Krakowczyk, Franz Motzkus, Wojciech Samek, Sebastian Lapuschkin, Marina M. -C. Höhne

    Abstract: The evaluation of explanation methods is a research topic that has not yet been explored deeply, however, since explainability is supposed to strengthen trust in artificial intelligence, it is necessary to systematically review and compare explanation methods in order to confirm their correctness. Until now, no tool with focus on XAI evaluation exists that exhaustively and speedily allows research…

    Submitted 27 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 4 pages, 1 figure, 1 table

    Journal ref: Journal of Machine Learning Research, Vol. 24 (2023) 1-11

  22. arXiv:2202.06621  [pdf, other]

    cs.LG cs.AI

    Measurably Stronger Explanation Reliability via Model Canonization

    Authors: Franz Motzkus, Leander Weber, Sebastian Lapuschkin

    Abstract: While rule-based attribution methods have proven useful for providing local explanations for Deep Neural Networks, explaining modern and more varied network architectures yields new challenges in generating trustworthy explanations, since the established rule sets might not be sufficient or applicable to novel network structures. As an elegant solution to the above issue, network canonization has…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 5 pages, 4 figures

  23. arXiv:2202.03482  [pdf, other]

    cs.CV cs.AI cs.LG

    Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence

    Authors: Frederik Pahde, Maximilian Dreyer, Leander Weber, Moritz Weckbecker, Christopher J. Anders, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

    Abstract: With a growing interest in understanding neural network prediction strategies, Concept Activation Vectors (CAVs) have emerged as a popular tool for modeling human-understandable concepts in the latent space. Commonly, CAVs are computed by leveraging linear classifiers optimizing the separability of latent representations of samples with and without a given concept. However, in this paper we show t…

    Submitted 5 February, 2024; v1 submitted 7 February, 2022; originally announced February 2022.

  24. arXiv:2101.11287  [pdf, other]

    cs.CL cs.LG

    Language Modelling as a Multi-Task Problem

    Authors: Lucas Weber, Jaap Jumelet, Elia Bruni, Dieuwke Hupkes

    Abstract: In this paper, we propose to study language modelling as a multi-task problem, bringing together three strands of research: multi-task learning, linguistics, and interpretability. Based on hypotheses derived from linguistic theory, we investigate whether language models adhere to learning principles of multi-task learning during training. To showcase the idea, we analyse the generalisation behavio…

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at EACL 2021

  25. arXiv:2011.00034  [pdf, other]

    cs.RO eess.SP q-bio.NC

    Adaptive Semi-Supervised Intent Inferral to Control a Powered Hand Orthosis for Stroke

    Authors: Jingxi Xu, Cassie Meeker, Ava Chen, Lauren Winterbottom, Michaela Fraser, Sangwoo Park, Lynne M. Weber, Mitchell Miya, Dawn Nilsen, Joel Stein, Matei Ciocarlie

    Abstract: In order to provide therapy in a functional context, controls for wearable robotic orthoses need to be robust and intuitive. We have previously introduced an intuitive, user-driven, EMG-based method to operate a robotic hand orthosis, but the process of training a control that is robust to concept drift (changes in the input signal) places a substantial burden on the user. In this paper, we explor…

    Submitted 1 March, 2022; v1 submitted 30 October, 2020; originally announced November 2020.

    Comments: 7 pages; Accepted to International Conference on Robotics and Automation (ICRA) 2022

  26. arXiv:2008.07347  [pdf, other]

    cs.CL

    HunFlair: An Easy-to-Use Tool for State-of-the-Art Biomedical Named Entity Recognition

    Authors: Leon Weber, Mario Sänger, Jannes Münchmeyer, Maryam Habibi, Ulf Leser, Alan Akbik

    Abstract: Summary: Named Entity Recognition (NER) is an important step in biomedical information extraction pipelines. Tools for NER should be easy to use, cover multiple entity types, highly accurate, and robust towards variations in text genre and style. To this end, we propose HunFlair, an NER tagger covering multiple entity types integrated into the widely used NLP framework Flair. HunFlair outperforms…

    Submitted 18 August, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Corrected author list; updated project link

  27. Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

    Authors: Gary S. W. Goh, Sebastian Lapuschkin, Leander Weber, Wojciech Samek, Alexander Binder

    Abstract: Integrated Gradients as an attribution method for deep neural network models offers simple implementability. However, it suffers from noisiness of explanations which affects the ease of interpretability. The SmoothGrad technique is proposed to solve the noisiness issue and smoothen the attribution maps of any gradient-based attribution method. In this paper, we present SmoothTaylor as a novel theo…

    Submitted 2 September, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: 8 pages, 3 figures. Accepted in 25th International Conference on Pattern Recognition, (ICPR) 2020. In Proceedings: pp. 4949-4956

  28. menoci: Lightweight Extensible Web Portal enabling FAIR Data Management for Biomedical Research Projects

    Authors: Markus Suhr, Christoph Lehmann, Christian Robert Bauer, Theresa Bender, Cornelius Knopp, Luca Freckmann, Björn Öst Hansen, Christian Henke, Georg Aschenbrandt, Lea Kühlborn, Sophia Rheinländer, Linus Weber, Bartlomiej Marzec, Marcel Hellkamp, Philipp Wieder, Harald Kusch, Ulrich Sax, Sara Yasemin Nussbeck

    Abstract: Background: Biomedical research projects deal with data management requirements from multiple sources like funding agencies' guidelines, publisher policies, discipline best practices, and their own users' needs. We describe functional and quality requirements based on many years of experience implementing data management for the CRC 1002 and CRC 1190. A fully equipped data management software shou…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: Preprint. 19 pages, 2 figures

    Journal ref: BMC Bioinformatics 21, 582 (2020)

  29. arXiv:1912.11425  [pdf, other]

    cs.CV cs.LG cs.NE eess.IV

    Finding and Removing Clever Hans: Using Explanation Methods to Debug and Improve Deep Models

    Authors: Christopher J. Anders, Leander Weber, David Neumann, Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin

    Abstract: Contemporary learning models for computer vision are typically trained on very large (benchmark) datasets with millions of samples. These may, however, contain biases, artifacts, or errors that have gone unnoticed and are exploitable by the model. In the worst case, the trained model does not learn a valid and generalizable strategy to solve the problem it was trained for, and becomes a 'Clever-Ha…

    Submitted 18 December, 2020; v1 submitted 22 December, 2019; originally announced December 2019.

    Comments: 47 pages, 21 figures

  30. User-Driven Functional Movement Training with a Wearable Hand Robot after Stroke

    Authors: Sangwoo Park, Michaela Fraser, Lynne M. Weber, Cassie Meeker, Lauri Bishop, Daniel Geller, Joel Stein, Matei Ciocarlie

    Abstract: We studied the performance of a robotic orthosis designed to assist the paretic hand after stroke. It is wearable and fully user-controlled, serving two possible roles: as a therapeutic tool that facilitates device mediated hand exercises to recover neuromuscular function or as an assistive device for use in everyday activities to aid functional use of the hand. We present the clinical outcomes of…

    Submitted 2 September, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 10 pages, 15 figures, Camera ready version

  31. arXiv:1906.06187  [pdf, other]

    cs.CL cs.LO

    NLProlog: Reasoning with Weak Unification for Question Answering in Natural Language

    Authors: Leon Weber, Pasquale Minervini, Jannes Münchmeyer, Ulf Leser, Tim Rocktäschel

    Abstract: Rule-based models are attractive for various tasks because they inherently lead to interpretable and explainable decisions and can easily incorporate prior knowledge. However, such systems are difficult to apply to problems involving natural language, due to its linguistic variability. In contrast, neural models can cope very well with ambiguity by learning distributed representations of words and…

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  32. arXiv:1904.08368  [pdf, other]

    cs.LG cs.PL stat.ML

    Relay: A High-Level Compiler for Deep Learning

    Authors: Jared Roesch, Steven Lyubomirsky, Marisa Kirisame, Logan Weber, Josh Pollock, Luis Vega, Ziheng Jiang, Tianqi Chen, Thierry Moreau, Zachary Tatlock

    Abstract: Frameworks for writing, compiling, and optimizing deep learning (DL) models have recently enabled progress in areas like computer vision and natural language processing. Extending these frameworks to accommodate the rapidly diversifying landscape of DL models and hardware platforms presents challenging tradeoffs between expressivity, composability, and portability. We present Relay, a new compiler…

    Submitted 24 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  33. Relay: A New IR for Machine Learning Frameworks

    Authors: Jared Roesch, Steven Lyubomirsky, Logan Weber, Josh Pollock, Marisa Kirisame, Tianqi Chen, Zachary Tatlock

    Abstract: Machine learning powers diverse services in industry including search, translation, recommendation systems, and security. The scale and importance of these models require that they be efficient, expressive, and portable across an array of heterogeneous hardware devices. These constraints are often at odds; in order to better accommodate them we propose a new high-level intermediate representation…

    Submitted 25 September, 2018; originally announced October 2018.

  34. Multimodal Sensing and Interaction for a Robotic Hand Orthosis

    Authors: Sangwoo Park, Cassie Meeker, Lynne M. Weber, Lauri Bishop, Joel Stein, Matei Ciocarlie

    Abstract: Wearable robotic hand rehabilitation devices can allow greater freedom and flexibility than their workstation-like counterparts. However, the field is generally lacking effective methods by which the user can operate the device: such controls must be effective, intuitive, and robust to the wide range of possible impairment patterns. Even when focusing on a specific condition, such as stroke, the v…

    Submitted 18 December, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: 8 pages. 9 Figures. IEEE Robotics and Automation Letters. Preprint version. Accepted Dec, 2018

    Journal ref: IEEE Robotics and Automation Letters, Vol. 4, No. 2, pp. 315 - 322, April 2019

  35. arXiv:1804.02462  [pdf, other]

    cs.HC cs.RO

    Human Robot Interface for Assistive Grasping

    Authors: David Watkins, Chaiwen Chou, Caroline Weinberg, Jacob Varley, Kenneth Lyons, Sanjay Joshi, Lynne Weber, Joel Stein, Peter Allen

    Abstract: This work describes a new human-in-the-loop (HitL) assistive grasping system for individuals with varying levels of physical capabilities. We investigated the feasibility of using four potential input devices with our assistive grasping system interface, using able-bodied individuals to define a set of quantitative metrics that could be used to assess an assistive grasping system. We then took the…

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: 8 pages, 21 figures

  36. arXiv:1802.06131  [pdf, other]

    cs.RO

    Design and Development of Effective Transmission Mechanisms on a Tendon Driven Hand Orthosis for Stroke Patients

    Authors: Sangwoo Park, Lynne M. Weber, Lauri Bishop, Joel Stein, Matei Ciocarlie

    Abstract: Tendon-driven hand orthoses have advantages over exoskeletons with respect to wearability and safety because of their low-profile design and ability to fit a range of patients without requiring custom joint alignment. However, no existing study on a wearable tendon-driven hand orthosis for stroke patients presents evidence that such devices can overcome spasticity given repeated use and fatigue, o…

    Submitted 31 July, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

    Comments: 7 pages, ICRA 2018