Zum Hauptinhalt springen

Showing 1–50 of 53 results for author: Lewis, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.17484  [pdf, ps, other

    cs.HC cs.AI cs.CY

    A Survey of Accessible Explainable Artificial Intelligence Research

    Authors: Chukwunonso Henry Nwokoye, Maria J. P. Peixoto, Akriti Pandey, Lauren Pardy, Mahadeo Sukhai, Peter R. Lewis

    Abstract: The increasing integration of Artificial Intelligence (AI) into everyday life makes it essential to explain AI-based decision-making in a way that is understandable to all users, including those with disabilities. Accessible explanations are crucial as accessibility in technology promotes digital inclusion and allows everyone, regardless of their physical, sensory, or cognitive abilities, to use t… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2407.12035  [pdf, other

    cs.HC cs.AI

    Reporting Risks in AI-based Assistive Technology Research: A Systematic Review

    Authors: Zahra Ahmadi, Peter R. Lewis, Mahadeo A. Sukhai

    Abstract: Artificial Intelligence (AI) is increasingly employed to enhance assistive technologies, yet it can fail in various ways. We conducted a systematic literature review of research into AI-based assistive technology for persons with visual impairments. Our study shows that most proposed technologies with a testable prototype have not been evaluated in a human study with members of the sight-loss comm… ▽ More

    Submitted 18 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.09093  [pdf, ps, other

    cs.LG cs.AI

    On Exact Bit-level Reversible Transformers Without Changing Architectures

    Authors: Guoqiang Zhang, J. P. Lewis, W. B. Kleijn

    Abstract: In the literature, various reversible deep neural networks (DNN) models have been proposed to reduce memory consumption or improve data-throughput in the training process. However, almost all existing reversible DNNs either are constrained to have special structures or are constructed by modifying the original DNN architectures considerably to enable reversibility. In this work, we propose exact b… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2407.01501  [pdf, other

    cs.MA cs.LG cs.NE

    Online Learning of Temporal Dependencies for Sustainable Foraging Problem

    Authors: John Payne, Aishwaryaprajna, Peter R. Lewis

    Abstract: The sustainable foraging problem is a dynamic environment testbed for exploring the forms of agent cognition in dealing with social dilemmas in a multi-agent setting. The agents need to resist the temptation of individual rewards through foraging and choose the collective long-term goal of sustainability. We investigate methods of online learning in Neuro-Evolution and Deep Recurrent Q-Networks to… ▽ More

    Submitted 18 August, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 6 pages, 13 figures, accepted for publication by the Second International Workshop on Sustainability and Scalability of Self-Organisation (SaSSO 2024), DOI to be provided once published

  5. arXiv:2404.18796  [pdf, other

    cs.CL cs.AI

    Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

    Authors: Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick Lewis

    Abstract: As Large Language Models (LLMs) have become more advanced, they have outpaced our abilities to accurately evaluate their quality. Not only is finding data to adequately probe particular model properties difficult, but evaluating the correctness of a model's freeform generation alone is a challenge. To address this, many evaluations now rely on using LLMs themselves as judges to score the quality o… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2404.14469  [pdf, other

    cs.CL cs.AI

    SnapKV: LLM Knows What You are Looking for Before Generation

    Authors: Yuhong Li, Yingbing Huang, Bowen Yang, Bharat Venkitesh, Acyr Locatelli, Hanchen Ye, Tianle Cai, Patrick Lewis, Deming Chen

    Abstract: Large Language Models (LLMs) have made remarkable progress in processing extensive contexts, with the Key-Value (KV) cache playing a vital role in enhancing their performance. However, the growth of the KV cache in response to increasing input length poses challenges to memory and time efficiency. To address this problem, this paper introduces SnapKV, an innovative and fine-tuning-free approach th… ▽ More

    Submitted 16 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  7. arXiv:2403.03893  [pdf, other

    cs.CL cs.AI

    From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models

    Authors: Luiza Pozzobon, Patrick Lewis, Sara Hooker, Beyza Ermis

    Abstract: To date, toxicity mitigation in language models has almost entirely been focused on single-language settings. As language models embrace multilingual capabilities, it's crucial our safety measures keep pace. Recognizing this research gap, our approach expands the scope of conventional toxicity mitigation to address the complexities presented by multiple languages. In the absence of sufficient anno… ▽ More

    Submitted 30 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  8. arXiv:2402.15925  [pdf, other

    cs.CL cs.AI cs.IR

    MultiContrievers: Analysis of Dense Retrieval Representations

    Authors: Seraphina Goldfarb-Tarrant, Pedro Rodriguez, Jane Dwivedi-Yu, Patrick Lewis

    Abstract: Dense retrievers compress source documents into (possibly lossy) vector representations, yet there is little analysis of what information is lost versus preserved, and how it affects downstream tasks. We conduct the first analysis of the information captured by dense retrievers compared to the language models they are based on (e.g., BERT versus Contriever). We use 25 MultiBert checkpoints as rand… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  9. arXiv:2401.10284  [pdf, other

    eess.SP cs.AI cs.LG

    MorpheusNet: Resource efficient sleep stage classifier for embedded on-line systems

    Authors: Ali Kavoosi, Morgan P. Mitchell, Raveen Kariyawasam, John E. Fleming, Penny Lewis, Heidi Johansen-Berg, Hayriye Cagnan, Timothy Denison

    Abstract: Sleep Stage Classification (SSC) is a labor-intensive task, requiring experts to examine hours of electrophysiological recordings for manual classification. This is a limiting factor when it comes to leveraging sleep stages for therapeutic purposes. With increasing affordability and expansion of wearable devices, automating SSC may enable deployment of sleep-based therapies at scale. Deep Learning… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: This paper was presented at the 2023 IEEE conference on Systems, Man, and Cybernetics (SMC)

  10. arXiv:2401.00896  [pdf, other

    cs.CV

    TrailBlazer: Trajectory Control for Diffusion-Based Video Generation

    Authors: Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn

    Abstract: Within recent approaches to text-to-video (T2V) generation, achieving controllability in the synthesized video is often a challenge. Typically, this issue is addressed by providing low-level per-frame guidance in the form of edge maps, depth maps, or an existing video to be altered. However, the process of obtaining such guidance can be labor-intensive. This paper focuses on enhancing controllabil… ▽ More

    Submitted 8 April, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 14 pages, 18 figures, Project Page: https://hohonu-vicml.github.io/Trailblazer.Page/

  11. arXiv:2312.02969  [pdf, other

    cs.CL cs.IR

    Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models

    Authors: Xinyu Zhang, Sebastian Hofstätter, Patrick Lewis, Raphael Tang, Jimmy Lin

    Abstract: Listwise rerankers based on large language models (LLM) are the zero-shot state-of-the-art. However, current works in this direction all depend on the GPT models, making it a single point of failure in scientific reproducibility. Moreover, it raises the concern that the current research findings only hold for GPT models but not LLM in general. In this work, we lift this pre-condition and build for… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  12. arXiv:2310.07589  [pdf, other

    cs.AI

    Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

    Authors: Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker

    Abstract: Considerable effort has been dedicated to mitigating toxicity, but existing methods often require drastic modifications to model parameters or the use of computationally intensive auxiliary models. Furthermore, previous approaches have often neglected the crucial factor of language's evolving nature over time. In this work, we present a comprehensive perspective on toxicity mitigation that takes i… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  13. arXiv:2309.14897  [pdf, other

    cs.CV cs.GR cs.LG

    FDLS: A Deep Learning Approach to Production Quality, Controllable, and Retargetable Facial Performances

    Authors: Wan-Duo Kurt Ma, Muhammad Ghifary, J. P. Lewis, Byungkuk Choi, Haekwang Eom

    Abstract: Visual effects commonly requires both the creation of realistic synthetic humans as well as retargeting actors' performances to humanoid characters such as aliens and monsters. Achieving the expressive performances demanded in entertainment requires manipulating complex models with hundreds of parameters. Full creative control requires the freedom to make edits at any stage of the production, whic… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: DigiPro '22: The Digital Production Symposium

  14. arXiv:2307.10829  [pdf, other

    cs.CV

    Exact Diffusion Inversion via Bi-directional Integration Approximation

    Authors: Guoqiang Zhang, J. P. Lewis, W. Bastiaan Kleijn

    Abstract: Recently, various methods have been proposed to address the inconsistency issue of DDIM inversion to enable image editing, such as EDICT [36] and Null-text inversion [22]. However, the above methods introduce considerable computational overhead. In this paper, we propose a new technique, named \emph{bi-directional integration approximation} (BDIA), to perform exact diffusion inversion with neglibl… ▽ More

    Submitted 26 November, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.11328. Our code is available at https://github.com/guoqiang-zhang-x/BDIA

  15. arXiv:2307.07434  [pdf, other

    cs.CV eess.IV

    Combining multitemporal optical and SAR data for LAI imputation with BiLSTM network

    Authors: W. Zhao, F. Yin, H. Ma, Q. Wu, J. Gomez-Dans, P. Lewis

    Abstract: The Leaf Area Index (LAI) is vital for predicting winter wheat yield. Acquisition of crop conditions via Sentinel-2 remote sensing images can be hindered by persistent clouds, affecting yield predictions. Synthetic Aperture Radar (SAR) provides all-weather imagery, and the ratio between its cross- and co-polarized channels (C-band) shows a high correlation with time series LAI over winter wheat re… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  16. Training-Free Neural Matte Extraction for Visual Effects

    Authors: Sharif Elcott, J. P. Lewis, Nori Kanazawa, Christoph Bregler

    Abstract: Alpha matting is widely used in video conferencing as well as in movies, television, and social media sites. Deep learning approaches to the matte extraction problem are well suited to video conferencing due to the consistent subject matter (front-facing humans), however training-based approaches are somewhat pointless for entertainment videos where varied subjects (spaceships, monsters, etc.) may… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    ACM Class: I.4.6

    Journal ref: SIGGRAPH Asia 2022 Technical Communications

  17. arXiv:2304.12397  [pdf, other

    cs.CL cs.AI

    On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research

    Authors: Luiza Pozzobon, Beyza Ermis, Patrick Lewis, Sara Hooker

    Abstract: Perception of toxicity evolves over time and often differs between geographies and cultural backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as the Perspective API, are not static, but frequently retrained to address any unattended weaknesses and biases. We evaluate the implications of these changes on the reproducibility of findings that compare the relat… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  18. arXiv:2302.13153  [pdf, other

    cs.CV cs.GR cs.LG

    Directed Diffusion: Direct Control of Object Placement through Attention Guidance

    Authors: Wan-Duo Kurt Ma, J. P. Lewis, Avisek Lahiri, Thomas Leung, W. Bastiaan Kleijn

    Abstract: Text-guided diffusion models such as DALLE-2, Imagen, eDiff-I, and Stable Diffusion are able to generate an effectively endless variety of images given only a short text prompt describing the desired image content. In many cases the images are of very high quality. However, these models often struggle to compose scenes containing several key objects such as characters in specified positional relat… ▽ More

    Submitted 26 September, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: Our project page: https://hohonu-vicml.github.io/DirectedDiffusion.Page

  19. Reflective Artificial Intelligence

    Authors: Peter R. Lewis, Stefan Sarkadi

    Abstract: Artificial Intelligence (AI) is about making computers that do the sorts of things that minds can do, and as we progress towards this goal, we tend to increasingly delegate human tasks to machines. However, AI systems usually do these tasks with an unusual imbalance of insight and understanding: new, deeper insights are present, yet many important qualities that a human mind would have previously… ▽ More

    Submitted 27 April, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Journal ref: Minds & Machines 34, 14 (2024)

  20. arXiv:2212.10503  [pdf, other

    cs.CL cs.LG

    Mini-Model Adaptation: Efficiently Extending Pretrained Models to New Languages via Aligned Shallow Training

    Authors: Kelly Marchisio, Patrick Lewis, Yihong Chen, Mikel Artetxe

    Abstract: Prior work shows that it is possible to expand pretrained Masked Language Models (MLMs) to new languages by learning a new set of embeddings, while keeping the transformer body frozen. Despite learning a small subset of parameters, this approach is not compute-efficient, as training the new embeddings requires a full forward and backward pass over the entire model. We propose mini-model adaptation… ▽ More

    Submitted 4 July, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Findings of ACL 2023 Camera Ready

  21. arXiv:2211.09260  [pdf, other

    cs.CL

    Task-aware Retrieval with Instructions

    Authors: Akari Asai, Timo Schick, Patrick Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih

    Abstract: We study the problem of retrieval with instructions, where users of a retrieval system explicitly describe their intent along with their queries. We aim to develop a general-purpose task-aware retrieval system using multi-task instruction tuning, which can follow human-written instructions to find the best documents for a given query. We introduce the first large-scale collection of approximately… ▽ More

    Submitted 19 December, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Code, data and pretrained model checkpoints are available at https://github.com/facebookresearch/tart

  22. arXiv:2209.13331  [pdf, other

    cs.CL cs.LG

    EditEval: An Instruction-Based Benchmark for Text Improvements

    Authors: Jane Dwivedi-Yu, Timo Schick, Zhengbao Jiang, Maria Lomeli, Patrick Lewis, Gautier Izacard, Edouard Grave, Sebastian Riedel, Fabio Petroni

    Abstract: Evaluation of text generation to date has primarily focused on content created sequentially, rather than improvements on a piece of text. Writing, however, is naturally an iterative and incremental process that requires expertise in different modular skills such as fixing outdated information or making the style more consistent. Even so, comprehensive evaluation of a model's capacity to perform th… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

  23. arXiv:2208.11663  [pdf, other

    cs.CL

    PEER: A Collaborative Language Model

    Authors: Timo Schick, Jane Dwivedi-Yu, Zhengbao Jiang, Fabio Petroni, Patrick Lewis, Gautier Izacard, Qingfei You, Christoforos Nalmpantis, Edouard Grave, Sebastian Riedel

    Abstract: Textual content is often the output of a collaborative writing process: We start with an initial draft, ask for suggestions, and repeatedly make changes. Agnostic of this process, today's language models are trained to generate only the final result. As a consequence, they lack several abilities crucial for collaborative writing: They are unable to update existing texts, difficult to control and i… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  24. arXiv:2208.03299  [pdf, other

    cs.CL

    Atlas: Few-shot Learning with Retrieval Augmented Language Models

    Authors: Gautier Izacard, Patrick Lewis, Maria Lomeli, Lucas Hosseini, Fabio Petroni, Timo Schick, Jane Dwivedi-Yu, Armand Joulin, Sebastian Riedel, Edouard Grave

    Abstract: Large language models have shown impressive few-shot results on a wide range of tasks. However, when knowledge is key for such results, as is the case for tasks such as question answering and fact checking, massive parameter counts to store knowledge seem to be needed. Retrieval augmented models are known to excel at knowledge intensive tasks without the need for as many parameters, but it is uncl… ▽ More

    Submitted 16 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

  25. arXiv:2207.06220  [pdf, other

    cs.IR cs.AI

    Improving Wikipedia Verifiability with AI

    Authors: Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick Lewis, Gautier Izacard, Lucas Hosseini, Jane Dwivedi-Yu, Maria Lomeli, Timo Schick, Pierre-Emmanuel Mazaré, Armand Joulin, Edouard Grave, Sebastian Riedel

    Abstract: Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by citations. There are millions of articles available online and thousands of new articles are released each month. For this reason, finding relevant sources is a difficult task: many claims do not have any references that support them. Furthermore, even existing citations might not supp… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  26. arXiv:2204.10628  [pdf, other

    cs.CL cs.IR

    Autoregressive Search Engines: Generating Substrings as Document Identifiers

    Authors: Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni

    Abstract: Knowledge-intensive language tasks require NLP systems to both provide the correct answer and retrieve supporting evidence for it in a given corpus. Autoregressive language models are emerging as the de-facto standard for generating answers, with newer and more powerful systems emerging at an astonishing pace. In this paper we argue that all this (and future) progress can be directly applied to th… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: 9 pages

  27. arXiv:2203.11027  [pdf, other

    cs.IR cs.AI

    Reasoning over Public and Private Data in Retrieval-Based Systems

    Authors: Simran Arora, Patrick Lewis, Angela Fan, Jacob Kahn, Christopher Ré

    Abstract: Users and organizations are generating ever-increasing amounts of private data from a wide range of sources. Incorporating private data is important to personalize open-domain applications such as question-answering, fact-checking, and personal assistants. State-of-the-art systems for these tasks explicitly retrieve relevant information to a user question from a background corpus before producing… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  28. arXiv:2112.09924  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus

    Authors: Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit, Gautier Izacard, Patrick Lewis, Barlas Oğuz, Edouard Grave, Wen-tau Yih, Sebastian Riedel

    Abstract: In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inconsistent quality and noise. To this end, we propose a new setup for evaluating existing knowledge intensive tasks in which we generalize the background corpus t… ▽ More

    Submitted 24 May, 2022; v1 submitted 18 December, 2021; originally announced December 2021.

  29. arXiv:2112.07771  [pdf, other

    cs.CL cs.IR

    Boosted Dense Retriever

    Authors: Patrick Lewis, Barlas Oğuz, Wenhan Xiong, Fabio Petroni, Wen-tau Yih, Sebastian Riedel

    Abstract: We propose DrBoost, a dense retrieval ensemble inspired by boosting. DrBoost is trained in stages: each component model is learned sequentially and specialized by focusing only on retrieval mistakes made by the current ensemble. The final representation is the concatenation of the output vectors of all the component models, making it a drop-in replacement for standard dense retrievers at test time… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  30. arXiv:2110.06918  [pdf, other

    cs.CL cs.IR cs.LG

    Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

    Authors: Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

    Abstract: Despite their recent popularity and well-known advantages, dense retrievers still lag behind sparse methods such as BM25 in their ability to reliably match salient phrases and rare entities in the query and to generalize to out-of-domain data. It has been argued that this is an inherent limitation of dense models. We rebut this claim by introducing the Salient Phrase Aware Retriever (SPAR), a dens… ▽ More

    Submitted 11 November, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

  31. arXiv:2110.04374  [pdf, other

    cs.CL

    A Few More Examples May Be Worth Billions of Parameters

    Authors: Yuval Kirstain, Patrick Lewis, Sebastian Riedel, Omer Levy

    Abstract: We investigate the dynamics of increasing the number of model parameters versus the number of labeled examples across a wide variety of tasks. Our exploration reveals that while scaling parameters consistently yields performance improvements, the contribution of additional examples highly depends on the task's format. Specifically, in open question answering tasks, enlarging the training set does… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  32. arXiv:2109.01156  [pdf, other

    cs.CL cs.AI

    Challenges in Generalization in Open Domain Question Answering

    Authors: Linqing Liu, Patrick Lewis, Sebastian Riedel, Pontus Stenetorp

    Abstract: Recent work on Open Domain Question Answering has shown that there is a large discrepancy in model performance between novel test questions and those that largely overlap with training questions. However, it is unclear which aspects of novel questions make them challenging. Drawing upon studies on systematic generalization, we introduce and annotate questions according to three categories that mea… ▽ More

    Submitted 15 May, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: NAACL 2022 Findings

  33. arXiv:2107.13602  [pdf, other

    cs.CL cs.IR

    Domain-matched Pre-training Tasks for Dense Retrieval

    Authors: Barlas Oğuz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Wen-tau Yih, Sonal Gupta, Yashar Mehdad

    Abstract: Pre-training on larger datasets with ever increasing model size is now a proven recipe for increased performance across almost all NLP tasks. A notable exception is information retrieval, where additional pre-training has so far failed to produce convincing results. We show that, with the right pre-training setup, this barrier can be overcome. We demonstrate this by pre-training large bi-encoder m… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

  34. arXiv:2102.07033  [pdf, other

    cs.CL cs.AI cs.LG

    PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

    Authors: Patrick Lewis, Yuxiang Wu, Linqing Liu, Pasquale Minervini, Heinrich Küttler, Aleksandra Piktus, Pontus Stenetorp, Sebastian Riedel

    Abstract: Open-domain Question Answering models which directly leverage question-answer (QA) pairs, such as closed-book QA (CBQA) models and QA-pair retrievers, show promise in terms of speed and memory compared to conventional models which retrieve and read from text corpora. QA-pair retrievers also offer interpretable answers, a high degree of control, and are trivial to update at test time with new knowl… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

  35. arXiv:2101.00133  [pdf, other

    cs.CL cs.AI

    NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

    Authors: Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini , et al. (28 additional authors not shown)

    Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage conte… ▽ More

    Submitted 19 September, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track

  36. arXiv:2009.12756  [pdf, other

    cs.CL

    Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval

    Authors: Wenhan Xiong, Xiang Lorraine Li, Srini Iyer, Jingfei Du, Patrick Lewis, William Yang Wang, Yashar Mehdad, Wen-tau Yih, Sebastian Riedel, Douwe Kiela, Barlas Oğuz

    Abstract: We propose a simple and efficient multi-hop dense retrieval approach for answering complex open-domain questions, which achieves state-of-the-art performance on two multi-hop datasets, HotpotQA and multi-evidence FEVER. Contrary to previous work, our method does not require access to any corpus-specific information, such as inter-document hyperlinks or human-annotated entity markers, and can be ap… ▽ More

    Submitted 19 February, 2021; v1 submitted 27 September, 2020; originally announced September 2020.

  37. arXiv:2009.02252  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    KILT: a Benchmark for Knowledge Intensive Language Tasks

    Authors: Fabio Petroni, Aleksandra Piktus, Angela Fan, Patrick Lewis, Majid Yazdani, Nicola De Cao, James Thorne, Yacine Jernite, Vladimir Karpukhin, Jean Maillard, Vassilis Plachouras, Tim Rocktäschel, Sebastian Riedel

    Abstract: Challenging problems such as open-domain question answering, fact checking, slot filling and entity linking require access to large, external knowledge sources. While some models do well on individual tasks, developing general models is difficult as each task might require computationally expensive indexing of custom knowledge sources, in addition to dedicated infrastructure. To catalyze research… ▽ More

    Submitted 27 May, 2021; v1 submitted 4 September, 2020; originally announced September 2020.

    Comments: accepted at NAACL 2021

  38. arXiv:2008.02637  [pdf, ps, other

    cs.CL cs.AI

    Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets

    Authors: Patrick Lewis, Pontus Stenetorp, Sebastian Riedel

    Abstract: Ideally Open-Domain Question Answering models should exhibit a number of competencies, ranging from simply memorizing questions seen at training time, to answering novel question formulations with answers seen during training, to generalizing to completely novel questions with novel answers. However, single aggregated test set scores do not show the full picture of what capabilities models truly h… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

  39. arXiv:2005.11401  [pdf, other

    cs.CL cs.LG

    Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

    Authors: Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

    Abstract: Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for… ▽ More

    Submitted 12 April, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: Accepted at NeurIPS 2020

  40. arXiv:2005.04611  [pdf, other

    cs.CL

    How Context Affects Language Models' Factual Predictions

    Authors: Fabio Petroni, Patrick Lewis, Aleksandra Piktus, Tim Rocktäschel, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel

    Abstract: When pre-trained on large unsupervised textual corpora, language models are able to store and retrieve factual knowledge to some extent, making it possible to use them directly for zero-shot cloze-style question answering. However, storing factual knowledge in a fixed number of weights of a language model clearly has limitations. Previous approaches have successfully provided access to information… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: accepted at AKBC 2020

  41. arXiv:2004.04906  [pdf, other

    cs.CL

    Dense Passage Retrieval for Open-Domain Question Answering

    Authors: Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih

    Abstract: Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method. In this work, we show that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual-encoder fra… ▽ More

    Submitted 30 September, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  42. arXiv:2002.09758  [pdf, other

    cs.CL cs.AI cs.LG

    Unsupervised Question Decomposition for Question Answering

    Authors: Ethan Perez, Patrick Lewis, Wen-tau Yih, Kyunghyun Cho, Douwe Kiela

    Abstract: We aim to improve question answering (QA) by decomposing hard questions into simpler sub-questions that existing QA systems are capable of answering. Since labeling questions with decompositions is cumbersome, we take an unsupervised approach to produce sub-questions, also enabling us to leverage millions of questions from the internet. Specifically, we propose an algorithm for One-to-N Unsupervis… ▽ More

    Submitted 6 October, 2020; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: EMNLP 2020 Camera-Ready. Code available at https://github.com/facebookresearch/UnsupervisedDecomposition

  43. arXiv:1911.07721  [pdf, other

    cs.LG stat.ML

    Program synthesis performance constrained by non-linear spatial relations in Synthetic Visual Reasoning Test

    Authors: Lu Yihe, Scott C. Lowe, Penelope A. Lewis, Mark C. W. van Rossum

    Abstract: Despite remarkable advances in automated visual recognition by machines, some visual tasks remain challenging for machines. Fleuret et al. (2011) introduced the Synthetic Visual Reasoning Test (SVRT) to highlight this point, which required classification of images consisting of randomly generated shapes based on hidden abstract rules using only a few examples. Ellis et al. (2015) demonstrated that… ▽ More

    Submitted 19 November, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  44. arXiv:1910.07475  [pdf, other

    cs.CL cs.AI cs.LG

    MLQA: Evaluating Cross-lingual Extractive Question Answering

    Authors: Patrick Lewis, Barlas Oğuz, Ruty Rinott, Sebastian Riedel, Holger Schwenk

    Abstract: Question answering (QA) models have shown rapid progress enabled by the availability of large, high-quality benchmark datasets. Such annotated datasets are difficult and costly to collect, and rarely exist in languages other than English, making training QA systems in other languages challenging. An alternative to building large monolingual training datasets is to develop cross-lingual systems whi… ▽ More

    Submitted 3 May, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: To appear in ACL 2020

  45. arXiv:1909.01066  [pdf, other

    cs.CL

    Language Models as Knowledge Bases?

    Authors: Fabio Petroni, Tim Rocktäschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel

    Abstract: Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the training data, and may be able to answer queries structured as "fill-in-the-blank" cloze statements. Language models have many advantages over structured knowledge… ▽ More

    Submitted 4 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: accepted at EMNLP 2019

  46. arXiv:1908.01580  [pdf, other

    cs.LG stat.ML

    The HSIC Bottleneck: Deep Learning without Back-Propagation

    Authors: Wan-Duo Kurt Ma, J. P. Lewis, W. Bastiaan Kleijn

    Abstract: We introduce the HSIC (Hilbert-Schmidt independence criterion) bottleneck for training deep neural networks. The HSIC bottleneck is an alternative to the conventional cross-entropy loss and backpropagation that has a number of distinct advantages. It mitigates exploding and vanishing gradients, resulting in the ability to learn very deep networks without skip connections. There is no requirement f… ▽ More

    Submitted 5 December, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

  47. arXiv:1906.04980  [pdf, other

    cs.CL cs.AI cs.LG

    Unsupervised Question Answering by Cloze Translation

    Authors: Patrick Lewis, Ludovic Denoyer, Sebastian Riedel

    Abstract: Obtaining training data for Question Answering (QA) is time-consuming and resource-intensive, and existing QA datasets are only available for limited domains and languages. In this work, we explore to what extent high quality training data is actually required for Extractive QA, and investigate the possibility of unsupervised Extractive QA. We approach this problem by first learning to generate co… ▽ More

    Submitted 27 June, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: To appear in ACL 2019

    Journal ref: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019

  48. arXiv:1905.08126  [pdf, ps, other

    cs.NE

    Can Bio-Inspired Swarm Algorithms Scale to Modern Societal Problems

    Authors: Darren M. Chitty, Elizabeth Wanner, Rakhi Parmar, Peter R. Lewis

    Abstract: Taking inspiration from nature for meta-heuristics has proven popular and relatively successful. Many are inspired by the collective intelligence exhibited by insects, fish and birds. However, there is a question over their scalability to the types of complex problems experienced in the modern world. Natural systems evolved to solve simpler problems effectively, replicating these processes for com… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: To be presented at the ALife 2019 conference

  49. arXiv:1904.07636  [pdf, ps, other

    cs.NE

    Applying Partial-ACO to Large-scale Vehicle Fleet Optimisation

    Authors: Darren M. Chitty, Elizabeth Wanner, Rakhi Parmar, Peter R. Lewis

    Abstract: Optimisation of fleets of commercial vehicles with regards scheduling tasks from various locations to vehicles can result in considerably lower fleet traversal times. This has significant benefits including reduced expenses for the company and more importantly, a reduction in the degree of road use and hence vehicular emissions. Exact optimisation methods fail to scale to real commercial problem i… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  50. arXiv:1809.01494  [pdf, other

    cs.CL cs.LG stat.ML

    Interpretation of Natural Language Rules in Conversational Machine Reading

    Authors: Marzieh Saeidi, Max Bartolo, Patrick Lewis, Sameer Singh, Tim Rocktäschel, Mike Sheldon, Guillaume Bouchard, Sebastian Riedel

    Abstract: Most work in machine reading focuses on question answering problems where the answer is directly expressed in the text to read. However, many real-world question answering problems require the reading of text not because it contains the literal answer, but because it contains a recipe to derive an answer together with the reader's background knowledge. One example is the task of interpreting regul… ▽ More

    Submitted 28 August, 2018; originally announced September 2018.

    Comments: EMNLP 2018