Showing 1–27 of 27 results for author: Webber, B

Searching in archive cs.
  1. arXiv:2406.04461

    cs.CL

    Multi-Label Classification for Implicit Discourse Relation Recognition

    Authors: Wanqiu Long, N. Siddharth, Bonnie Webber

    Abstract: Discourse relations play a pivotal role in establishing coherence within textual content, uniting sentences and clauses into a cohesive narrative. The Penn Discourse Treebank (PDTB) stands as one of the most extensively utilized datasets in this domain. In PDTB-3, the annotators can assign multiple labels to an example, when they believe that multiple relations are present. Prior research in disco…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  2. arXiv:2405.20967

    cs.CL

    Superlatives in Context: Explicit and Implicit Domain Restrictions for Superlative Frames

    Authors: Valentina Pyatkin, Bonnie Webber, Ido Dagan, Reut Tsarfaty

    Abstract: Superlatives are used to single out elements with a maximal/minimal property. Semantically, superlatives perform a set comparison: something (or some things) has the min/max property out of a set. As such, superlatives provide an ideal phenomenon for studying implicit phenomena and discourse restrictions. While this comparison set is often not explicitly defined, its (implicit) restrictions can be…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 11 pages

  3. arXiv:2311.03127

    cs.CL cs.AI

    Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs

    Authors: Longyue Wang, Zhaopeng Tu, Yan Gu, Siyou Liu, Dian Yu, Qingsong Ma, Chenyang Lyu, Liting Zhou, Chao-Hong Liu, Yufeng Ma, Weiyu Chen, Yvette Graham, Bonnie Webber, Philipp Koehn, Andy Way, Yulin Yuan, Shuming Shi

    Abstract: Translating literary works has perennially stood as an elusive dream in machine translation (MT), a journey steeped in intricate challenges. To foster progress in this domain, we hold a new shared task at WMT 2023, the first edition of the Discourse-Level Literary Translation. First, we (Tencent AI Lab and China Literature Ltd.) release a copyrighted and document-level Chinese-English web novel co…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: WMT2023 Discourse-Level Literary Translation Shared Task Overview Paper

  4. arXiv:2310.15513

    cs.CL

    A Joint Matrix Factorization Analysis of Multilingual Representations

    Authors: Zheng Zhao, Yftah Ziser, Bonnie Webber, Shay B. Cohen

    Abstract: We present an analysis tool based on joint matrix factorization for comparing latent representations of multilingual and monolingual models. An alternative to probing, this tool allows us to analyze multiple sets of representations in a joint manner. Using this tool, we study to what extent and how morphosyntactic features are reflected in the representations learned by multilingual pre-trained mo…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

  5. arXiv:2301.02724

    cs.CL cs.AI

    Facilitating Contrastive Learning of Discourse Relational Senses by Exploiting the Hierarchy of Sense Relations

    Authors: Wanqiu Long, Bonnie Webber

    Abstract: Implicit discourse relation recognition is a challenging task that involves identifying the sense or senses that hold between two adjacent spans of text, in the absence of an explicit connective between them. In both PDTB-2 and PDTB-3, discourse relational senses are organized into a three-level hierarchy ranging from four broad top-level senses, to more specific senses below them. Most previous w…

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: EMNLP2022

  6. arXiv:2206.02280

    cs.CL

    Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future

    Authors: Jan-Christoph Klie, Bonnie Webber, Iryna Gurevych

    Abstract: Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that several popular datasets contain a surprising amount of annotation errors or inconsistencies. To alleviate this issue, many methods for annotation error detection…

    Submitted 25 September, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: To appear in Computational Linguistics (CL) journal

  7. arXiv:2204.00350

    cs.CL

    Revisiting Shallow Discourse Parsing in the PDTB-3: Handling Intra-sentential Implicits

    Authors: Zheng Zhao, Bonnie Webber

    Abstract: In the PDTB-3, several thousand implicit discourse relations were newly annotated within individual sentences, adding to the over 15,000 implicit relations annotated across adjacent sentences in the PDTB-2. Given that the position of the arguments to these intra-sentential implicits is no longer as well-defined as with inter-sentential implicits, a discourse par…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Accepted to CODI 2021

  8. arXiv:2109.05140

    cs.CL cs.CY cs.HC

    Refocusing on Relevance: Personalization in NLG

    Authors: Shiran Dudy, Steven Bedrick, Bonnie Webber

    Abstract: Many NLG tasks such as summarization, dialogue response, or open domain question answering focus primarily on a source text in order to generate a target response. This standard approach falls short, however, when a user's intent or context of work is not easily recoverable based solely on that source text -- a scenario that we argue is more of the rule than the exception. In this work, we argue t…

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: was accepted to EMNLP 2021 main conference

  9. arXiv:2010.08980

    cs.CL

    Querent Intent in Multi-Sentence Questions

    Authors: Laurie Burchell, Jie Chi, Tom Hosking, Nina Markl, Bonnie Webber

    Abstract: Multi-sentence questions (MSQs) are sequences of questions connected by relations which, unlike sequences of standalone questions, need to be answered as a unit. Following Rhetorical Structure Theory (RST), we recognise that different "question discourse relations" between the subparts of MSQs reflect different speaker intents, and consequently elicit different answering strategies. Correctly iden…

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: LAW XIV, COLING 2020

  10. arXiv:2010.06294

    cs.CL

    Extending Implicit Discourse Relation Recognition to the PDTB-3

    Authors: Li Liang, Zheng Zhao, Bonnie Webber

    Abstract: The PDTB-3 contains many more Implicit discourse relations than the previous PDTB-2. This is in part because implicit relations have now been annotated within sentences as well as between them. In addition, some now co-occur with explicit discourse relations, instead of standing on their own. Here we show that while this can complicate the problem of identifying the location of implicit discourse…

    Submitted 13 October, 2020; originally announced October 2020.

  11. arXiv:2009.13312

    cs.CL

    Reducing Quantity Hallucinations in Abstractive Summarization

    Authors: Zheng Zhao, Shay B. Cohen, Bonnie Webber

    Abstract: It is well-known that abstractive summaries are subject to hallucination---including material that is not supported by the original text. While summaries can be made hallucination-free by limiting them to general phrases, such summaries would fail to be very informative. Alternatively, one can try to avoid hallucinations by verifying that any specific entities in the summary appear in the original…

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: Accepted to Findings of EMNLP 2020

  12. arXiv:2003.04032

    cs.CL

    Shallow Discourse Annotation for Chinese TED Talks

    Authors: Wanqiu Long, Xinyi Cai, James E. M. Reid, Bonnie Webber, Deyi Xiong

    Abstract: Text corpora annotated with language-related properties are an important resource for the development of Language Technology. The current work contributes a new resource for Chinese Language Technology and for Chinese-English translation, in the form of a set of TED talks (some originally given in English, some in Chinese) that have been annotated with discourse relations in the style of the Penn…

    Submitted 6 April, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

  13. arXiv:1911.12091

    cs.CL cs.AI cs.IR

    Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction

    Authors: Liane Guillou, Christian Hardmeier, Preslav Nakov, Sara Stymne, Jörg Tiedemann, Yannick Versley, Mauro Cettolo, Bonnie Webber, Andrei Popescu-Belis

    Abstract: We describe the design, the evaluation setup, and the results of the 2016 WMT shared task on cross-lingual pronoun prediction. This is a classification task in which participants are asked to provide predictions on what pronoun class label should replace a placeholder value in the target-language text, provided in lemmatised and PoS-tagged form. We provided four subtasks, for the English-French an…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: cross-lingual pronoun prediction, WMT, shared task, English, German, French

    MSC Class: 68T50; ACM Class: I.2.7

    Journal ref: WMT-2016

  14. arXiv:1909.12086

    cs.CL cs.AI cs.LG

    GECOR: An End-to-End Generative Ellipsis and Co-reference Resolution Model for Task-Oriented Dialogue

    Authors: Jun Quan, Deyi Xiong, Bonnie Webber, Changjian Hu

    Abstract: Ellipsis and co-reference are common and ubiquitous especially in multi-turn dialogues. In this paper, we treat the resolution of ellipsis and co-reference in dialogue as a problem of generating omitted or referred expressions from the dialogue context. We therefore propose a unified end-to-end Generative Ellipsis and CO-reference Resolution model (GECOR) in the context of dialogue. The model can…

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: accepted to appear at EMNLP 2019

  15. arXiv:1908.10461

    cs.CL

    A survey of cross-lingual features for zero-shot cross-lingual semantic parsing

    Authors: Jingfeng Yang, Federico Fancellu, Bonnie Webber

    Abstract: The availability of corpora to train semantic parsers in English has led to significant advances in the field. Unfortunately, for languages other than English, annotation is scarce and so are developed parsers. We then ask: could a parser trained in English be applied to a language that it hasn't been trained on? To answer this question we explore zero-shot cross-lingual semantic parsing where we t…

    Submitted 27 August, 2019; originally announced August 2019.

  16. arXiv:1810.02156

    cs.CL

    Neural Networks for Cross-lingual Negation Scope Detection

    Authors: Federico Fancellu, Adam Lopez, Bonnie Webber

    Abstract: Negation scope has been annotated in several English and Chinese corpora, and highly accurate models for this task in these languages have been learned from these annotations. Unfortunately, annotations are not available in other languages. Could a model that detects negation scope be applied to a language that it hasn't been trained on? We develop neural models that learn from cross-lingual word…

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: 8 pages

  17. arXiv:1809.06641

    cs.CL cs.AI

    Talking to myself: self-dialogues as data for conversational agents

    Authors: Joachim Fainberg, Ben Krause, Mihai Dobre, Marco Damonte, Emmanuel Kahembwe, Daniel Duma, Bonnie Webber, Federico Fancellu

    Abstract: Conversational agents are gaining popularity with the increasing ubiquity of smart devices. However, training agents in a data driven manner is challenging due to a lack of suitable corpora. This paper presents a novel method for gathering topical, unstructured conversational data in an efficient way: self-dialogues through crowd-sourcing. Alongside this paper, we include a corpus of 3.6 million w…

    Submitted 19 September, 2018; v1 submitted 18 September, 2018; originally announced September 2018.

    Comments: 5 pages, 5 pages appendix, 2 figures

  18. arXiv:1711.07646

    cs.CL

    Evaluating Machine Translation Performance on Chinese Idioms with a Blacklist Method

    Authors: Yutong Shao, Rico Sennrich, Bonnie Webber, Federico Fancellu

    Abstract: Idiom translation is a challenging problem in machine translation because the meaning of idioms is non-compositional, and a literal (word-by-word) translation is likely to be wrong. In this paper, we focus on evaluating the quality of idiom translation of MT systems. We introduce a new evaluation method based on an idiom-specific blacklist of literal translations, based on the insight that the occ…

    Submitted 20 February, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: Full paper accepted by LREC, 8 pages

  19. arXiv:1709.09816

    cs.CL cs.AI

    Edina: Building an Open Domain Socialbot with Self-dialogues

    Authors: Ben Krause, Marco Damonte, Mihai Dobre, Daniel Duma, Joachim Fainberg, Federico Fancellu, Emmanuel Kahembwe, Jianpeng Cheng, Bonnie Webber

    Abstract: We present Edina, the University of Edinburgh's social bot for the Amazon Alexa Prize competition. Edina is a conversational agent whose responses utilize data harvested from Amazon Mechanical Turk (AMT) through an innovative new technique we call self-dialogues. These are conversations in which a single AMT Worker plays both participants in a dialogue. Such dialogues are surprisingly natural, eff…

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: 10 pages; submitted to the 1st Proceedings of the Alexa Prize

  20. arXiv:1702.03305

    cs.CL

    Universal Dependencies to Logical Forms with Negation Scope

    Authors: Federico Fancellu, Siva Reddy, Adam Lopez, Bonnie Webber

    Abstract: Many language technology applications would benefit from the ability to represent negation and its scope on top of widely-used linguistic resources. In this paper, we investigate the possibility of obtaining a first-order logic representation with negation scope marked using Universal Dependencies. To do so, we enhance UDepLambda, a framework that converts dependency graphs to logical forms. The r…

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: This is a draft version of the paper. We welcome any comments you may have regarding the content and presentation.

    MSC Class: 03B65

  21. arXiv:cs/0109010

    cs.CL

    Anaphora and Discourse Structure

    Authors: Bonnie Webber, Matthew Stone, Aravind Joshi, Alistair Knott

    Abstract: We argue in this paper that many common adverbial phrases generally taken to signal a discourse relation between syntactically connected units within discourse structure, instead work anaphorically to contribute relational meaning, with only indirect dependence on discourse structure. This allows a simpler discourse structure to provide scaffolding for compositional semantics, and reveals multip…

    Submitted 13 September, 2002; v1 submitted 9 September, 2001; originally announced September 2001.

    Comments: 45 pages, 17 figures. Revised resubmission to Computational Linguistics

    ACM Class: I.2.7

  22. arXiv:cs/0104022

    cs.CL

    Microplanning with Communicative Intentions: The SPUD System

    Authors: Matthew Stone, Christine Doran, Bonnie Webber, Tonia Bleam, Martha Palmer

    Abstract: The process of microplanning encompasses a range of problems in Natural Language Generation (NLG), such as referring expression generation, lexical choice, and aggregation, problems in which a generator must bridge underlying domain-specific representations and general linguistic representations. In this paper, we describe a uniform approach to microplanning based on declarative representations…

    Submitted 30 April, 2001; originally announced April 2001.

    ACM Class: I.2.7

  23. Textual Economy through Close Coupling of Syntax and Semantics

    Authors: Matthew Stone, Bonnie Webber

    Abstract: We focus on the production of efficient descriptions of objects, actions and events. We define a type of efficiency, textual economy, that exploits the hearer's recognition of inferential links to material elsewhere within a sentence. Textual economy leads to efficient descriptions because the material that supports such inferences has been included to satisfy independent communicative goals, an…

    Submitted 29 June, 1998; originally announced June 1998.

    Comments: 10 pages, uses QobiTree.tex

    Journal ref: Proceedings 1998 Int'l Workshop on Natural Language Generation, Niagara-on-the-Lake, Canada, August 1998

  24. Anchoring a Lexicalized Tree-Adjoining Grammar for Discourse

    Authors: Bonnie Lynn Webber, Aravind K. Joshi

    Abstract: We here explore a "fully" lexicalized Tree-Adjoining Grammar for discourse that takes the basic elements of a (monologic) discourse to be not simply clauses, but larger structures that are anchored on variously realized discourse cues. This link with intra-sentential grammar suggests an account for different patterns of discourse cues, while the different structures and operations suggest thre…

    Submitted 24 June, 1998; originally announced June 1998.

    Comments: 7 pages, uses aclcol.sty

    Journal ref: Proceedings of COLING-ACL'98 Workshop on Discourse Relations and Discourse Markers. (Reproduced with permission of the Universite de Montreal)

  25. Structure and Ostension in the Interpretation of Discourse Deixis

    Authors: Bonnie L. Webber

    Abstract: This paper examines demonstrative pronouns used as deictics to refer to the interpretation of one or more clauses. Although this usage is frowned upon in style manuals (for example Strunk and White (1959) state that "This. The pronoun 'this', referring to the complete sense of a preceding sentence or clause, cannot always carry the load and so may produce an imprecise statement."), it is never…

    Submitted 7 August, 1997; originally announced August 1997.

    Comments: 22 pages, uses psfig

    Journal ref: Language and Cognitive Processes 6(2), May 1991, pp. 107-135

  26. Natural Language Generation in Healthcare: Brief Review

    Authors: Alison J. Cawsey, Bonnie L. Webber, Ray B. Jones

    Abstract: Good communication is vital in healthcare, both among healthcare professionals, and between healthcare professionals and their patients. And well-written documents, describing and/or explaining the information in structured databases may be easier to comprehend, more edifying and even more convincing, than the structured data, even when presented in tabular or graphic form. Documents may be auto…

    Submitted 7 August, 1997; originally announced August 1997.

    Comments: 15 pages, to appear in the Journal of the American Medical Informatics Association

  27. Expectations in Incremental Discourse Processing

    Authors: Dan Cristea, Bonnie Lynn Webber

    Abstract: The way in which discourse features express connections back to the previous discourse has been described in the literature in terms of adjoining at the right frontier of discourse structure. But this does not allow for discourse features that express expectations about what is to come in the subsequent discourse. After characterizing these expectations and their distribution in text, we show ho…

    Submitted 5 August, 1997; originally announced August 1997.

    Comments: 9 pages, uses aclap.sty, psfig.tex

    Journal ref: Proceedings 35th Annual ACL, Madrid - June 1997