Showing 1–22 of 22 results for author: Pergola, G

  1. arXiv:2407.01585  [pdf, other]

    cs.HC cs.AI cs.IR

    DrugWatch: A Comprehensive Multi-Source Data Visualisation Platform for Drug Safety Information

    Authors: Artem Bobrov, Domantas Saltenis, Zhaoyue Sun, Gabriele Pergola, Yulan He

    Abstract: Drug safety research is crucial for maintaining public health, often requiring comprehensive data support. However, the resources currently available to the public are limited and fail to provide a comprehensive understanding of the relationship between drugs and their side effects. This paper introduces DrugWatch, an easy-to-use and interactive multi-source information visualisation platform for…

    Submitted 18 June, 2024; originally announced July 2024.

    Comments: 10 pages, 14 figures, accepted by ACL 2024 Demo Track

  2. arXiv:2406.18449  [pdf, other]

    cs.CL cs.AI

    Cascading Large Language Models for Salient Event Graph Generation

    Authors: Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

    Abstract: Generating event graphs from long documents is challenging due to the inherent complexity of multiple tasks involved such as detecting events, identifying their relationships, and reconciling unstructured input with structured graphs. Recent studies typically consider all events with equal importance, failing to distinguish salient events crucial for understanding narratives. This paper presents C…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 9 + 12 pages

  3. arXiv:2405.02040  [pdf]

    cs.CL

    Large Multimodal Model based Standardisation of Pathology Reports with Confidence and their Prognostic Significance

    Authors: Ethar Alzaid, Gabriele Pergola, Harriet Evans, David Snead, Fayyaz Minhas

    Abstract: Pathology reports are rich in clinical and pathological details but are often presented in free-text format. The unstructured nature of these reports presents a significant challenge limiting the accessibility of their content. In this work, we present a practical approach based on the use of large multimodal models (LMMs) for automatically extracting information from scanned images of pathology r…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 19 pages, 6 figures

  4. arXiv:2404.01532  [pdf, other]

    cs.CL cs.IR

    Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

    Authors: Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

    Abstract: Event temporal graphs have been shown as convenient and effective representations of complex temporal relations between events in text. Recent studies, which employ pre-trained language models to auto-regressively generate linearised graphs for constructing event temporal graphs, have shown promising results. However, these methods have often led to suboptimal graph generation as the linearised gr…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024. 9 + 10 pages

  5. arXiv:2402.15663  [pdf, other]

    cs.CL

    Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study

    Authors: Zhaoyue Sun, Gabriele Pergola, Byron C. Wallace, Yulan He

    Abstract: With the advent of large language models (LLMs), there has been growing interest in exploring their potential for medical applications. This research aims to investigate the ability of LLMs, specifically ChatGPT, in the context of pharmacovigilance event extraction, of which the main goal is to identify and extract adverse events or potential therapeutic events from textual medical sources. We con…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 14 pages, 2 figures, accepted by EACL 2024

  6. arXiv:2308.08239  [pdf, other]

    cs.CL

    MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation

    Authors: Junru Lu, Siyu An, Mingbao Lin, Gabriele Pergola, Yulan He, Di Yin, Xing Sun, Yunsheng Wu

    Abstract: We propose MemoChat, a pipeline for refining instructions that enables large language models (LLMs) to effectively employ self-composed memos for maintaining consistent long-range open-domain conversations. We demonstrate a long-range open-domain conversation through iterative "memorization-retrieval-response" cycles. This requires us to carefully design tailored tuning instructions for each disti…

    Submitted 22 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  7. arXiv:2305.04522  [pdf, other]

    cs.CL

    Event Knowledge Incorporation with Posterior Regularization for Event-Centric Question Answering

    Authors: Junru Lu, Gabriele Pergola, Lin Gui, Yulan He

    Abstract: We propose a simple yet effective strategy to incorporate event knowledge extracted from event trigger annotations via posterior regularization to improve the event reasoning capability of mainstream question-answering (QA) models for event-centric QA. In particular, we define event-related knowledge constraints based on the event trigger annotations in the QA datasets, and subsequently use them t…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Work in progress

  8. arXiv:2302.05574  [pdf, other]

    cs.CL

    NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization

    Authors: Junru Lu, Jiazheng Li, Byron C. Wallace, Yulan He, Gabriele Pergola

    Abstract: Accessing medical literature is difficult for laypeople as the content is written for specialists and contains medical jargon. Automated text simplification methods offer a potential means to address this issue. In this work, we propose a summarize-then-simplify two-stage strategy, which we call NapSS, identifying the relevant content to simplify while ensuring that the original narrative flow is…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Findings of EACL 2023

  9. arXiv:2302.04985  [pdf, other]

    cs.CL cs.LG

    Event Temporal Relation Extraction with Bayesian Translational Model

    Authors: Xingwei Tan, Gabriele Pergola, Yulan He

    Abstract: Existing models to extract temporal relations between events lack a principled method to incorporate external knowledge. In this study, we introduce Bayesian-Trans, a Bayesian learning-based method that models the temporal relation representations as latent variables and infers their values via Bayesian inference and translational functions. Compared to conventional neural approaches, instead of p…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 9 pages + 2

  10. arXiv:2210.12902  [pdf, other]

    cs.CL

    Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

    Authors: Junru Lu, Xingwei Tan, Gabriele Pergola, Lin Gui, Yulan He

    Abstract: Human reading comprehension often requires reasoning of event semantic relations in narratives, represented by Event-centric Question-Answering (QA). To address event-centric QA, we propose a novel QA model with contrastive learning and invertible event transformation, called TranCLR. Our proposed model utilizes an invertible transformation matrix to project semantic vectors of events into a common…

    Submitted 13 December, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  11. arXiv:2210.12560  [pdf, other]

    cs.CL

    PHEE: A Dataset for Pharmacovigilance Event Extraction from Text

    Authors: Zhaoyue Sun, Jiazheng Li, Gabriele Pergola, Byron C. Wallace, Bino John, Nigel Greene, Joseph Kim, Yulan He

    Abstract: The primary goal of drug safety researchers and regulators is to promptly identify adverse drug reactions. Doing so may in turn prevent or reduce the harm to patients and ultimately improve public health. Evaluating and monitoring drug safety (i.e., pharmacovigilance) involves analyzing an ever growing collection of spontaneous reports from health professionals, physicians, and pharmacists, and in…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: 17 pages, 3 figures, EMNLP2022 accepted

  12. arXiv:2205.03296  [pdf, other]

    cs.CL

    Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media

    Authors: Lixing Zhu, Zheng Fang, Gabriele Pergola, Rob Procter, Yulan He

    Abstract: Building models to detect vaccine attitudes on social media is challenging because of the composite, often intricate aspects involved, and the limited availability of annotated data. Existing approaches have relied heavily on supervised training that requires abundant annotations and pre-defined aspect categories. Instead, with the aim of leveraging the large amount of unannotated data now availab…

    Submitted 20 June, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

  13. arXiv:2109.05527  [pdf, other]

    cs.CL

    Extracting Event Temporal Relations via Hyperbolic Geometry

    Authors: Xingwei Tan, Gabriele Pergola, Yulan He

    Abstract: Detecting events and their evolution through time is a crucial task in natural language understanding. Recent neural approaches to event temporal relation extraction typically map events to embeddings in the Euclidean space and train a classifier to detect temporal relations between event pairs. However, embeddings in the Euclidean space cannot capture richer asymmetric relations such as event tem…

    Submitted 8 June, 2024; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: Accepted by EMNLP 2021, 9 pages + 4 pages (References and Appendix)

  14. arXiv:2106.03518  [pdf, other]

    cs.CL cs.AI

    Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction

    Authors: Hanqi Yan, Lin Gui, Gabriele Pergola, Yulan He

    Abstract: The Emotion Cause Extraction (ECE) task aims to identify clauses which contain emotion-evoking information for a particular emotion expressed in text. We observe that a widely-used ECE dataset exhibits a bias that the majority of annotated cause clauses are either directly before their associated emotion clauses or are the emotion clauses themselves. Existing models for ECE tend to explore such r…

    Submitted 18 December, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: ACL2021 Main Conference, Oral paper

  15. arXiv:2106.01071  [pdf, other]

    cs.CL

    Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection

    Authors: Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou, Yulan He

    Abstract: Emotion detection in dialogues is challenging as it often requires the identification of thematic topics underlying a conversation, the relevant commonsense knowledge, and the intricate transition patterns between the affective states. In this paper, we propose a Topic-Driven Knowledge-Aware Transformer to handle the challenges above. We firstly design a topic-augmented language model (LM) with an…

    Submitted 2 June, 2021; originally announced June 2021.

  16. arXiv:2102.08366  [pdf, other]

    cs.CL cs.IR cs.LG

    Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies

    Authors: Gabriele Pergola, Elena Kochkina, Lin Gui, Maria Liakata, Yulan He

    Abstract: Biomedical question-answering (QA) has gained increased attention for its capability to provide users with high-quality information from a vast scientific literature. Although an increasing number of biomedical QA datasets has been recently made available, those resources are still rather limited and expensive to produce. Transfer learning via pre-trained language models (LMs) has been shown as a…

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: EACL 2021 - Short Paper - European Chapter of the Association for Computational Linguistics

  17. arXiv:2101.11656  [pdf, other]

    q-bio.QM cs.LG eess.IV

    G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for Biomarker Identification and Disease Classification

    Authors: Sayan Ghosal, Qiang Chen, Giulio Pergola, Aaron L. Goldman, William Ulrich, Karen F. Berman, Giuseppe Blasi, Leonardo Fazio, Antonio Rampino, Alessandro Bertolino, Daniel R. Weinberger, Venkata S. Mattay, Archana Venkataraman

    Abstract: We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers. Our model consists of an encoder, a decoder and a classifier. The encoder learns a non-linear subspace shared between the input data modalities. The classifier and the decoder act as regularizers to ensure that the low-dimensional encoding capt…

    Submitted 27 January, 2021; originally announced January 2021.

  18. arXiv:2101.10150  [pdf, other]

    cs.LG

    Adversarial Learning of Poisson Factorisation Model for Gauging Brand Sentiment in User Reviews

    Authors: Runcong Zhao, Lin Gui, Gabriele Pergola, Yulan He

    Abstract: In this paper, we propose the Brand-Topic Model (BTM) which aims to detect brand-associated polarity-bearing topics from product reviews. Different from existing models for sentiment-topic extraction which assume topics are grouped under discrete sentiment categories such as 'positive', 'negative' and 'neutral', BTM is able to automatically infer real-valued brand-associated sentiment scores and ge…

    Submitted 25 January, 2021; originally announced January 2021.

  19. arXiv:2011.00519  [pdf, other]

    cs.CL

    CHIME: Cross-passage Hierarchical Memory Network for Generative Review Question Answering

    Authors: Junru Lu, Gabriele Pergola, Lin Gui, Binyang Li, Yulan He

    Abstract: We introduce CHIME, a cross-passage hierarchical memory network for question answering (QA) via text generation. It extends XLNet introducing an auxiliary memory module consisting of two components: the context memory collecting cross-passage evidences, and the answer memory working as a buffer continually refining the generated answers. Empirically, we show the efficacy of the proposed architectu…

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: COLING 2020

  20. A Disentangled Adversarial Neural Topic Model for Separating Opinions from Plots in User Reviews

    Authors: Gabriele Pergola, Lin Gui, Yulan He

    Abstract: The flexibility of the inference process in Variational Autoencoders (VAEs) has recently led to revising traditional probabilistic topic models giving rise to Neural Topic Models (NTMs). Although these approaches have achieved significant results, surprisingly very little work has been done on how to disentangle the latent topics. Existing topic models when applied to reviews may extract topics as…

    Submitted 19 June, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2021

  21. arXiv:1911.10180  [pdf, other]

    cs.CL

    Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context

    Authors: Gabriele Pergola, Yulan He, David Lowe

    Abstract: Making sense of words often requires to simultaneously examine the surrounding context of a term as well as the global themes characterizing the overall corpus. Several topic models have already exploited word embeddings to recognize local context, however, it has been weakly combined with the global context during the topic inference. This paper proposes to extract topical phrases corroborating t…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: The 2nd AAAI Workshop on Health Intelligence, AAAI18

  22. TDAM: a Topic-Dependent Attention Model for Sentiment Analysis

    Authors: Gabriele Pergola, Lin Gui, Yulan He

    Abstract: We propose a topic-dependent attention model for sentiment classification and topic extraction. Our model assumes that a global topic embedding is shared across documents and employs an attention mechanism to derive local topic embedding for words and sentences. These are subsequently incorporated in a modified Gated Recurrent Unit (GRU) for sentiment classification and extraction of topics bearin…

    Submitted 18 August, 2019; originally announced August 2019.

    Journal ref: Information Processing & Management, 56 (6), 102084, July 2019