Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Stacey, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.00462  [pdf, other

    cs.CL

    LUCID: LLM-Generated Utterances for Complex and Interesting Dialogues

    Authors: Joe Stacey, Jianpeng Cheng, John Torr, Tristan Guigue, Joris Driesen, Alexandru Coca, Mark Gaynor, Anders Johannsen

    Abstract: Spurred by recent advances in Large Language Models (LLMs), virtual assistants are poised to take a leap forward in terms of their dialogue capabilities. Yet a major bottleneck to achieving genuinely transformative task-oriented dialogue capabilities remains the scarcity of high quality data. Existing datasets, while impressive in scale, have limited domain coverage and contain few genuinely chall… ▽ More

    Submitted 3 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted at NAACL SRW 2024

    ACM Class: I.2.7

  2. arXiv:2305.13214  [pdf, other

    cs.CL

    Logical Reasoning for Natural Language Inference Using Generated Facts as Atoms

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei

    Abstract: State-of-the-art neural models can now reach human performance levels across various natural language understanding tasks. However, despite this impressive performance, models are known to learn from annotation artefacts at the expense of the underlying task. While interpretability methods can identify influential features for each prediction, there are no guarantees that these features are respon… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

  3. arXiv:2305.13067  [pdf, other

    cs.CL cs.LG

    Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation

    Authors: Joe Stacey, Marek Rei

    Abstract: Knowledge distillation optimises a smaller student model to behave similarly to a larger teacher model, retaining some of the performance benefits. While this method can improve results on in-distribution examples, it does not necessarily generalise to out-of-distribution (OOD) settings. We investigate two complementary methods for improving the robustness of the resulting student models on OOD do… ▽ More

    Submitted 24 July, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL Findings 2024

    ACM Class: I.2.7

  4. arXiv:2205.11432  [pdf, other

    cs.CL cs.LG

    Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Marek Rei

    Abstract: Current Natural Language Inference (NLI) models achieve impressive results, sometimes outperforming humans when evaluating on in-distribution test sets. However, as these models are known to learn from annotation artefacts and dataset biases, it is unclear to what extent the models are learning the task of NLI instead of learning from shallow heuristics in their training data. We address this issu… ▽ More

    Submitted 21 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

  5. arXiv:2104.08142  [pdf, other

    cs.CL cs.LG

    Supervising Model Attention with Human Explanations for Robust Natural Language Inference

    Authors: Joe Stacey, Yonatan Belinkov, Marek Rei

    Abstract: Natural Language Inference (NLI) models are known to learn from biases and artefacts within their training data, impacting how well they generalise to other unseen datasets. Existing de-biasing approaches focus on preventing the models from learning these biases, which can result in restrictive models and lower performance. We instead investigate teaching the model how a human would approach the N… ▽ More

    Submitted 1 May, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at AAAI 2022

  6. arXiv:2004.07790  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Sebastian Riedel, Tim Rocktäschel

    Abstract: Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correlations between the natural language utterances and their respective entailment classes. These artefacts are exploited by neural networks even when only considering the hypothesis and ignoring the premise, leading to unwanted biases. Belinkov et al. (2019b) proposed tackling this problem via adversari… ▽ More

    Submitted 27 May, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted at EMNLP 2020