Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

Stacey, Joe; Minervini, Pasquale; Dubossarsky, Haim; Rei, Marek

Computer Science > Computation and Language

arXiv:2205.11432 (cs)

[Submitted on 23 May 2022 (v1), last revised 21 Oct 2022 (this version, v3)]

Title:Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

Authors:Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Marek Rei

View PDF

Abstract:Current Natural Language Inference (NLI) models achieve impressive results, sometimes outperforming humans when evaluating on in-distribution test sets. However, as these models are known to learn from annotation artefacts and dataset biases, it is unclear to what extent the models are learning the task of NLI instead of learning from shallow heuristics in their training data. We address this issue by introducing a logical reasoning framework for NLI, creating highly transparent model decisions that are based on logical rules. Unlike prior work, we show that improved interpretability can be achieved without decreasing the predictive accuracy. We almost fully retain performance on SNLI, while also identifying the exact hypothesis spans that are responsible for each model prediction. Using the e-SNLI human explanations, we verify that our model makes sensible decisions at a span level, despite not using any span labels during training. We can further improve model performance and span-level decisions by using the e-SNLI explanations during training. Finally, our model is more robust in a reduced data setting. When training with only 1,000 examples, out-of-distribution performance improves on the MNLI matched and mismatched validation sets by 13% and 16% relative to the baseline. Training with fewer observations yields further improvements, both in-distribution and out-of-distribution.

Comments:	Accepted at EMNLP 2022
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2205.11432 [cs.CL]
	(or arXiv:2205.11432v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.11432

Submission history

From: Joe Stacey [view email]
[v1] Mon, 23 May 2022 16:24:27 UTC (226 KB)
[v2] Thu, 20 Oct 2022 12:43:33 UTC (401 KB)
[v3] Fri, 21 Oct 2022 09:26:16 UTC (442 KB)

Computer Science > Computation and Language

Title:Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators