Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Sherburn, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07436  [pdf, other

    cs.LG cs.AI

    Can Language Models Explain Their Own Classification Behavior?

    Authors: Dane Sherburn, Bilal Chughtai, Owain Evans

    Abstract: Large language models (LLMs) perform well at a myriad of tasks, but explaining the processes behind this performance is a challenge. This paper investigates whether LLMs can give faithful high-level explanations of their own internal processes. To explore this, we introduce a dataset, ArticulateRules, of few-shot text-based classification tasks generated by simple rules. Each rule is associated wi… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  2. arXiv:1904.05811  [pdf, other

    cs.LG cs.AI stat.ML

    Relational Graph Attention Networks

    Authors: Dan Busbridge, Dane Sherburn, Pietro Cavallo, Nils Y. Hammerla

    Abstract: We investigate Relational Graph Attention Networks, a class of models that extends non-relational graph attention mechanisms to incorporate relational information, opening up these methods to a wider variety of problems. A thorough evaluation of these models is performed, and comparisons are made against established benchmarks. To provide a meaningful comparison, we retrain Relational Graph Convol… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: 10 pages + 8 pages of appendices. Layer implementation available at https://github.com/Babylonpartners/rgat/