Showing 1–1 of 1 results for author: Cooney, A

  1. arXiv:2402.07321 [pdf, other]

    cs.LG cs.CL

    Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs

    Authors: Bilal Chughtai, Alan Cooney, Neel Nanda

    Abstract: How do transformer-based large language models (LLMs) store and retrieve knowledge? We focus on the most basic form of this task -- factual recall, where the model is tasked with explicitly surfacing stored facts in prompts of form `Fact: The Colosseum is in the country of'. We find that the mechanistic story behind factual recall is more complex than previously thought. It comprises several disti…

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2023 Attributing Model Behaviour at Scale Workshop