Showing 1–1 of 1 results for author: Crosbie, J

  1. arXiv:2407.07011 [pdf, other]

    cs.CL

    Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning

    Authors: J. Crosbie, E. Shutova

    Abstract: Large language models (LLMs) have shown a remarkable ability to learn and perform complex tasks through in-context learning (ICL). However, a comprehensive understanding of its internal mechanisms is still lacking. This paper explores the role of induction heads in a few-shot ICL setting. We analyse two state-of-the-art models, Llama-3-8B and InternLM2-20B, on abstract pattern recognition and NLP t…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 9 pages, 7 figures