Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Tigges, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10827  [pdf, other

    cs.LG cs.CL

    LLM Circuit Analyses Are Consistent Across Training and Scale

    Authors: Curt Tigges, Michael Hanna, Qinan Yu, Stella Biderman

    Abstract: Most currently deployed large language models (LLMs) undergo continuous training or additional finetuning. By contrast, most research into LLMs' internal mechanisms focuses on models at one snapshot in time (the end of pre-training), raising the question of whether their results generalize to real-world settings. Existing studies of mechanisms over time focus on encoder-only or toy models, which d… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2401.12947  [pdf, other

    cs.CL cs.AI cs.FL cs.LO cs.PL

    Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

    Authors: Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer

    Abstract: This paper investigates the ability of transformer-based models to learn structural recursion from examples. Recursion is a universal concept in both natural and formal languages. Structural recursion is central to the programming language and formal mathematics tasks where symbolic tools currently excel beyond neural models, such as inferring semantic relations between datatypes and emulating pro… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.14699

  3. arXiv:2310.15154  [pdf, other

    cs.LG cs.AI cs.CL

    Linear Representations of Sentiment in Large Language Models

    Authors: Curt Tigges, Oskar John Hollinsworth, Atticus Geiger, Neel Nanda

    Abstract: Sentiment is a pervasive feature in natural language text, yet it is an open question how sentiment is represented within Large Language Models (LLMs). In this study, we reveal that across a range of models, sentiment is represented linearly: a single direction in activation space mostly captures the feature across a range of tasks with one extreme for positive and the other for negative. Through… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  4. arXiv:2305.14699  [pdf, other

    cs.LG cs.AI cs.LO cs.PL

    Can Transformers Learn to Solve Problems Recursively?

    Authors: Shizhuo Dylan Zhang, Curt Tigges, Stella Biderman, Maxim Raginsky, Talia Ringer

    Abstract: Neural networks have in recent years shown promise for helping software engineers write programs and even formally verify them. While semantic information plays a crucial part in these processes, it remains unclear to what degree popular neural architectures like transformers are capable of modeling that information. This paper examines the behavior of neural networks learning algorithms relevant… ▽ More

    Submitted 25 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.