Showing 1–2 of 2 results for author: Vazquez, J J

Searching in archive cs.
  1. arXiv:2407.12220  [pdf, other]

    cs.LG cs.CL cs.CY

    Questionable practices in machine learning

    Authors: Gavin Leech, Juan J. Vazquez, Misha Yagudin, Niclas Kupper, Laurence Aitchison

    Abstract: Evaluating modern ML models is hard. The strong incentive for researchers and companies to report a state-of-the-art result on some metric often leads to questionable research practices (QRPs): bad practices which fall short of outright research fraud. We describe 43 such practices which can undermine reported results, giving examples where possible. Our list emphasises the evaluation of large lan…

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2308.10248  [pdf, other]

    cs.CL cs.LG

    Activation Addition: Steering Language Models Without Optimization

    Authors: Alexander Matt Turner, Lisa Thiergart, Gavin Leech, David Udell, Juan J. Vazquez, Ulisse Mini, Monte MacDiarmid

    Abstract: Reliably controlling the behavior of large language models is a pressing open problem. Existing methods include supervised finetuning, reinforcement learning from human feedback, prompt engineering and guided decoding. We instead investigate activation engineering: modifying activations at inference-time to predictably alter model behavior. We bias the forward pass with a 'steering vector' implici…

    Submitted 4 June, 2024; v1 submitted 20 August, 2023; originally announced August 2023.