Showing 1–3 of 3 results for author: Chadwick, M J

Searching in archive cs.
  1. arXiv:2211.15006  [pdf, other]

    cs.LG cs.CL

    Fine-tuning language models to find agreement among humans with diverse preferences

    Authors: Michiel A. Bakker, Martin J. Chadwick, Hannah R. Sheahan, Michael Henry Tessler, Lucy Campbell-Gillingham, Jan Balaguer, Nat McAleese, Amelia Glaese, John Aslanides, Matthew M. Botvinick, Christopher Summerfield

    Abstract: Recent work in large language modeling (LLMs) has used fine-tuning to align outputs with the preferences of a prototypical user. This work assumes that human preferences are static and homogeneous across individuals, so that aligning to a single "generic" user will confer more general alignment. Here, we embrace the heterogeneity of human preferences to consider a different challenge: how might…

    Submitted 27 November, 2022; originally announced November 2022.

  2. arXiv:2103.04982  [pdf, other]

    cs.MA cs.AI cs.GT

    A multi-agent reinforcement learning model of reputation and cooperation in human groups

    Authors: Kevin R. McKee, Edward Hughes, Tina O. Zhu, Martin J. Chadwick, Raphael Koster, Antonio Garcia Castaneda, Charlie Beattie, Thore Graepel, Matt Botvinick, Joel Z. Leibo

    Abstract: Collective action demands that individuals efficiently coordinate how much, where, and when to cooperate. Laboratory experiments have extensively explored the first part of this process, demonstrating that a variety of social-cognitive mechanisms influence how much individuals choose to invest in group efforts. However, experimental research has been unable to shed light on how social cognitive me…

    Submitted 22 February, 2023; v1 submitted 8 March, 2021; originally announced March 2021.

  3. arXiv:2001.10913  [pdf, other]

    cs.LG cs.AI

    MEMO: A Deep Network for Flexible Combination of Episodic Memories

    Authors: Andrea Banino, Adrià Puigdomènech Badia, Raphael Köster, Martin J. Chadwick, Vinicius Zambaldi, Demis Hassabis, Caswell Barry, Matthew Botvinick, Dharshan Kumaran, Charles Blundell

    Abstract: Recent research developing neural network architectures with external memory has often used the benchmark bAbI question and answering dataset, which provides a challenging number of tasks requiring reasoning. Here we employed a classic associative inference task from the memory-based reasoning neuroscience literature in order to more carefully probe the reasoning capacity of existing memory-augmen…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 9 pages, 2 figures, 3 tables, to be published as a conference paper at ICLR 2020

    ACM Class: I.2.6