Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Kao, J C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01641  [pdf, other

    cs.MA cs.AI

    Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

    Authors: John L. Zhou, Weizhe Hong, Jonathan C. Kao

    Abstract: Emergent cooperation among self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naïve reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging class of opponent-shaping methods have demonstrated the ability to re… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures

  2. arXiv:2406.01538  [pdf, other

    cs.CL cs.AI

    What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores

    Authors: Ebrahim Feghhi, Nima Hadidi, Bryan Song, Idan A. Blank, Jonathan C. Kao

    Abstract: Given the remarkable capabilities of large language models (LLMs), there has been a growing interest in evaluating their similarity to the human brain. One approach towards quantifying this similarity is by measuring how well a model predicts neural signals, also called "brain score". Internal representations from LLMs achieve state-of-the-art brain scores, leading to speculation that they share c… ▽ More

    Submitted 20 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures in the main paper

  3. arXiv:2010.02459  [pdf, other

    cs.LG cs.IT stat.ML

    Usable Information and Evolution of Optimal Representations During Training

    Authors: Michael Kleinman, Alessandro Achille, Daksh Idnani, Jonathan C. Kao

    Abstract: We introduce a notion of usable information contained in the representation learned by a deep network, and use it to study how optimal representations for the task emerge during training. We show that the implicit regularization coming from training with Stochastic Gradient Descent with a high learning-rate and small batch size plays an important role in learning minimal sufficient representations… ▽ More

    Submitted 28 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: ICLR 2021