Skip to main content

Showing 1–20 of 20 results for author: Schaeffer, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14549  [pdf, other

    cs.CV cs.LG q-bio.NC

    Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models

    Authors: Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

    Abstract: The proliferation of large language models has revolutionized natural language processing tasks, yet it raises profound concerns regarding data privacy and security. Language models are trained on extensive corpora including potentially sensitive or proprietary information, and the risk of data leakage -- where the model response reveals pieces of such information -- remains inadequately understoo… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.12785  [pdf, other

    cs.LG

    In-Context Learning of Energy Functions

    Authors: Rylan Schaeffer, Mikail Khona, Sanmi Koyejo

    Abstract: In-context learning is a powerful capability of certain machine learning models that arguably underpins the success of today's frontier AI models. However, in-context learning is critically limited to settings where the in-context distribution of interest $p_θ^{ICL}( x|\mathcal{D})$ can be straightforwardly expressed and/or parameterized by the model; for instance, language modeling relies on expr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 1st Workshop on In-Context Learning at the 41st International Conference on Machine Learning, Vienna, Austria. 2024. arXiv admin note: text overlap with arXiv:2402.10202

  3. arXiv:2406.10229  [pdf, other

    cs.LG cs.AI

    Quantifying Variance in Evaluation Benchmarks

    Authors: Lovish Madaan, Aaditya K. Singh, Rylan Schaeffer, Andrew Poulton, Sanmi Koyejo, Pontus Stenetorp, Sharan Narang, Dieuwke Hupkes

    Abstract: Evaluation benchmarks are the cornerstone of measuring capabilities of large language models (LLMs), as well as driving progress in said capabilities. Originally designed to make claims about capabilities (or lack thereof) in fully pretrained models, evaluation benchmarks are now also extensively used to decide between various training choices. Despite this widespread usage, we rarely quantify the… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.09366  [pdf, other

    cs.LG cs.CV q-bio.NC

    Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations

    Authors: Rylan Schaeffer, Victor Lecomte, Dhruv Bhandarkar Pai, Andres Carranza, Berivan Isik, Alyssa Unell, Mikail Khona, Thomas Yerxa, Yann LeCun, SueYeon Chung, Andrey Gromov, Ravid Shwartz-Ziv, Sanmi Koyejo

    Abstract: Maximum Manifold Capacity Representations (MMCR) is a recent multi-view self-supervised learning (MVSSL) method that matches or surpasses other leading MVSSL methods. MMCR is intriguing because it does not fit neatly into any of the commonplace MVSSL lineages, instead originating from a statistical mechanical perspective on the linear separability of data manifolds. In this paper, we seek to impro… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.04391  [pdf, other

    cs.LG cs.AI cs.CL

    Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

    Authors: Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo

    Abstract: Predictable behavior from scaling advanced AI systems is an extremely desirable property. Although a well-established literature exists on how pretraining performance scales, the literature on how particular downstream capabilities scale is significantly muddier. In this work, we take a step back and ask: why has predicting specific downstream capabilities with scale remained elusive? While many f… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2404.01413  [pdf, other

    cs.LG cs.AI cs.CL cs.ET stat.ML

    Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

    Authors: Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

    Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs? Recent investigations into model-data feedback loops proposed that such loops would lead to a phenomenon termed model collapse, under which performance progressively degrades with each model-data feedback iteration… ▽ More

    Submitted 29 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  7. arXiv:2402.10202  [pdf, other

    cs.LG

    Bridging Associative Memory and Probabilistic Modeling

    Authors: Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

    Abstract: Associative memory and probabilistic modeling are two fundamental topics in artificial intelligence. The first studies recurrent neural networks designed to denoise, complete and retrieve data, whereas the second studies learning and sampling from probability distributions. Based on the observation that associative memory's energy functions can be seen as probabilistic modeling's negative log like… ▽ More

    Submitted 13 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  8. arXiv:2401.06059  [pdf, other

    cs.CL cs.AI cs.LG

    Investigating Data Contamination for Pre-training Language Models

    Authors: Minhao Jiang, Ken Ziyu Liu, Ming Zhong, Rylan Schaeffer, Siru Ouyang, Jiawei Han, Sanmi Koyejo

    Abstract: Language models pre-trained on web-scale corpora demonstrate impressive capabilities on diverse downstream tasks. However, there is increasing concern whether such capabilities might arise from evaluation datasets being included in the pre-training corpus -- a phenomenon known as \textit{data contamination} -- in a manner that artificially increases performance. There has been little understanding… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  9. arXiv:2312.03096  [pdf, other

    cs.LG cs.AI cs.NE

    What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes

    Authors: Victor Lecomte, Kushal Thaman, Rylan Schaeffer, Naomi Bashkansky, Trevor Chow, Sanmi Koyejo

    Abstract: Polysemantic neurons -- neurons that activate for a set of unrelated features -- have been seen as a significant obstacle towards interpretability of task-optimized deep networks, with implications for AI safety. The classic origin story of polysemanticity is that the data contains more ``features" than neurons, such that learning to perform a task forces the network to co-allocate multiple unrela… ▽ More

    Submitted 13 February, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  10. arXiv:2311.02316  [pdf, other

    cs.LG cs.NE

    Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells

    Authors: Rylan Schaeffer, Mikail Khona, Tzuhsuan Ma, Cristóbal Eyzaguirre, Sanmi Koyejo, Ila Rani Fiete

    Abstract: To solve the spatial problems of mapping, localization and navigation, the mammalian lineage has developed striking spatial representations. One important spatial representation is the Nobel-prize winning grid cells: neurons that represent self-location, a local and aperiodic quantity, with seemingly bizarre non-local and spatially periodic activity patterns of a few discrete periods. Why has the… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  11. arXiv:2309.08632  [pdf, other

    cs.CL cs.AI

    Pretraining on the Test Set Is All You Need

    Authors: Rylan Schaeffer

    Abstract: Inspired by recent work demonstrating the promise of smaller Transformer-based language models pretrained on carefully curated data, we supercharge such approaches by investing heavily in curating a novel, high quality, non-synthetic data mixture based solely on evaluation benchmarks. Using our novel dataset mixture consisting of less than 100 thousand tokens, we pretrain a 1 million parameter tra… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 3 pages, satire

  12. arXiv:2307.10573  [pdf, other

    cs.AI

    Invalid Logic, Equivalent Gains: The Bizarreness of Reasoning in Language Model Prompting

    Authors: Rylan Schaeffer, Kateryna Pistunova, Samar Khanna, Sarthak Consul, Sanmi Koyejo

    Abstract: Language models can be prompted to reason through problems in a manner that significantly improves performance. However, \textit{why} such prompting improves performance is unclear. Recent work showed that using logically \textit{invalid} Chain-of-Thought (CoT) prompting improves performance almost as much as logically \textit{valid} CoT prompting, and that editing CoT prompts to replace problem-s… ▽ More

    Submitted 22 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: ICML 2023 Workshop: Knowledge and Logical Reasoning in the Era of Data-driven Learning

  13. arXiv:2307.10569  [pdf, ps, other

    cs.LG cs.AI

    Deceptive Alignment Monitoring

    Authors: Andres Carranza, Dhruv Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

    Abstract: As the capabilities of large machine learning models continue to grow, and as the autonomy afforded to such models continues to expand, the spectre of a new adversary looms: the models themselves. The threat that a model might behave in a seemingly reasonable manner, while secretly and subtly modifying its behavior for ulterior reasons is often referred to as deceptive alignment in the AI Safety &… ▽ More

    Submitted 25 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted as BlueSky Oral to 2023 ICML AdvML Workshop

  14. arXiv:2307.10563  [pdf, other

    cs.LG cs.AI

    FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation

    Authors: Dhruv Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

    Abstract: We present FACADE, a novel probabilistic and geometric framework designed for unsupervised mechanistic anomaly detection in deep neural networks. Its primary goal is advancing the understanding and mitigation of adversarial attacks. FACADE aims to generate probabilistic distributions over circuits, which provide critical insights to their contribution to changes in the manifold properties of pseud… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted as BlueSky Poster at 2023 ICML AdvML Workshop

  15. arXiv:2306.11698  [pdf, other

    cs.CL cs.AI cs.CR

    DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

    Authors: Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T. Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zinan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, Bo Li

    Abstract: Generative Pre-trained Transformer (GPT) models have exhibited exciting progress in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the literature on the trustworthiness of GPT models remains limited, practitioners have proposed employing capable GPT models for sensitive applications such as healthcare and finance -- where mistakes can be costly. To thi… ▽ More

    Submitted 26 February, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Outstanding Paper (Datasets and Benchmarks Track)

  16. arXiv:2304.15004  [pdf, other

    cs.AI cs.LG

    Are Emergent Abilities of Large Language Models a Mirage?

    Authors: Rylan Schaeffer, Brando Miranda, Sanmi Koyejo

    Abstract: Recent work claims that large language models display emergent abilities, abilities not present in smaller-scale models that are present in larger-scale models. What makes emergent abilities intriguing is two-fold: their sharpness, transitioning seemingly instantaneously from not present to present, and their unpredictability, appearing at seemingly unforeseeable model scales. Here, we present an… ▽ More

    Submitted 22 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

  17. arXiv:2303.14151  [pdf, other

    cs.LG stat.ML

    Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

    Authors: Rylan Schaeffer, Mikail Khona, Zachary Robertson, Akhilan Boopathy, Kateryna Pistunova, Jason W. Rocks, Ila Rani Fiete, Oluwasanmi Koyejo

    Abstract: Double descent is a surprising phenomenon in machine learning, in which as the number of model parameters grows relative to the number of data, test error drops as models grow ever larger into the highly overparameterized (data undersampled) regime. This drop in test error flies against classical learning theory on overfitting and has arguably underpinned the success of large models in machine lea… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  18. arXiv:2205.01212  [pdf, other

    cs.LG cs.AI

    Streaming Inference for Infinite Non-Stationary Clustering

    Authors: Rylan Schaeffer, Gabrielle Kaili-May Liu, Yilun Du, Scott Linderman, Ila Rani Fiete

    Abstract: Learning from a continuous stream of non-stationary data in an unsupervised manner is arguably one of the most common and most challenging settings facing intelligent agents. Here, we attack learning under all three conditions (unsupervised, streaming, non-stationary) in the context of clustering, also known as mixture modeling. We introduce a novel clustering algorithm that endows mixture models… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: Published at the Workshop on Agent Learning in Open-Endedness (ALOE) at ICLR 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:19366-19387, 2022

  19. arXiv:2202.06892  [pdf, other

    cs.LG cs.DC

    DeCorus: Hierarchical Multivariate Anomaly Detection at Cloud-Scale

    Authors: Bruno Wassermann, David Ohana, Ronen Schaffer, Robert Shahla, Elliot K. Kolodner, Eran Raichstein, Michal Malka

    Abstract: Multivariate anomaly detection can be used to identify outages within large volumes of telemetry data for computing systems. However, developing an efficient anomaly detector that can provide users with relevant information is a challenging problem. We introduce our approach to hierarchical multivariate anomaly detection called DeCorus, a statistical multivariate anomaly detector which achieves li… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: 11 pages, 4 figures, draft

  20. arXiv:2111.03745  [pdf, other

    cs.AI cs.LG

    An Algorithmic Theory of Metacognition in Minds and Machines

    Authors: Rylan Schaeffer

    Abstract: Humans sometimes choose actions that they themselves can identify as sub-optimal, or wrong, even in the absence of additional information. How is this possible? We present an algorithmic theory of metacognition based on a well-understood trade-off in reinforcement learning (RL) between value-based RL and policy-based RL. To the cognitive (neuro)science community, our theory answers the outstanding… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.