Skip to main content

Showing 1–8 of 8 results for author: Fiete, I R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14549  [pdf, other

    cs.CV cs.LG q-bio.NC

    Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models

    Authors: Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

    Abstract: The proliferation of large language models has revolutionized natural language processing tasks, yet it raises profound concerns regarding data privacy and security. Language models are trained on extensive corpora including potentially sensitive or proprietary information, and the risk of data leakage -- where the model response reveals pieces of such information -- remains inadequately understoo… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2402.10202  [pdf, other

    cs.LG

    Bridging Associative Memory and Probabilistic Modeling

    Authors: Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

    Abstract: Associative memory and probabilistic modeling are two fundamental topics in artificial intelligence. The first studies recurrent neural networks designed to denoise, complete and retrieve data, whereas the second studies learning and sampling from probability distributions. Based on the observation that associative memory's energy functions can be seen as probabilistic modeling's negative log like… ▽ More

    Submitted 13 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  3. arXiv:2311.02316  [pdf, other

    cs.LG cs.NE

    Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells

    Authors: Rylan Schaeffer, Mikail Khona, Tzuhsuan Ma, Cristóbal Eyzaguirre, Sanmi Koyejo, Ila Rani Fiete

    Abstract: To solve the spatial problems of mapping, localization and navigation, the mammalian lineage has developed striking spatial representations. One important spatial representation is the Nobel-prize winning grid cells: neurons that represent self-location, a local and aperiodic quantity, with seemingly bizarre non-local and spatially periodic activity patterns of a few discrete periods. Why has the… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  4. arXiv:2310.07711  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Growing Brains: Co-emergence of Anatomical and Functional Modularity in Recurrent Neural Networks

    Authors: Ziming Liu, Mikail Khona, Ila R. Fiete, Max Tegmark

    Abstract: Recurrent neural networks (RNNs) trained on compositional tasks can exhibit functional modularity, in which neurons can be clustered by activity similarity and participation in shared computational subtasks. Unlike brains, these RNNs do not exhibit anatomical modularity, in which functional clustering is correlated with strong recurrent coupling and spatial localization of functional clusters. Con… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 figures

  5. arXiv:2303.14151  [pdf, other

    cs.LG stat.ML

    Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle

    Authors: Rylan Schaeffer, Mikail Khona, Zachary Robertson, Akhilan Boopathy, Kateryna Pistunova, Jason W. Rocks, Ila Rani Fiete, Oluwasanmi Koyejo

    Abstract: Double descent is a surprising phenomenon in machine learning, in which as the number of model parameters grows relative to the number of data, test error drops as models grow ever larger into the highly overparameterized (data undersampled) regime. This drop in test error flies against classical learning theory on overfitting and has arguably underpinned the success of large models in machine lea… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  6. arXiv:2205.01212  [pdf, other

    cs.LG cs.AI

    Streaming Inference for Infinite Non-Stationary Clustering

    Authors: Rylan Schaeffer, Gabrielle Kaili-May Liu, Yilun Du, Scott Linderman, Ila Rani Fiete

    Abstract: Learning from a continuous stream of non-stationary data in an unsupervised manner is arguably one of the most common and most challenging settings facing intelligent agents. Here, we attack learning under all three conditions (unsupervised, streaming, non-stationary) in the context of clustering, also known as mixture modeling. We introduce a novel clustering algorithm that endows mixture models… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: Published at the Workshop on Agent Learning in Open-Endedness (ALOE) at ICLR 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:19366-19387, 2022

  7. arXiv:2202.12887  [pdf, other

    cs.LG cs.NE q-bio.NC stat.ML

    Fault-Tolerant Neural Networks from Biological Error Correction Codes

    Authors: Alexander Zlokapa, Andrew K. Tan, John M. Martyn, Ila R. Fiete, Max Tegmark, Isaac L. Chuang

    Abstract: It has been an open question in deep learning if fault-tolerant computation is possible: can arbitrarily reliable computation be achieved using only unreliable neurons? In the grid cells of the mammalian cortex, analog error correction codes have been observed to protect states against neural spiking noise, but their role in information processing is unclear. Here, we use these biological error co… ▽ More

    Submitted 9 February, 2024; v1 submitted 25 February, 2022; originally announced February 2022.

    Report number: MIT-CTP/5395

  8. arXiv:2202.00159  [pdf, other

    cs.AI cs.IT cs.LG

    Content Addressable Memory Without Catastrophic Forgetting by Heteroassociation with a Fixed Scaffold

    Authors: Sugandha Sharma, Sarthak Chandra, Ila R. Fiete

    Abstract: Content-addressable memory (CAM) networks, so-called because stored items can be recalled by partial or corrupted versions of the items, exhibit near-perfect recall of a small number of information-dense patterns below capacity and a 'memory cliff' beyond, such that inserting a single additional pattern results in catastrophic loss of all stored patterns. We propose a novel CAM architecture, Memor… ▽ More

    Submitted 4 July, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Last two authors contributed equally