Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Naselaris, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.11207  [pdf, other

    cs.CV cs.AI q-bio.NC

    MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

    Authors: Paul S. Scotti, Mihir Tripathy, Cesar Kadir Torrico Villanueva, Reese Kneeland, Tong Chen, Ashutosh Narang, Charan Santhirasegaran, Jonathan Xu, Thomas Naselaris, Kenneth A. Norman, Tanishq Mathew Abraham

    Abstract: Reconstructions of visual perception from brain activity have improved tremendously, but the practical utility of such methods has been limited. This is because such models are trained independently per subject where each subject requires dozens of hours of expensive fMRI training data to attain high-quality results. The present work showcases high-quality reconstructions using only 1 hour of fMRI… ▽ More

    Submitted 15 June, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: In Forty-first International Conference on Machine Learning, 2024. Code at https://github.com/MedARC-AI/MindEyeV2. Published as a conference paper at ICML 2024

  2. arXiv:2401.06005  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG

    How does the primate brain combine generative and discriminative computations in vision?

    Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

    Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remo… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  3. arXiv:2312.07705  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG

    Brain-optimized inference improves reconstructions of fMRI brain activity

    Authors: Reese Kneeland, Jordyn Ojeda, Ghislain St-Yves, Thomas Naselaris

    Abstract: The release of large datasets and developments in AI have led to dramatic improvements in decoding methods that reconstruct seen images from human brain activity. We evaluate the prospect of further improving recent decoding methods by optimizing for consistency between reconstructions and brain activity during inference. We sample seed reconstructions from a base decoding method, then iteratively… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 7 pages, 8 figures, submitted to the 2023 AAAI Workshop on Brain Encoding and Decoding. arXiv admin note: text overlap with arXiv:2306.00927

  4. arXiv:2306.00927  [pdf, other

    q-bio.NC cs.CV cs.LG

    Second Sight: Using brain-optimized encoding models to align image distributions with human brain activity

    Authors: Reese Kneeland, Jordyn Ojeda, Ghislain St-Yves, Thomas Naselaris

    Abstract: Two recent developments have accelerated progress in image reconstruction from human brain activity: large datasets that offer samples of brain activity in response to many thousands of natural scenes, and the open-sourcing of powerful stochastic image-generators that accept both low- and high-level guidance. Most work in this space has focused on obtaining point estimates of the target image, wit… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 15 Figures, 19 pages including the appendix

  5. arXiv:2305.00556  [pdf, other

    q-bio.NC cs.CV cs.LG eess.IV

    Reconstructing seen images from human brain activity via guided stochastic search

    Authors: Reese Kneeland, Jordyn Ojeda, Ghislain St-Yves, Thomas Naselaris

    Abstract: Visual reconstruction algorithms are an interpretive tool that map brain activity to pixels. Past reconstruction algorithms employed brute-force search through a massive library to select candidate images that, when passed through an encoding model, accurately predict brain activity. Here, we use conditional generative diffusion models to extend and improve this search-based strategy. We decode a… ▽ More

    Submitted 1 May, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: 4 pages, 5 figures, submitted to the 2023 Conference on Cognitive Computational Neuroscience

  6. arXiv:2209.11737  [pdf

    cs.CV cs.LG q-bio.NC

    Visual representations in the human brain are aligned with large language models

    Authors: Adrien Doerig, Tim C Kietzmann, Emily Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Ian Charest

    Abstract: The human brain extracts complex information from visual inputs, including objects, their spatial and semantic interrelations, and their interactions with the environment. However, a quantitative approach for studying this information remains elusive. Here, we test whether the contextual information encoded in large language models (LLMs) is beneficial for modelling the complex visual information… ▽ More

    Submitted 6 July, 2024; v1 submitted 23 September, 2022; originally announced September 2022.

  7. arXiv:2105.07140  [pdf, other

    q-bio.NC cs.CV q-bio.QM

    NeuroGen: activation optimized image synthesis for discovery neuroscience

    Authors: Zijin Gu, Keith W. Jamison, Meenakshi Khosla, Emily J. Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Mert R. Sabuncu, Amy Kuceyeski

    Abstract: Functional MRI (fMRI) is a powerful technique that has allowed us to characterize visual cortex responses to stimuli, yet such experiments are by nature constructed based on a priori hypotheses, limited to the set of images presented to the individual while they are in the scanner, are subject to noise in the observed brain responses, and may vary widely across individuals. In this work, we propos… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.