Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Grayson, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04236  [pdf, other

    cs.CV

    Understanding Information Storage and Transfer in Multi-modal Large Language Models

    Authors: Samyadeep Basu, Martin Grayson, Cecily Morrison, Besmira Nushi, Soheil Feizi, Daniela Massiceti

    Abstract: Understanding the mechanisms of information storage and transfer in Transformer-based models is important for driving model understanding progress. Recent work has studied these mechanisms for Large Language Models (LLMs), revealing insights on how information is stored in a model's parameters and how information flows to and from these parameters in response to specific prompts. However, these st… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 20 pages

  2. arXiv:2311.17315  [pdf, other

    cs.CV

    Explaining CLIP's performance disparities on data from blind/low vision users

    Authors: Daniela Massiceti, Camilla Longden, Agnieszka Słowik, Samuel Wills, Martin Grayson, Cecily Morrison

    Abstract: Large multi-modal models (LMMs) hold the potential to usher in a new era of automated visual assistance for people who are blind or low vision (BLV). Yet, these models have not been systematically evaluated on data captured by BLV users. We address this by empirically assessing CLIP, a widely-used LMM likely to underpin many assistive technologies. Testing 25 CLIP variants in a zero-shot classific… ▽ More

    Submitted 25 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted at 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)