Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Betthauser, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.00313  [pdf, other

    cs.CL

    Decoding In-Context Learning: Neuroscience-inspired Analysis of Representations in Large Language Models

    Authors: Safoora Yousefi, Leo Betthauser, Hosein Hasanbeig, Raphaël Millière, Ida Momennejad

    Abstract: Large language models (LLMs) exhibit remarkable performance improvement through in-context learning (ICL) by leveraging task-specific examples in the input. However, the mechanisms behind this improvement remain elusive. In this work, we investigate how LLM embeddings and attention representations change following in-context-learning, and how these changes mediate improvement in behavior. We emplo… ▽ More

    Submitted 21 February, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  2. arXiv:2309.13701  [pdf, other

    cs.CL cs.AI cs.HC

    ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning

    Authors: Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad

    Abstract: From grading papers to summarizing medical documents, large language models (LLMs) are evermore used for evaluation of text generated by humans and AI alike. However, despite their extensive utility, LLMs exhibit distinct failure modes, necessitating a thorough audit and improvement of their text evaluation capabilities. Here we introduce ALLURE, a systematic approach to Auditing Large Language Mo… ▽ More

    Submitted 26 September, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

  3. arXiv:2202.02339  [pdf, other

    cs.LG stat.AP stat.ML

    Discovering Distribution Shifts using Latent Space Representations

    Authors: Leo Betthauser, Urszula Chajewska, Maurice Diesendruck, Rohith Pesala

    Abstract: Rapid progress in representation learning has led to a proliferation of embedding models, and to associated challenges of model selection and practical application. It is non-trivial to assess a model's generalizability to new, candidate datasets and failure to generalize may lead to poor performance on downstream tasks. Distribution shifts are one cause of reduced generalizability, and are often… ▽ More

    Submitted 16 February, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 10 pages, 5 figures, 3 tables, 2 algorithms

  4. Stable Electromyographic Sequence Prediction During Movement Transitions using Temporal Convolutional Networks

    Authors: Joseph L. Betthauser, John T. Krall, Rahul R. Kaliki, Matthew S. Fifer, Nitish V. Thakor

    Abstract: Transient muscle movements influence the temporal structure of myoelectric signal patterns, often leading to unstable prediction behavior from movement-pattern classification methods. We show that temporal convolutional network sequential models leverage the myoelectric signal's history to discover contextual temporal features that aid in correctly predicting movement intentions, especially during… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: 4 pages, 5 figures, accepted for Neural Engineering (NER) 2019 Conference