Showing 1–6 of 6 results for author: Wedin, B

  1. arXiv:2403.04894

    cs.CL cs.AI

    ConstitutionalExperts: Training a Mixture of Principle-based Prompts

    Authors: Savvas Petridis, Ben Wedin, Ann Yuan, James Wexler, Nithum Thain

    Abstract: Large language models (LLMs) are highly capable at a variety of tasks given the right prompt, but writing one is still a difficult and tedious process. In this work, we introduce ConstitutionalExperts, a method for learning a prompt consisting of constitutional principles (i.e. rules), given a training dataset. Unlike prior methods that optimize the prompt as a single entity, our method incrementa…

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2310.15428

    cs.HC cs.AI

    ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles

    Authors: Savvas Petridis, Ben Wedin, James Wexler, Aaron Donsbach, Mahima Pushkarna, Nitesh Goyal, Carrie J. Cai, Michael Terry

    Abstract: Large language model (LLM) prompting is a promising new approach for users to create and customize their own chatbots. However, current methods for steering a chatbot's outputs, such as prompt engineering and fine-tuning, do not support users in converting their natural feedback on the model's outputs to changes in the prompt or model. In this work, we explore how to enable users to interactively…

    Submitted 23 October, 2023; originally announced October 2023.

  3. arXiv:2307.14225

    cs.IR cs.LG

    Large Language Models are Competitive Near Cold-start Recommenders for Language- and Item-based Preferences

    Authors: Scott Sanner, Krisztian Balog, Filip Radlinski, Ben Wedin, Lucas Dixon

    Abstract: Traditional recommender systems leverage users' item preference history to recommend novel content that users may like. However, modern dialog interfaces that allow users to express language-based preferences offer a fundamentally different modality for preference input. Inspired by recent successes of prompting paradigms for large language models (LLMs), we study their use for making recommendati…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: To appear at RecSys'23

  4. On Natural Language User Profiles for Transparent and Scrutable Recommendation

    Authors: Filip Radlinski, Krisztian Balog, Fernando Diaz, Lucas Dixon, Ben Wedin

    Abstract: Natural interaction with recommendation and personalized search systems has received tremendous attention in recent years. We focus on the challenge of supporting people's understanding and control of these systems and explore a fundamentally new way of thinking about representation of knowledge in recommendation and personalization systems. Specifically, we argue that it may be both desirable and…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '22), 2022

  5. arXiv:2201.11196

    cs.LG cs.HC

    IMACS: Image Model Attribution Comparison Summaries

    Authors: Eldon Schoop, Ben Wedin, Andrei Kapishnikov, Tolga Bolukbasi, Michael Terry

    Abstract: Developing a suitable Deep Neural Network (DNN) often requires significant iteration, where different model versions are evaluated and compared. While metrics such as accuracy are a powerful means to succinctly describe a model's performance across a dataset or to directly compare model versions, practitioners often wish to gain a deeper understanding of the factors that influence a model's predic…

    Submitted 26 January, 2022; originally announced January 2022.

  6. arXiv:2106.09788

    cs.CV cs.LG

    Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

    Authors: Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi

    Abstract: Integrated Gradients (IG) is a commonly used feature attribution method for deep neural networks. While IG has many desirable properties, the method often produces spurious/noisy pixel attributions in regions that are not related to the predicted class when applied to visual models. While this has been previously noted, most existing solutions are aimed at addressing the symptoms by explicitly red…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 13 pages, 11 figures, for implementation sources see https://github.com/PAIR-code/saliency

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 5050-5058