Showing 1–6 of 6 results for author: Don-Yehiya, S

  1. arXiv:2408.16961  [pdf, other]

    cs.HC cs.AI

    The Future of Open Human Feedback

    Authors: Shachar Don-Yehiya, Ben Burtenshaw, Ramon Fernandez Astudillo, Cailean Osborne, Mimansa Jaiswal, Tzu-Sheng Kuo, Wenting Zhao, Idan Shenfeld, Andi Peng, Mikhail Yurochkin, Atoosa Kasirzadeh, Yangsibo Huang, Tatsunori Hashimoto, Yacine Jernite, Daniel Vila-Suero, Omri Abend, Jennifer Ding, Sara Hooker, Hannah Rose Kirk, Leshem Choshen

    Abstract: Human feedback on conversations with large language models (LLMs) is central to how these systems learn about the world, improve their capabilities, and are steered toward desirable and safe behaviors. However, this feedback is mostly collected by frontier AI labs and kept behind closed doors. In this work, we bring together interdisciplinary experts to assess the opportunities and challenges t…

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2408.08291  [pdf, other]

    cs.CL

    The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: Human-model conversations provide a window into users' real-world scenarios, behavior, and needs, and thus are a valuable resource for model development and research. While for-profit companies collect user data through the APIs of their models, using it internally to improve their own models, the open source and research community lags behind. We introduce the ShareLM collection, a unified set…

    Submitted 15 August, 2024; originally announced August 2024.

  3. arXiv:2407.10944  [pdf, other]

    cs.CL

    Learning from Naturally Occurring Feedback

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable. We propose a scalable method for extracting feedback that users naturally include when interacting with chat models, and leveraging it for model training. We are further motivated by previous work that showed there are also qualitative advantages to us…

    Submitted 15 July, 2024; originally announced July 2024.

  4. arXiv:2311.12131  [pdf, other]

    cs.CL

    Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: Generating images with a Text-to-Image model often requires multiple trials, where human users iteratively update their prompt based on feedback, namely the output image. Taking inspiration from cognitive work on reference games and dialogue alignment, this paper analyzes the dynamics of the user prompts along such iterations. We compile a dataset of iterative interactions of human users with Midj…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP23

  5. arXiv:2212.01378  [pdf, other]

    cs.LG cs.CL cs.DC

    ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning

    Authors: Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen

    Abstract: We propose a new paradigm to continually evolve pretrained models, denoted ColD Fusion. It provides the benefits of multitask learning but leverages distributed computation with limited communication and eliminates the need for shared data. Consequentially, ColD Fusion can give rise to a synergistic loop, where finetuned models can be recycled to continually improve the pretrained model they are b…

    Submitted 13 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: ACL 23

  6. arXiv:2205.09178  [pdf, other]

    cs.CL cs.LG

    PreQuEL: Quality Estimation of Machine Translation Outputs in Advance

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. A PreQuEL system predicts how well a given sentence will be translated, without recourse to the actual translation, thus eschewing unnecessary resource allocation when translation quality is bound to be low. PreQuEL can be defined relative to a given MT system (e.g., some industry service) or generally relative to the state-of-the-…

    Submitted 4 December, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to the main conference of EMNLP 2022