Skip to content

shachardon/naturally_occurring_feedback

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Learning from Naturally Occurring Feedback

Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable.

In the paper we propose a scalable method for extracting feedback that users naturally include when interacting with chat models, and leveraging it for model training.

This repo contains the code for feedback extraction, and for using the already extracted feedback data from the paper.

Code

The code for extracting feedback data from chat data is in the extract_feedback.py file.

Data

The dataset that was collected and used in this paper is available in the data folder.

The data is in a csv format, use get_natural_feedback_dataset.py to parse it. It is also available as a Huggingface 🤗 dataset here.

Citation

If you find this work useful, please cite our paper:

@misc{donyehiya2024learningnaturallyoccurringfeedback,
      title={Learning from Naturally Occurring Feedback}, 
      author={Shachar Don-Yehiya and Leshem Choshen and Omri Abend},
      year={2024},
      eprint={2407.10944},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.10944}, 
}

Über uns

No description, website, or topics provided.

Ressourcen

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages