Learning from Naturally Occurring Feedback

Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable.

In the paper, we propose a scalable method for extracting the feedback that users naturally include when interacting with chat models, and for leveraging it in model training.

This repo contains the code for feedback extraction and for using the feedback data already extracted for the paper.

Code

The code for extracting feedback data from chat data is in the extract_feedback.py file.

Data

The dataset that was collected and used in this paper is available in the data folder.

The data is in CSV format; use get_natural_feedback_dataset.py to parse it. It is also available as a Hugging Face 🤗 dataset here.
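
For a quick look at a CSV file before running the parsing script, a minimal sketch using only the Python standard library might look like the following. It is not the logic of get_natural_feedback_dataset.py. The helper names load_feedback_rows and count_by_column are hypothetical, and no assumption is made about the actual column names; check the header of the file you open.

```python
import csv
from pathlib import Path


def load_feedback_rows(csv_path):
    """Read a CSV file into a list of dicts keyed by its header row."""
    with Path(csv_path).open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def count_by_column(rows, column):
    """Count how often each value of `column` appears in the rows.

    Rows that lack the column (or have an empty value) are counted under "".
    """
    counts = {}
    for row in rows:
        key = row.get(column) or ""
        counts[key] = counts.get(key, 0) + 1
    return counts
```

To use it, pass the path of a CSV file from the data folder to load_feedback_rows, inspect rows[0].keys() to see which columns are present, and then call count_by_column with one of those column names.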

Citation

If you find this work useful, please cite our paper:

@misc{donyehiya2024learningnaturallyoccurringfeedback,
      title={Learning from Naturally Occurring Feedback}, 
      author={Shachar Don-Yehiya and Leshem Choshen and Omri Abend},
      year={2024},
      eprint={2407.10944},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.10944}, 
}
