Human feedback data is a critical component in developing language models. However, collecting this feedback is costly and ultimately not scalable.
In the paper we propose a scalable method for extracting feedback that users naturally include when interacting with chat models, and leveraging it for model training.
This repo contains the code for feedback extraction, and for using the already extracted feedback data from the paper.
The code for extracting feedback data from chat data is in the extract_feedback.py
file.
The dataset that was collected and used in this paper is available in the data
folder.
The data is in a csv format, use get_natural_feedback_dataset.py
to parse it.
It is also available as a Huggingface 🤗 dataset here.
If you find this work useful, please cite our paper:
@misc{donyehiya2024learningnaturallyoccurringfeedback,
title={Learning from Naturally Occurring Feedback},
author={Shachar Don-Yehiya and Leshem Choshen and Omri Abend},
year={2024},
eprint={2407.10944},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2407.10944},
}