Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

 

Other publications of authors with the same name

Understanding the Effects of RLHF on LLM Generalisation and Diversity. CoRR, (2023)
Teaching Large Language Models to Reason with Reinforcement Learning. CoRR, (2024)
LLaMA: Open and Efficient Foundation Language Models. CoRR, (2023)
Generalization to New Sequential Decision Making Tasks with In-Context Learning. ICML, OpenReview.net, (2024)
Dungeons and Data: A Large-Scale NetHack Dataset. NeurIPS, (2022)
Know When To Stop: A Study of Semantic Drift in Text Generation. NAACL-HLT, pages 3656-3671. Association for Computational Linguistics, (2024)
Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971, (2023)
Understanding the Effects of RLHF on LLM Generalisation and Diversity. ICLR, OpenReview.net, (2024)
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research. NeurIPS Datasets and Benchmarks, (2021)