Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Edwards, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13692  [pdf, other

    cs.CL

    Prover-Verifier Games improve legibility of LLM outputs

    Authors: Jan Hendrik Kirchner, Yining Chen, Harri Edwards, Jan Leike, Nat McAleese, Yuri Burda

    Abstract: One way to increase confidence in the outputs of Large Language Models (LLMs) is to support them with reasoning that is clear and easy to check -- a property we call legibility. We study legibility in the context of solving grade-school math problems and show that optimizing chain-of-thought solutions only for answer correctness can make them less legible. To mitigate the loss in legibility, we pr… ▽ More

    Submitted 1 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2308.09612  [pdf, other

    cs.LG eess.SY

    Constrained Bayesian Optimization Using a Lagrange Multiplier Applied to Power Transistor Design

    Authors: Ping-Ju Chuang, Ali Saadat, Sara Ghazvini, Hal Edwards, William G. Vandenberghe

    Abstract: We propose a novel constrained Bayesian Optimization (BO) algorithm optimizing the design process of Laterally-Diffused Metal-Oxide-Semiconductor (LDMOS) transistors while realizing a target Breakdown Voltage (BV). We convert the constrained BO problem into a conventional BO problem using a Lagrange multiplier. Instead of directly optimizing the traditional Figure-of-Merit (FOM), we set the Lagran… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 7 pages, 5 figures

  3. arXiv:2305.20050  [pdf, other

    cs.LG cs.AI cs.CL

    Let's Verify Step by Step

    Authors: Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe

    Abstract: In recent years, large language models have greatly improved in their ability to perform complex multi-step reasoning. However, even state-of-the-art models still regularly produce logical mistakes. To train more reliable models, we can turn either to outcome supervision, which provides feedback for a final result, or process supervision, which provides feedback for each intermediate reasoning ste… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  4. arXiv:2203.02481  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    AutoDIME: Automatic Design of Interesting Multi-Agent Environments

    Authors: Ingmar Kanitscheider, Harri Edwards

    Abstract: Designing a distribution of environments in which RL agents can learn interesting and useful skills is a challenging and poorly understood task, for multi-agent environments the difficulties are only exacerbated. One approach is to train a second RL agent, called a teacher, who samples environments that are conducive for the learning of student agents. However, most previous proposals for teacher… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: first submission

  5. arXiv:2201.02177  [pdf, other

    cs.LG

    Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

    Authors: Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, Vedant Misra

    Abstract: In this paper we propose to study generalization of neural networks on small algorithmically generated datasets. In this setting, questions about data efficiency, memorization, generalization, and speed of learning can be studied in great detail. In some situations we show that neural networks learn through a process of "grokking" a pattern in the data, improving generalization performance from ra… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: Correspondence to [email protected]. Code available at: https://github.com/openai/grok

  6. arXiv:2110.05448  [pdf, other

    cs.CL cs.AI

    Unsupervised Neural Machine Translation with Generative Language Models Only

    Authors: Jesse Michael Han, Igor Babuschkin, Harrison Edwards, Arvind Neelakantan, Tao Xu, Stanislas Polu, Alex Ray, Pranav Shyam, Aditya Ramesh, Alec Radford, Ilya Sutskever

    Abstract: We show how to derive state-of-the-art unsupervised neural machine translation systems from generatively pre-trained language models. Our method consists of three steps: few-shot amplification, distillation, and backtranslation. We first use the zero-shot translation ability of large pre-trained language models to generate translations for a small set of unlabeled sentences. We then amplify these… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 10 pages

  7. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  8. arXiv:2103.04912  [pdf

    cs.RO cs.CV

    Autonomous object harvesting using synchronized optoelectronic microrobots

    Authors: Christopher Bendkowski, Laurent Mennillo, Tao Xu, Mohamed Elsayed, Filip Stojic, Harrison Edwards, Shuailong Zhang, Cindi Morshead, Vijay Pawar, Aaron R. Wheeler, Danail Stoyanov, Michael Shaw

    Abstract: Optoelectronic tweezer-driven microrobots (OETdMs) are a versatile micromanipulation technology based on the use of light induced dielectrophoresis to move small dielectric structures (microrobots) across a photoconductive substrate. The microrobots in turn can be used to exert forces on secondary objects and carry out a wide range of micromanipulation operations, including collecting, transportin… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures

  9. mdspan in C++: A Case Study in the Integration of Performance Portable Features into International Language Standards

    Authors: D. S. Hollman, Bryce Adelstein Lelbach, H. Carter Edwards, Mark Hoemmen, Daniel Sunderland, Christian R. Trott

    Abstract: Multi-dimensional arrays are ubiquitous in high-performance computing (HPC), but their absence from the C++ language standard is a long-standing and well-known limitation of their use for HPC. This paper describes the design and implementation of mdspan, a proposed C++ standard multidimensional array view (planned for inclusion in C++23). The proposal is largely inspired by work done in the Kokkos… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Journal ref: 2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC), Denver, CO, USA, 2019, pp. 60-70

  10. arXiv:2008.04600  [pdf, other

    cs.AI

    Planimation

    Authors: Gang Chen, Yi Ding, Hugo Edwards, Chong Hin Chau, Sai Hou, Grace Johnson, Mohammed Sharukh Syed, Haoyuan Tang, Yue Wu, Ye Yan, Gil Tidhar, Nir Lipovetzky

    Abstract: Planimation is a modular and extensible open source framework to visualise sequential solutions of planning problems specified in PDDL. We introduce a preliminary declarative PDDL-like animation profile specification, expressive enough to synthesise animations of arbitrary initial states and goals of a benchmark with just a single profile.

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: Best ICAPS 19 - Systen Demo Award - technical report

  11. arXiv:1810.12894  [pdf, other

    cs.LG cs.AI stat.ML

    Exploration by Random Network Distillation

    Authors: Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov

    Abstract: We introduce an exploration bonus for deep reinforcement learning methods that is easy to implement and adds minimal overhead to the computation performed. The bonus is the error of a neural network predicting features of the observations given by a fixed randomly initialized neural network. We also introduce a method to flexibly combine intrinsic and extrinsic rewards. We find that the random net… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

  12. arXiv:1810.09502  [pdf, other

    cs.LG stat.ML

    How to train your MAML

    Authors: Antreas Antoniou, Harrison Edwards, Amos Storkey

    Abstract: The field of few-shot learning has recently seen substantial advancements. Most of these advancements came from casting few-shot learning as a meta-learning problem. Model Agnostic Meta Learning or MAML is currently one of the best approaches for few-shot learning via meta-learning. MAML is simple, elegant and very powerful, however, it has a variety of issues, such as being very sensitive to neur… ▽ More

    Submitted 5 March, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: Published in ICLR 2019

  13. arXiv:1808.04355  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Large-Scale Study of Curiosity-Driven Learning

    Authors: Yuri Burda, Harri Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros

    Abstract: Reinforcement learning algorithms rely on carefully engineering environment rewards that are extrinsic to the agent. However, annotating each environment with hand-designed, dense rewards is not scalable, motivating the need for developing reward functions that are intrinsic to the agent. Curiosity is a type of intrinsic reward function which uses prediction error as reward signal. In this paper:… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: First three authors contributed equally and ordered alphabetically. Website at https://pathak22.github.io/large-scale-curiosity/

  14. arXiv:1807.10299  [pdf, other

    cs.AI

    Variational Option Discovery Algorithms

    Authors: Joshua Achiam, Harrison Edwards, Dario Amodei, Pieter Abbeel

    Abstract: We explore methods for option discovery based on variational inference and make two algorithmic contributions. First: we highlight a tight connection between variational option discovery methods and variational autoencoders, and introduce Variational Autoencoding Learning of Options by Reinforcement (VALOR), a new method derived from the connection. In VALOR, the policy encodes contexts from a noi… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

  15. arXiv:1806.06464  [pdf, other

    cs.MA cs.AI cs.LG cs.NE stat.ML

    Learning Policy Representations in Multiagent Systems

    Authors: Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

    Abstract: Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general learning framework for modeling agent behavior in any multiagent system using only a handful of interaction data. Our framework casts agent model… ▽ More

    Submitted 31 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  16. arXiv:1711.04340  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Data Augmentation Generative Adversarial Networks

    Authors: Antreas Antoniou, Amos Storkey, Harrison Edwards

    Abstract: Effective training of neural networks requires much data. In the low-data regime, parameters are underdetermined, and learnt networks generalise poorly. Data Augmentation alleviates this by using existing data more effectively. However standard data augmentation produces only limited plausible alternative data. Given there is potential to generate a much broader set of augmentations, we design and… ▽ More

    Submitted 21 March, 2018; v1 submitted 12 November, 2017; originally announced November 2017.

    Comments: 10 pages

  17. arXiv:1606.02185  [pdf, other

    stat.ML cs.LG

    Towards a Neural Statistician

    Authors: Harrison Edwards, Amos Storkey

    Abstract: An efficient learner is one who reuses what they already know to tackle a new problem. For a machine learner, this means understanding the similarities amongst datasets. In order to do this, one must take seriously the idea of working with datasets, rather than datapoints, as the key objects to model. Towards this goal, we demonstrate an extension of a variational autoencoder that can learn a meth… ▽ More

    Submitted 20 March, 2017; v1 submitted 7 June, 2016; originally announced June 2016.

    Comments: Updated to camera ready version for ICLR 2017

  18. arXiv:1601.05871  [pdf, other

    cs.MS

    Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout

    Authors: Kyungjoo Kim, Sivasankaran Rajamanickam, George Stelle, H. Carter Edwards, Stephen L. Olivier

    Abstract: We introduce a task-parallel algorithm for sparse incomplete Cholesky factorization that utilizes a 2D sparse partitioned-block layout of a matrix. Our factorization algorithm follows the idea of algorithms-by-blocks by using the block layout. The algorithm-by-blocks approach induces a task graph for the factorization. These tasks are inter-related to each other through their data dependences in t… ▽ More

    Submitted 21 January, 2016; originally announced January 2016.

    Comments: 25 pages

    Report number: SAND2016-0637 R MSC Class: 68W10

  19. arXiv:1511.05897  [pdf, other

    cs.LG cs.AI stat.ML

    Censoring Representations with an Adversary

    Authors: Harrison Edwards, Amos Storkey

    Abstract: In practice, there are often explicit constraints on what representations or decisions are acceptable in an application of machine learning. For example it may be a legal requirement that a decision must not favour a particular group. Alternatively it can be that that representation of data must not have identifying information. We address these two related issues by learning flexible representati… ▽ More

    Submitted 4 March, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: Paper accepted to ICLR

  20. arXiv:1511.03703  [pdf, other

    cs.MS cs.CE

    Embedded Ensemble Propagation for Improving Performance, Portability and Scalability of Uncertainty Quantification on Emerging Computational Architectures

    Authors: E. Phipps, M. D'Elia, H. C. Edwards, M. Hoemmen, J. Hu, S. Rajamanickam

    Abstract: Quantifying simulation uncertainties is a critical component of rigorous predictive simulation. A key component of this is forward propagation of uncertainties in simulation input data to output quantities of interest. Typical approaches involve repeated sampling of the simulation over the uncertain input data, and can require numerous samples when accurately propagating uncertainties from large n… ▽ More

    Submitted 11 November, 2015; originally announced November 2015.

    Report number: SAND2015-9921 J

  21. arXiv:cs/0607072  [pdf

    cs.HC

    Effect of Interface Style in Peer Review Comments for UML Designs

    Authors: Scott A. Turner, Manuel A. Perez-Quinones, Stephen H. Edwards

    Abstract: This paper presents our evaluation of using a Tablet-PC to provide peer-review comments in the first year Computer Science course. Our exploration consisted of an evaluation of how students write comments on other students' assignments using three different methods: pen and paper, a Tablet-PC, and a desktop computer. Our ultimate goal is to explore the effect that interface style (Tablet vs. Des… ▽ More

    Submitted 14 July, 2006; originally announced July 2006.

    Comments: 8 pages, 7 figures

    ACM Class: H.1; H.4; H.5

  22. arXiv:cs/0603121  [pdf

    cs.HC cs.SE

    minimUML: A Minimalist Approach to UML Diagraming for Early Computer Science Education

    Authors: Scott Turner, Manuel A. Perez-Quinones, Stephen H. Edwards

    Abstract: The Unified Modeling Language (UML) is commonly used in introductory Computer Science to teach basic object-oriented design. However, there appears to be a lack of suitable software to support this task. Many of the available programs that support UML focus on developing code and not on enhancing learning. Those that were designed for educational use sometimes have poor interfaces or are missing… ▽ More

    Submitted 30 March, 2006; originally announced March 2006.

    Comments: 38 pages, 15 figures

    ACM Class: K.3.2; H.5.2; D.2.2