Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Burda, Y

.
  1. arXiv:2407.13692  [pdf, other

    cs.CL

    Prover-Verifier Games improve legibility of LLM outputs

    Authors: Jan Hendrik Kirchner, Yining Chen, Harri Edwards, Jan Leike, Nat McAleese, Yuri Burda

    Abstract: One way to increase confidence in the outputs of Large Language Models (LLMs) is to support them with reasoning that is clear and easy to check -- a property we call legibility. We study legibility in the context of solving grade-school math problems and show that optimizing chain-of-thought solutions only for answer correctness can make them less legible. To mitigate the loss in legibility, we pr… ▽ More

    Submitted 1 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2305.20050  [pdf, other

    cs.LG cs.AI cs.CL

    Let's Verify Step by Step

    Authors: Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe

    Abstract: In recent years, large language models have greatly improved in their ability to perform complex multi-step reasoning. However, even state-of-the-art models still regularly produce logical mistakes. To train more reliable models, we can turn either to outcome supervision, which provides feedback for a final result, or process supervision, which provides feedback for each intermediate reasoning ste… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  3. arXiv:2201.02177  [pdf, other

    cs.LG

    Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

    Authors: Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, Vedant Misra

    Abstract: In this paper we propose to study generalization of neural networks on small algorithmically generated datasets. In this setting, questions about data efficiency, memorization, generalization, and speed of learning can be studied in great detail. In some situations we show that neural networks learn through a process of "grokking" a pattern in the data, improving generalization performance from ra… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: Correspondence to [email protected]. Code available at: https://github.com/openai/grok

  4. arXiv:2107.03374  [pdf, other

    cs.LG

    Evaluating Large Language Models Trained on Code

    Authors: Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter , et al. (33 additional authors not shown)

    Abstract: We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol… ▽ More

    Submitted 14 July, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: corrected typos, added references, added authors, added acknowledgements

  5. arXiv:1810.12894  [pdf, other

    cs.LG cs.AI stat.ML

    Exploration by Random Network Distillation

    Authors: Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov

    Abstract: We introduce an exploration bonus for deep reinforcement learning methods that is easy to implement and adds minimal overhead to the computation performed. The bonus is the error of a neural network predicting features of the observations given by a fixed randomly initialized neural network. We also introduce a method to flexibly combine intrinsic and extrinsic rewards. We find that the random net… ▽ More

    Submitted 30 October, 2018; originally announced October 2018.

  6. arXiv:1808.04355  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Large-Scale Study of Curiosity-Driven Learning

    Authors: Yuri Burda, Harri Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros

    Abstract: Reinforcement learning algorithms rely on carefully engineering environment rewards that are extrinsic to the agent. However, annotating each environment with hand-designed, dense rewards is not scalable, motivating the need for developing reward functions that are intrinsic to the agent. Curiosity is a type of intrinsic reward function which uses prediction error as reward signal. In this paper:… ▽ More

    Submitted 13 August, 2018; originally announced August 2018.

    Comments: First three authors contributed equally and ordered alphabetically. Website at https://pathak22.github.io/large-scale-curiosity/

  7. arXiv:1806.06464  [pdf, other

    cs.MA cs.AI cs.LG cs.NE stat.ML

    Learning Policy Representations in Multiagent Systems

    Authors: Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

    Abstract: Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general learning framework for modeling agent behavior in any multiagent system using only a handful of interaction data. Our framework casts agent model… ▽ More

    Submitted 31 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  8. arXiv:1710.03641  [pdf, other

    cs.LG cs.AI

    Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

    Authors: Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel

    Abstract: Ability to continuously learn and adapt from limited experience in nonstationary environments is an important milestone on the path towards general intelligence. In this paper, we cast the problem of continuous adaptation into the learning-to-learn framework. We develop a simple gradient-based meta-learning algorithm suitable for adaptation in dynamically changing and adversarial scenarios. Additi… ▽ More

    Submitted 23 February, 2018; v1 submitted 10 October, 2017; originally announced October 2017.

    Comments: Published as a conference paper at ICLR 2018

  9. arXiv:1611.04273  [pdf, other

    cs.LG

    On the Quantitative Analysis of Decoder-Based Generative Models

    Authors: Yuhuai Wu, Yuri Burda, Ruslan Salakhutdinov, Roger Grosse

    Abstract: The past several years have seen remarkable progress in generative models which produce convincing samples of images and other modalities. A shared component of many powerful generative models is a decoder network, a parametric deep neural net that defines a generative distribution. Examples include variational autoencoders, generative adversarial networks, and generative moment matching networks.… ▽ More

    Submitted 6 June, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

    Comments: Accepted to ICLR2017

  10. arXiv:1509.00519  [pdf, other

    cs.LG stat.ML

    Importance Weighted Autoencoders

    Authors: Yuri Burda, Roger Grosse, Ruslan Salakhutdinov

    Abstract: The variational autoencoder (VAE; Kingma, Welling (2014)) is a recently proposed generative model pairing a top-down generative network with a bottom-up recognition network which approximates posterior inference. It typically makes strong assumptions about posterior inference, for instance that the posterior distribution is approximately factorial, and that its parameters can be approximated with… ▽ More

    Submitted 7 November, 2016; v1 submitted 1 September, 2015; originally announced September 2015.

    Comments: Submitted to ICLR 2015

  11. arXiv:1412.8566  [pdf, other

    cs.LG stat.ML

    Accurate and Conservative Estimates of MRF Log-likelihood using Reverse Annealing

    Authors: Yuri Burda, Roger B. Grosse, Ruslan Salakhutdinov

    Abstract: Markov random fields (MRFs) are difficult to evaluate as generative models because computing the test log-probabilities requires the intractable partition function. Annealed importance sampling (AIS) is widely used to estimate MRF partition functions, and often yields quite accurate results. However, AIS is prone to overestimate the log-likelihood with little indication that anything is wrong. We… ▽ More

    Submitted 30 December, 2014; originally announced December 2014.

  12. arXiv:1209.5137  [pdf, other

    math.AG

    Polynomials invertible in k-radicals

    Authors: Yuri Burda, Askold Khovanskii

    Abstract: A classic result of Ritt describes polynomials invertible in radicals: they are compositions of power polynomials, Chebyshev polynomials and polynomials of degree at most 4. In this paper we prove that a polynomial invertible in radicals and solutions of equations of degree at most k is a composition of power polynomials, Chebyshev polynomials, polynomials of degree at most k and, if k < 15, certa… ▽ More

    Submitted 23 September, 2012; originally announced September 2012.

    Comments: 19 pages, 14 figures

    MSC Class: 12F10; 20E34

  13. arXiv:1207.1211  [pdf, ps, other

    math.AG math.CV

    Signatures of Branched Coverings

    Authors: Yuri Burda, Askold Khovanskii

    Abstract: In this paper we deal with branched coverings over the complement to finitely many exceptional points on the Riemann sphere having the property that the local monodromy around each of the branching points is of finite order. To such a covering we assign its \textit{signature}, i.e. the set of its exceptional and branching points together with the orders of local monodromy operators around the bran… ▽ More

    Submitted 5 July, 2012; originally announced July 2012.

    Comments: To be submitted for publication in proceedings of 6th European Congress of Mathematics

    MSC Class: 34M15

  14. arXiv:1110.0568  [pdf, ps, other

    math.CO math.MG

    On a Problem of Gromov about Generalizing Alexandrov-Fenchel Inequality

    Authors: Yuri Burda

    Abstract: In this note we give an answer to a question about mixed volumes asked by Gromov in his paper "Convex Sets and Kahler Manifolds". For reader's convenience we remind definitions and some of the properties of mixed volumes and mixed discriminants.

    Submitted 16 October, 2011; v1 submitted 3 October, 2011; originally announced October 2011.

    Comments: 4 pages, an arithmetic mistake and few omissions have been corrected

  15. arXiv:1107.3444  [pdf, other

    math.AG math.AT

    Coverings over Tori and Topological Approach to Klein's Resolvent Problem

    Authors: Yuri Burda

    Abstract: This work answers the question what coverings over a topological torus can be induced from a covering over a space of dimension $k$. The answer to this question is then applied in algebro-geometric context to present obstructions to transforming an algebraic equation depending on several parameters to an equation depending on less parameters by means of a rational transformation.

    Submitted 18 July, 2011; originally announced July 2011.

    Comments: 35 pages, 1 figure

    MSC Class: 14E20; 14H30; 12G99; 14B10; 14B20; 57R20;

  16. arXiv:1104.0424  [pdf, other

    math.AG

    Branching Data for Algebraic Functions and Representability by Radicals

    Authors: Yuri Burda, Askold Khovanskii

    Abstract: The branching data of an algebraic function is a list of orders of local monodromies around branching points. We present branching data that ensure that the algebraic functions having them are representable by radicals. This paper is a review of recent work by the authors and of closely related classical work by Ritt.

    Submitted 3 April, 2011; originally announced April 2011.

    Comments: Submitted for publication to Banach Center Publications on April 1st, 2011

    MSC Class: Primary 12F10; Secondary 14H05; 14H30

  17. arXiv:1005.4101  [pdf, ps, other

    math.AG

    Around rational functions invertible in radicals

    Authors: Yuri Burda

    Abstract: A class of rational functions characterized by some wonderful properties is studied. The properties that identify this class include simple algebra (their inverses can be expressed in radicals), simple topology (the total space of the minimal Galois covering dominating them has genus 0 or 1) and simple local topol- ogy (branching data). Explicit formulae for these functions are obtained as well as… ▽ More

    Submitted 21 May, 2010; originally announced May 2010.