-
GPT-4 Technical Report
Authors:
OpenAI,
Josh Achiam,
Steven Adler,
Sandhini Agarwal,
Lama Ahmad,
Ilge Akkaya,
Florencia Leoni Aleman,
Diogo Almeida,
Janko Altenschmidt,
Sam Altman,
Shyamal Anadkat,
Red Avila,
Igor Babuschkin,
Suchir Balaji,
Valerie Balcom,
Paul Baltescu,
Haiming Bao,
Mohammad Bavarian,
Jeff Belgum,
Irwan Bello,
Jake Berdine,
Gabriel Bernadett-Shapiro,
Christopher Berner,
Lenny Bogdonoff,
Oleg Boiko
, et al. (256 additional authors not shown)
Abstract:
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…
▽ More
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
△ Less
Submitted 4 March, 2024; v1 submitted 15 March, 2023;
originally announced March 2023.
-
WebGPT: Browser-assisted question-answering with human feedback
Authors:
Reiichiro Nakano,
Jacob Hilton,
Suchir Balaji,
Jeff Wu,
Long Ouyang,
Christina Kim,
Christopher Hesse,
Shantanu Jain,
Vineet Kosaraju,
William Saunders,
Xu Jiang,
Karl Cobbe,
Tyna Eloundou,
Gretchen Krueger,
Kevin Button,
Matthew Knight,
Benjamin Chess,
John Schulman
Abstract:
We fine-tune GPT-3 to answer long-form questions using a text-based web-browsing environment, which allows the model to search and navigate the web. By setting up the task so that it can be performed by humans, we are able to train models on the task using imitation learning, and then optimize answer quality with human feedback. To make human evaluation of factual accuracy easier, models must coll…
▽ More
We fine-tune GPT-3 to answer long-form questions using a text-based web-browsing environment, which allows the model to search and navigate the web. By setting up the task so that it can be performed by humans, we are able to train models on the task using imitation learning, and then optimize answer quality with human feedback. To make human evaluation of factual accuracy easier, models must collect references while browsing in support of their answers. We train and evaluate our models on ELI5, a dataset of questions asked by Reddit users. Our best model is obtained by fine-tuning GPT-3 using behavior cloning, and then performing rejection sampling against a reward model trained to predict human preferences. This model's answers are preferred by humans 56% of the time to those of our human demonstrators, and 69% of the time to the highest-voted answer from Reddit.
△ Less
Submitted 1 June, 2022; v1 submitted 17 December, 2021;
originally announced December 2021.
-
Semicontinuous Banach spaces for Schrödinger's Eq. with Dirac-$δ'$ potential
Authors:
Bradly K Button
Abstract:
Schródinger's equation with distributional $δ$, or $δ'$ potentials has been well studied in the past. There are challenges in simultaneously addressing some of the inherent issues of the system: The functional operator cannot exist entirely within the standard $L^2$ Hilbert spaces. On differentiable manifolds, the domain of the free kinetic energy operator is in the space of harmonic forms. Locall…
▽ More
Schródinger's equation with distributional $δ$, or $δ'$ potentials has been well studied in the past. There are challenges in simultaneously addressing some of the inherent issues of the system: The functional operator cannot exist entirely within the standard $L^2$ Hilbert spaces. On differentiable manifolds, the domain of the free kinetic energy operator is in the space of harmonic forms. Locally, by the Hodge decomposition theorem and the standard distributional calculus, the space of functionals of a $δ$ or $δ'$ potential must be orthogonal to the free kinetic energy operator. Restricting to semicontinuous topologies presents opportunities to address these, and other issues. We develop, in great detail, a formalism of Banach spaces with semicontinuous topologies, and their properties are extensively defined and studied. For $C(\overline{\mathbb{R}})$ functions, the spaces are indistinguishable. The semicontinuous analogs of the $L^p$ spaces, are nontrivial and result in a dense topologically continuous embedding of the semicontinuous $L^p$ spaces into the semicontinuous $C(\overline{\mathbb{R}})$ spaces. Here, certain classes of distributions may be inverted in terms of their primitive functions. Also many operators are inherently self adjoint. We define equivalence relations between the cohomology classes of distributions and derivatives of their associated primitives on local sections of $\overline{\mathbb{R}}$. Here Hamilton's equations are canonical, and define a connection on the fibers of the base space. Semicontinuity provides a resolution to the above domain and interaction problems, and easily integrable Feynman functional. We arrive at a compatible domain which is Krein ($\mathfrak{H}$) over disjoint components of $\overline{\mathbb{R}}$. The subspaces of $\mathfrak{H}$ are isomorphic to the semicontinuous Hilbert spaces of the Hamiltonian.
△ Less
Submitted 29 December, 2017; v1 submitted 19 February, 2017;
originally announced February 2017.
-
Near-Extremal Black Hole Thermodynamics from AdS2/CFT1 Correspondence in The Low Energy Limit of 4D Heterotic String Theory
Authors:
Bradly K. Button,
Leo Rodriguez,
Sujeev Wickramasekara
Abstract:
We compute the asymptotic symmetry group of the four dimensional near-extremal Kerr-Sen black hole within an AdS2/CFT1 correspondence. We do this by performing a Robinson-Wilczek two dimensional reduction and construct a quantum theory of the remaining field content. The resulting energy momentum tensor generates an asymptotic Virasoro algebra, to s-wave, with a calculable central extension. This…
▽ More
We compute the asymptotic symmetry group of the four dimensional near-extremal Kerr-Sen black hole within an AdS2/CFT1 correspondence. We do this by performing a Robinson-Wilczek two dimensional reduction and construct a quantum theory of the remaining field content. The resulting energy momentum tensor generates an asymptotic Virasoro algebra, to s-wave, with a calculable central extension. This center in conjunction with the proper regularized lowest Virasoro eigen-mode yields the near-extremal Kerr-Sen entropy via the statistical Cardy formula. Finally we analyze quantum holomorphic fluxes of the dual CFT giving rise to a finite Hawking temperature weighted by the central charge of the near-extremal Kerr-Sen metric.
△ Less
Submitted 4 October, 2013; v1 submitted 26 July, 2013;
originally announced July 2013.
-
A Near Horizon CFT Dual for Kerr-Newman-$AdS$
Authors:
Bradly K. Button,
Leo Rodriguez,
Catherine A. Whiting,
Tuna Yildirim
Abstract:
We show that the near horizon regime of a Kerr-Newman-$AdS$ (KN$AdS$) black hole, given by its two dimensional analogue $a la$ Robinson and Wilczek (2005 Phys. Rev. Lett. 95 011303), is asymptotically $AdS_2$ and dual to a one dimensional quantum conformal field theory (CFT). The s-wave contribution of the resulting CFT's energy-momentum-tensor together with the asymptotic symmetries, generate a c…
▽ More
We show that the near horizon regime of a Kerr-Newman-$AdS$ (KN$AdS$) black hole, given by its two dimensional analogue $a la$ Robinson and Wilczek (2005 Phys. Rev. Lett. 95 011303), is asymptotically $AdS_2$ and dual to a one dimensional quantum conformal field theory (CFT). The s-wave contribution of the resulting CFT's energy-momentum-tensor together with the asymptotic symmetries, generate a centrally extended Virasoro algebra, whose central charge reproduces the Bekenstein-Hawking entropy via Cardy's Formula. Our derived central charge also agrees with the near extremal Kerr/CFT Correspondence (2009 Phys. Rev. D 80, 124008) in the appropriate limits. We also compute the Hawking temperature of the KN$AdS$ black hole by coupling its Robinson and Wilczek two dimensional analogue (RW2DA) to conformal matter.
△ Less
Submitted 27 May, 2011; v1 submitted 8 September, 2010;
originally announced September 2010.