"An empirical analysis of compute-optimal large language model training."

Jordan Hoffmann et al. (2022)

> Startseite

Details and statistics

DOI: —

access: open

type: Conference or Workshop Paper

metadata version: 2024-01-08

- view
  - electronic edition @ nips.cc (open access)
- export record
  dblp key:
  - conf/nips/HoffmannBMBCRCH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HoffmannBMBCRCH22
Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katherine Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Oriol Vinyals, Jack W. Rae, Laurent Sifre:
An empirical analysis of compute-optimal large language model training. NeurIPS 2022

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.