Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Zucker, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07839  [pdf, other

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti , et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, a family of open language models which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide two sizes of models, containing 2B and 9B parameters, and provide pre-tr… ▽ More

    Submitted 28 August, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  3. arXiv:2311.06872  [pdf, other

    math.CO cs.DM math.LO

    Ramsey theorem for trees with successor operation

    Authors: Martin Balko, David Chodounský, Natasha Dobrinen, Jan Hubička, Matěj Konečný, Jaroslav Nešetřil, Andy Zucker

    Abstract: We prove a general Ramsey theorem for trees with a successor operation. This theorem is a common generalization of the Carlson-Simpson Theorem and the Milliken Tree Theorem for regularly branching trees. Our theorem has a number of applications both in finite and infinite combinatorics. For example, we give a short proof of the unrestricted Nešetřil-Rödl theorem, and we recover the Graham-Rothsc… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 37 pages, 9 figures

    MSC Class: 05D10; 05C05; 05C65; 05C55 ACM Class: G.2.2; F.4.1

  4. arXiv:2303.12679  [pdf, ps, other

    math.CO cs.DM math.LO

    Type-respecting amalgamation and big Ramsey degrees

    Authors: Andrés Aranda, Samuel Braunfeld, David Chodounský, Jan Hubička, Matěj Konečný, Jaroslav Nešetřil, Andy Zucker

    Abstract: We give an infinitary extension of the Nešetřil-Rödl theorem for category of relational structures with special type-respecting embeddings.

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 5 pages. Extended abstract

    MSC Class: 05D10; 05C05; 05C65; 05C55 ACM Class: G.2.2; F.4.1

  5. arXiv:2303.10088  [pdf, other

    math.CO cs.DM math.LO

    Characterisation of the big Ramsey degrees of the generic partial order

    Authors: Martin Balko, David Chodounský, Natasha Dobrinen, Jan Hubička, Matěj Konečný, Lluis Vena, Andy Zucker

    Abstract: As a result of 33 intercontinental Zoom calls, we characterise big Ramsey degrees of the generic partial order. This is an infinitary extension of the well known fact that finite partial orders endowed with linear extensions form a Ramsey class (this result was announced by Nešetřil and Rödl in 1984 with first published proof by Paoli, Trotter and Walker in 1985). Towards this, we refine earlier u… ▽ More

    Submitted 17 June, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 34 pages, 7 figures. Minor revision incorporating suggestions of the anonymous referee. Fixed typo in Definition 1.5

    MSC Class: 05D10; 05C05; 05C65; 05C55; 06A07 ACM Class: G.2.2; F.4.1

  6. arXiv:2105.10542  [pdf, ps, other

    math.CO cs.DM cs.LO math.LO

    Big Ramsey degrees of the generic partial order

    Authors: Martin Balko, David Chodounský, Natasha Dobrinen, Jan Hubička, Matěj Konečný, Lluis Vena, Andy Zucker

    Abstract: As a result of 33 intercontinental Zoom calls, we characterise big Ramsey degrees of the generic partial order in a similar way as Devlin characterised big Ramsey degrees of the generic linear order (the order of rationals).

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: 6 pages, extended abstract accepted to EUROCOMB 2021

    MSC Class: 05D10; 05C05; 05C65; 05C55= ACM Class: G.2.2; F.4.1