Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Piotrowski, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04165  [pdf, other

    cs.LG

    Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe

    Authors: Alicja Ziarko, Albert Q. Jiang, Bartosz Piotrowski, Wenda Li, Mateja Jamnik, Piotr Miłoś

    Abstract: Text embeddings are essential for many tasks, such as document retrieval, clustering, and semantic similarity assessment. In this paper, we study how to contrastively train text embedding models in a compute-optimal fashion, given a suite of pre-trained decoder-only language models. Our innovation is an algorithm that produces optimal configurations of model sizes, data quantities, and fine-tuning… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2304.00994  [pdf, other

    cs.AI cs.LG cs.LO cs.SC

    Machine-Learned Premise Selection for Lean

    Authors: Bartosz Piotrowski, Ramon Fernández Mir, Edward Ayers

    Abstract: We introduce a machine-learning-based tool for the Lean proof assistant that suggests relevant premises for theorems being proved by a user. The design principles for the tool are (1) tight integration with the proof assistant, (2) ease of use and installation, (3) a lightweight and fast approach. For this purpose, we designed a custom version of the random forest model, trained in an online fashi… ▽ More

    Submitted 14 June, 2023; v1 submitted 17 March, 2023; originally announced April 2023.

  3. arXiv:2303.06686  [pdf, other

    cs.AI cs.LG cs.LO cs.SC

    MizAR 60 for Mizar 50

    Authors: Jan Jakubův, Karel Chvalovský, Zarathustra Goertzel, Cezary Kaliszyk, Mirek Olšák, Bartosz Piotrowski, Stephan Schulz, Martin Suda, Josef Urban

    Abstract: As a present to Mizar on its 50th anniversary, we develop an AI/TP system that automatically proves about 60\% of the Mizar theorems in the hammer setting. We also automatically prove 75\% of the Mizar theorems when the automated provers are helped by using only the premises used in the human-written Mizar proofs. We describe the methods and large-scale experiments leading to these results. This i… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

  4. arXiv:2303.04488  [pdf, other

    cs.LG cs.AI cs.LO

    Magnushammer: A Transformer-Based Approach to Premise Selection

    Authors: Maciej Mikuła, Szymon Tworkowski, Szymon Antoniak, Bartosz Piotrowski, Albert Qiaochu Jiang, Jin Peng Zhou, Christian Szegedy, Łukasz Kuciński, Piotr Miłoś, Yuhuai Wu

    Abstract: This paper presents a novel approach to premise selection, a crucial reasoning task in automated theorem proving. Traditionally, symbolic methods that rely on extensive domain knowledge and engineering effort are applied to this task. In contrast, this work demonstrates that contrastive training with the transformer architecture can achieve higher-quality retrieval of relevant premises, without th… ▽ More

    Submitted 18 March, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: ICLR 2024

  5. arXiv:2302.12433  [pdf, ps, other

    cs.CL cs.AI cs.LO

    ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics

    Authors: Zhangir Azerbayev, Bartosz Piotrowski, Hailey Schoelkopf, Edward W. Ayers, Dragomir Radev, Jeremy Avigad

    Abstract: We introduce ProofNet, a benchmark for autoformalization and formal proving of undergraduate-level mathematics. The ProofNet benchmarks consists of 371 examples, each consisting of a formal theorem statement in Lean 3, a natural language theorem statement, and a natural language proof. The problems are primarily drawn from popular undergraduate pure mathematics textbooks and cover topics such as r… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  6. arXiv:2104.05207  [pdf, other

    cs.LO cs.AI

    Online Machine Learning Techniques for Coq: A Comparison

    Authors: Liao Zhang, Lasse Blaauwbroek, Bartosz Piotrowski, Prokop Černý, Cezary Kaliszyk, Josef Urban

    Abstract: We present a comparison of several online machine learning techniques for tactical learning and proving in the Coq proof assistant. This work builds on top of Tactician, a plugin for Coq that learns from proofs written by the user to synthesize new proofs. Learning happens in an online manner, meaning that Tactician's machine learning model is updated immediately every time the user performs a ste… ▽ More

    Submitted 7 June, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Intelligent Computer Mathematics 14th International Conference, CICM 2021

  7. arXiv:2004.08212  [pdf, other

    cs.LO cs.AI cs.LG stat.ML

    Stateful Premise Selection by Recurrent Neural Networks

    Authors: Bartosz Piotrowski, Josef Urban

    Abstract: In this work, we develop a new learning-based method for selecting facts (premises) when proving new goals over large formal libraries. Unlike previous methods that choose sets of facts independently of each other by their rank, the new method uses the notion of \emph{state} that is updated each time a choice of a fact is made. Our stateful architecture is based on recurrent neural networks which… ▽ More

    Submitted 11 March, 2020; originally announced April 2020.

    Comments: 13 pages, 1 figure

  8. arXiv:2002.05406  [pdf, other

    cs.AI cs.LG cs.LO cs.NE cs.SC

    ENIGMA Anonymous: Symbol-Independent Inference Guiding Machine (system description)

    Authors: Jan Jakubův, Karel Chvalovský, Miroslav Olšák, Bartosz Piotrowski, Martin Suda, Josef Urban

    Abstract: We describe an implementation of gradient boosting and neural guidance of saturation-style automated theorem provers that does not depend on consistent symbol names across problems. For the gradient-boosting guidance, we manually create abstracted features by considering arity-based encodings of formulas. For the neural guidance, we use symbol-independent graph neural networks (GNNs) and their emb… ▽ More

    Submitted 28 April, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

  9. arXiv:1911.04873  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Can Neural Networks Learn Symbolic Rewriting?

    Authors: Bartosz Piotrowski, Josef Urban, Chad E. Brown, Cezary Kaliszyk

    Abstract: This work investigates if the current neural architectures are adequate for learning symbolic rewriting. Two kinds of data sets are proposed for this research -- one based on automated proofs and the other being a synthetic set of polynomial terms. The experiments with use of the current neural machine translation models are performed and its results are discussed. Ideas for extending this line of… ▽ More

    Submitted 26 May, 2020; v1 submitted 7 November, 2019; originally announced November 2019.

  10. arXiv:1905.07961  [pdf, ps, other

    cs.AI cs.LG cs.LO cs.NE stat.ML

    Guiding Inferences in Connection Tableau by Recurrent Neural Networks

    Authors: Bartosz Piotrowski, Josef Urban

    Abstract: We present a dataset and experiments on applying recurrent neural networks (RNNs) for guiding clause selection in the connection tableau proof calculus. The RNN encodes a sequence of literals from the current branch of the partial proof tree to a hidden vector state; using it, the system selects a clause for extending the proof tree. The training data and learning setup are described, and the resu… ▽ More

    Submitted 9 April, 2020; v1 submitted 20 May, 2019; originally announced May 2019.

  11. arXiv:1802.03375  [pdf, other

    cs.AI cs.LO stat.ML

    ATPboost: Learning Premise Selection in Binary Setting with ATP Feedback

    Authors: Bartosz Piotrowski, Josef Urban

    Abstract: ATPboost is a system for solving sets of large-theory problems by interleaving ATP runs with state-of-the-art machine learning of premise selection from the proofs. Unlike many previous approaches that use multi-label setting, the learning is implemented as binary classification that estimates the pairwise-relevance of (theorem, premise) pairs. ATPboost uses for this the XGBoost gradient boosting… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.