
Showing 1–10 of 10 results for author: Ringer, T

  1. arXiv:2408.09237

    cs.SE cs.LG cs.PL

    QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning

    Authors: Alex Sanchez-Stern, Abhishek Varghese, Zhanna Kaufman, Dylan Zhang, Talia Ringer, Yuriy Brun

    Abstract: Formal verification is a promising method for producing reliable software, but the difficulty of manually writing verification proofs severely limits its utility in practice. Recent methods have automated some proof synthesis by guiding a search through the proof space using a theorem prover. Unfortunately, the theorem prover provides only the crudest estimate of progress, resulting in effectively…

    Submitted 9 September, 2024; v1 submitted 17 August, 2024; originally announced August 2024.

    Comments: Published in the International Conference on Software Engineering (ICSE) 2025: Alex Sanchez-Stern, Abhishek Varghese, Zhanna Kaufman, Dylan Zhang, Talia Ringer, and Yuriy Brun, QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning, in Proceedings of the 47th International Conference on Software Engineering (ICSE), 2025

  2. arXiv:2401.12947

    cs.CL cs.AI cs.FL cs.LO cs.PL

    Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

    Authors: Dylan Zhang, Curt Tigges, Zory Zhang, Stella Biderman, Maxim Raginsky, Talia Ringer

    Abstract: This paper investigates the ability of transformer-based models to learn structural recursion from examples. Recursion is a universal concept in both natural and formal languages. Structural recursion is central to the programming language and formal mathematics tasks where symbolic tools currently excel beyond neural models, such as inferring semantic relations between datatypes and emulating pro…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.14699

  3. arXiv:2310.06959

    cs.PL

    Proof Repair across Quotient Type Equivalences

    Authors: Cosmo Viola, Max Fan, Talia Ringer

    Abstract: Proofs in proof assistants like Coq can be brittle, breaking easily in response to changes in the terms and types those proofs depend on. To address this, recent work introduced an algorithm and tool in Coq to automatically repair broken proofs in response to changes that correspond to type equivalences. However, many changes remained out of the scope of this algorithm and tool -- especially chang…

    Submitted 18 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: for associated code, see https://github.com/InnovativeInventor/proof-repair-quotients

  4. arXiv:2305.14699

    cs.LG cs.AI cs.LO cs.PL

    Can Transformers Learn to Solve Problems Recursively?

    Authors: Shizhuo Dylan Zhang, Curt Tigges, Stella Biderman, Maxim Raginsky, Talia Ringer

    Abstract: Neural networks have in recent years shown promise for helping software engineers write programs and even formally verify them. While semantic information plays a crucial part in these processes, it remains unclear to what degree popular neural architectures like transformers are capable of modeling that information. This paper examines the behavior of neural networks learning algorithms relevant…

    Submitted 25 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  5. arXiv:2305.04369

    cs.FL

    Getting More out of Large Language Models for Proofs

    Authors: Shizhuo Dylan Zhang, Talia Ringer, Emily First

    Abstract: Large language models have the potential to simplify formal theorem proving and make it more accessible. But how to get the most out of these models is still an open question. To answer this question, we take a step back and explore the failure cases of these models using common prompting-based techniques. Our talk will discuss these failure cases and what they can teach us about how to get more o…

    Submitted 31 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

  6. arXiv:2303.04910

    cs.LG cs.LO cs.SE

    Baldur: Whole-Proof Generation and Repair with Large Language Models

    Authors: Emily First, Markus N. Rabe, Talia Ringer, Yuriy Brun

    Abstract: Formally verifying software properties is a highly desirable but labor-intensive task. Recent work has developed methods to automate formal verification using proof assistants, such as Coq and Isabelle/HOL, e.g., by training a model to predict one proof step at a time, and using that model to search through the space of possible proofs. This paper introduces a new method to automate formal verific…

    Submitted 15 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  7. arXiv:2208.04738

    cs.CY cs.GL cs.PL

    Long-Term Mentoring for Computer Science Researchers

    Authors: Emily Ruppel, Sihang Liu, Elba Garza, Sukyoung Ryu, Alexandra Silva, Talia Ringer

    Abstract: Early in the pandemic, we -- leaders in the research areas of programming languages (PL) and computer architecture (CA) -- realized that we had a problem: the only way to form new lasting connections in the community was to already have lasting connections in the community. Both of our academic communities had wonderful short-term mentoring programs to address this problem, but it was clear that w…

    Submitted 17 September, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

  8. Passport: Improving Automated Formal Verification Using Identifiers

    Authors: Alex Sanchez-Stern, Emily First, Timothy Zhou, Zhanna Kaufman, Yuriy Brun, Talia Ringer

    Abstract: Formally verifying system properties is one of the most effective ways of improving system quality, but its high manual effort requirements often render it prohibitively expensive. Tools that automate formal verification, by learning from proof corpora to suggest proofs, have just begun to show their promise. These tools are effective because of the richness of the data the proof corpora contain.…

    Submitted 2 August, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Journal ref: ACM Transactions on Programming Languages and Systems (TOPLAS), 45(2):12:1-12:30, June 2023

  9. Proof Repair across Type Equivalences

    Authors: Talia Ringer, RanDair Porter, Nathaniel Yazdani, John Leo, Dan Grossman

    Abstract: We describe a new approach to automatically repairing broken proofs in the Coq proof assistant in response to changes in types. Our approach combines a configurable proof term transformation with a decompiler from proof terms to tactic scripts. The proof term transformation implements transport across equivalences in a way that removes references to the old version of the changed type and does not…

    Submitted 11 May, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Tool repository with code guide: https://github.com/uwplse/pumpkin-pi/blob/v2.0.0/GUIDE.md

  10. QED at Large: A Survey of Engineering of Formally Verified Software

    Authors: Talia Ringer, Karl Palmskog, Ilya Sergey, Milos Gligoric, Zachary Tatlock

    Abstract: Development of formal proofs of correctness of programs can increase actual and perceived reliability and facilitate better understanding of program specifications and their underlying assumptions. Tools supporting such development have been available for over 40 years, but have only recently seen wide practical use. Projects based on construction of machine-checked formal proofs are now reaching…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 183 pages, for errata see https://proofengineering.org/qed_errata.html

    ACM Class: F.3.1; D.2.4; I.2.3

    Journal ref: Foundations and Trends in Programming Languages, Vol. 5, No. 2-3 (Sept. 2019), pp. 102-281