
Showing 1–15 of 15 results for author: Soares, G

  1. arXiv:2405.13009

    cs.CL cs.AI

    METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

    Authors: Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Sherry Shi, Gustavo Soares

    Abstract: Despite the popularity of Large Language Models (LLMs), crafting specific prompts for LLMs to perform particular tasks remains challenging. Users often engage in multiple conversational turns with an LLM-based agent to accomplish their intended task. Recent studies have demonstrated that linguistic feedback, in the form of self-reflections generated by the model, can work as reinforcement during t…

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2402.06229

    cs.HC cs.AI cs.SE

    Exploring Interaction Patterns for Debugging: Enhancing Conversational Capabilities of AI-assistants

    Authors: Bhavya Chopra, Yasharth Bajpai, Param Biyani, Gustavo Soares, Arjun Radhakrishna, Chris Parnin, Sumit Gulwani

    Abstract: The widespread availability of Large Language Models (LLMs) within Integrated Development Environments (IDEs) has led to their speedy adoption. Conversational interactions with LLMs enable programmers to obtain natural language explanations for various software development tasks. However, LLMs often leap to action without sufficient context, giving rise to implicit assumptions and inaccurate respo…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures, 2 tables

  3. arXiv:2312.14231

    cs.SE

    Building Your Own Product Copilot: Challenges, Opportunities, and Needs

    Authors: Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley

    Abstract: A race is underway to embed advanced AI capabilities into products. These product copilots enable users to ask questions in natural language and receive relevant responses that are specific to the user's context. In fact, virtually every large technology company is looking to add these capabilities to their software products. However, for most software engineers, this is often their first encounte…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 11 pages

  4. arXiv:2310.03780

    cs.AI

    Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

    Authors: Tung Phung, Victor-Alexandru Pădurean, Anjali Singh, Christopher Brooks, José Cambronero, Sumit Gulwani, Adish Singla, Gustavo Soares

    Abstract: Generative AI and large language models hold great promise in enhancing programming education by automatically generating individualized feedback for students. We investigate the role of generative AI models in providing human tutor-style programming hints to help students resolve errors in their buggy programs. Recent works have benchmarked state-of-the-art models for various feedback generation…

    Submitted 6 August, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Published in Learning Analytics and Knowledge Conference (LAK) 2024

  5. arXiv:2306.17156

    cs.CY cs.AI cs.CL

    Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors

    Authors: Tung Phung, Victor-Alexandru Pădurean, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

    Abstract: Generative AI and large language models hold great promise in enhancing computing education by powering next-generation educational technologies for introductory programming. Recent works have studied these models for different scenarios relevant to programming education; however, these works are limited for several reasons, as they typically consider already outdated models or only specific scena…

    Submitted 31 July, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: This article is a full version of the poster (extended abstract) from ICER'23

  6. arXiv:2305.14129

    cs.SE cs.LG

    GrACE: Generation using Associated Code Edits

    Authors: Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua…

    Submitted 20 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  7. arXiv:2302.04662

    cs.PL cs.AI cs.CL

    Generating High-Precision Feedback for Programming Syntax Errors using Large Language Models

    Authors: Tung Phung, José Cambronero, Sumit Gulwani, Tobias Kohn, Rupak Majumdar, Adish Singla, Gustavo Soares

    Abstract: Large language models (LLMs), such as Codex, hold great promise in enhancing programming education by automatically generating feedback for students. We investigate using LLMs to generate feedback for fixing syntax errors in Python programs, a key scenario in introductory programming. More concretely, given a student's buggy program, our goal is to generate feedback comprising a fixed program alon…

    Submitted 28 April, 2023; v1 submitted 24 January, 2023; originally announced February 2023.

    Comments: Published in International Conference on Educational Data Mining (EDM) 2023

  8. arXiv:2210.03216

    cs.DM

    Beyond the shortest path: the path length index as a distribution

    Authors: Leonardo B. L. Santos, Luiz Max Carvalho, Giovanni G. Soares, Leonardo N. Ferreira, Igor M. Sokolov

    Abstract: The traditional complex network approach considers only the shortest paths from one node to another, not taking into account several other possible paths. This limitation is significant, for example, in urban mobility studies. In this short report, as the first steps, we present an exhaustive approach to address that problem and show we can go beyond the shortest path, but we do not need to go so…

    Submitted 6 October, 2022; originally announced October 2022.

  9. arXiv:2209.14876

    cs.SE cs.AI

    Repairing Bugs in Python Assignments Using Large Language Models

    Authors: Jialu Zhang, José Cambronero, Sumit Gulwani, Vu Le, Ruzica Piskac, Gustavo Soares, Gust Verbruggen

    Abstract: Students often make mistakes on their introductory programming assignments as part of their learning process. Unfortunately, providing custom repairs for these mistakes can require a substantial amount of time and effort from class instructors. Automated program repair (APR) techniques can be used to synthesize such fixes. Prior work has explored the use of symbolic and neural techniques for APR i…

    Submitted 29 September, 2022; originally announced September 2022.

  10. arXiv:2207.12456

    cs.PL cs.AI cs.SE

    Overwatch: Learning Patterns in Code Edit Sequences

    Authors: Yuhao Zhang, Yasharth Bajpai, Priyanshu Gupta, Ameya Ketkar, Miltiadis Allamanis, Titus Barik, Sumit Gulwani, Arjun Radhakrishna, Mohammad Raza, Gustavo Soares, Ashish Tiwari

    Abstract: Integrated Development Environments (IDEs) provide tool support to automate many source code editing tasks. Traditionally, IDEs use only the spatial context, i.e., the location where the developer is editing, to generate candidate edit recommendations. However, spatial context alone is often not sufficient to confidently predict the developer's next edit, and thus IDEs generate many suggestions at…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 25 pages, 7 Figures, 4 Algorithms, 3 Tables

  11. arXiv:2201.11227

    cs.LG cs.PL

    Synchromesh: Reliable code generation from pre-trained language models

    Authors: Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

    Abstract: Large pre-trained language models have been used to generate code, providing a flexible interface for synthesizing programs from natural language specifications. However, they often violate syntactic and semantic rules of their output language, limiting their practical usability. In this paper, we propose Synchromesh: a framework for substantially improving the reliability of pre-trained models for…

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 10 pages, 9 additional pages of Appendix

  12. arXiv:2109.02445

    cs.AI cs.PL

    Multi-modal Program Inference: a Marriage of Pre-trained Language Models and Component-based Synthesis

    Authors: Kia Rahmani, Mohammad Raza, Sumit Gulwani, Vu Le, Daniel Morris, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Multi-modal program synthesis refers to the task of synthesizing programs (code) from their specification given in different forms, such as a combination of natural language and examples. Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description. Machine-learned pre-trained models (PTMs) are adept at handling ambiguous…

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: To be published in OOPSLA 2021

  13. arXiv:1803.03806

    cs.SE

    Learning Quick Fixes from Code Repositories

    Authors: Reudismam Rolim, Gustavo Soares, Rohit Gheyi, Titus Barik, Loris D'Antoni

    Abstract: Code analyzers such as Error Prone and FindBugs detect code patterns symptomatic of bugs, performance issues, or bad style. These tools express patterns as quick fixes that detect and rewrite unwanted code. However, it is difficult to come up with new quick fixes and decide which ones are useful and frequently appear in real code. We propose to rely on the collective wisdom of programmers and lear…

    Submitted 7 September, 2018; v1 submitted 10 March, 2018; originally announced March 2018.

    Comments: 12 pages

  14. arXiv:1708.03786

    cs.HC cs.CY cs.PL cs.SE

    TraceDiff: Debugging Unexpected Code Behavior Using Trace Divergences

    Authors: Ryo Suzuki, Gustavo Soares, Andrew Head, Elena Glassman, Ruan Reis, Melina Mongiovi, Loris D'Antoni, Bjoern Hartmann

    Abstract: Recent advances in program synthesis offer means to automatically debug student submissions and generate personalized feedback in massive programming classrooms. When automatically generating feedback for programming assignments, a key challenge is designing pedagogically useful hints that are as effective as the manual feedback given by teachers. Through an analysis of teachers' hint-giving pract…

    Submitted 12 August, 2017; originally announced August 2017.

    Comments: VL/HCC 2017

    ACM Class: H.5.2

  15. arXiv:1608.09000

    cs.SE cs.LG cs.PL

    Learning Syntactic Program Transformations from Examples

    Authors: Reudismam Rolim, Gustavo Soares, Loris D'Antoni, Oleksandr Polozov, Sumit Gulwani, Rohit Gheyi, Ryo Suzuki, Bjoern Hartmann

    Abstract: IDEs, such as Visual Studio, automate common transformations, such as Rename and Extract Method refactorings. However, extending these catalogs of transformations is complex and time-consuming. A similar phenomenon appears in intelligent tutoring systems where instructors have to write cumbersome code transformations that describe "common faults" to fix similar student submissions to programming a…

    Submitted 31 August, 2016; originally announced August 2016.