Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Henley, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14589  [pdf, other

    cs.HC

    Wandercode: An Interaction Design for Code Recommenders to Reduce Information Overload, Ease Exploration, and Save Screen Space

    Authors: Austin Z. Henley, David Shepherd, Scott D. Fleming

    Abstract: In this paper, we present Wandercode, a novel interaction design for recommender systems that recommend code locations to aid programmers in software development tasks. In particular, our design aims to improve upon prior designs by reducing information overload, by better supporting the exploration of recommendations, and by making more efficient use of screen space. During our design process, we… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  2. Improving Steering and Verification in AI-Assisted Data Analysis with Interactive Task Decomposition

    Authors: Majeed Kazemitabaar, Jack Williams, Ian Drosos, Tovi Grossman, Austin Henley, Carina Negreanu, Advait Sarkar

    Abstract: LLM-powered tools like ChatGPT Data Analysis, have the potential to help users tackle the challenging task of data analysis programming, which requires expertise in data processing, programming, and statistics. However, our formative study (n=15) uncovered serious challenges in verifying AI-generated results and steering the AI (i.e., guiding the AI system to produce the desired output). We develo… ▽ More

    Submitted 1 August, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Published at UIST 2024; 19 pages, 9 figures, and 2 tables

    Journal ref: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology (UIST 2024)

  3. arXiv:2405.01556  [pdf, other

    cs.SE cs.AI cs.CL

    Semantically Aligned Question and Code Generation for Automated Insight Generation

    Authors: Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani, Austin Z. Henley, Vu Le, Chris Parnin, Mukul Singh, Gust Verbruggen

    Abstract: Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to… ▽ More

    Submitted 21 March, 2024; originally announced May 2024.

  4. arXiv:2403.07762  [pdf, other

    cs.HC

    Supporting Annotators with Affordances for Efficiently Labeling Conversational Data

    Authors: Austin Z. Henley, David Piorkowski

    Abstract: Without well-labeled ground truth data, machine learning-based systems would not be as ubiquitous as they are today, but these systems rely on substantial amounts of correctly labeled data. Unfortunately, crowdsourced labeling is time consuming and expensive. To address the concerns of effort and tedium, we designed CAL, a novel interface to aid in data labeling. We made several key design decisio… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  5. CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator Needs

    Authors: Majeed Kazemitabaar, Runlong Ye, Xiaoning Wang, Austin Z. Henley, Paul Denny, Michelle Craig, Tovi Grossman

    Abstract: Timely, personalized feedback is essential for students learning programming. LLM-powered tools like ChatGPT offer instant support, but reveal direct answers with code, which may hinder deep conceptual engagement. We developed CodeAid, an LLM-powered programming assistant delivering helpful, technically correct responses, without revealing code solutions. CodeAid answers conceptual questions, gene… ▽ More

    Submitted 25 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: CHI 2024 Paper - The paper includes 17 pages, 8 figures, 2 tables, along with a 2-page appendix

  6. arXiv:2312.14231  [pdf, other

    cs.SE

    Building Your Own Product Copilot: Challenges, Opportunities, and Needs

    Authors: Chris Parnin, Gustavo Soares, Rahul Pandita, Sumit Gulwani, Jessica Rich, Austin Z. Henley

    Abstract: A race is underway to embed advanced AI capabilities into products. These product copilots enable users to ask questions in natural language and receive relevant responses that are specific to the user's context. In fact, virtually every large technology company is looking to add these capabilities to their software products. However, for most software engineers, this is often their first encounte… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 11 pages

  7. arXiv:2310.16164  [pdf, other

    cs.HC

    Conversational Challenges in AI-Powered Data Science: Obstacles, Needs, and Design Opportunities

    Authors: Bhavya Chopra, Ananya Singha, Anna Fariha, Sumit Gulwani, Chris Parnin, Ashish Tiwari, Austin Z. Henley

    Abstract: Large Language Models (LLMs) are being increasingly employed in data science for tasks like data preprocessing and analytics. However, data scientists encounter substantial obstacles when conversing with LLM-powered chatbots and acting on their suggestions and answers. We conducted a mixed-methods study, including contextual observations, semi-structured interviews (n=14), and a survey (n=114), to… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 24 pages, 8 figures

  8. arXiv:2309.14049  [pdf, other

    cs.HC

    How Novices Use LLM-Based Code Generators to Solve CS1 Coding Tasks in a Self-Paced Learning Environment

    Authors: Majeed Kazemitabaar, Xinying Hou, Austin Henley, Barbara J. Ericson, David Weintrop, Tovi Grossman

    Abstract: As Large Language Models (LLMs) gain in popularity, it is important to understand how novice programmers use them. We present a thematic analysis of 33 learners, aged 10-17, independently learning Python through 45 code-authoring tasks using Codex, an LLM-based code generator. We explore several questions related to how learners used these code generators and provide an analysis of the properties… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 12 pages, Peer-Reviewed, Accepted for publication in the proceedings of the 2023 ACM Koli Calling International Conference on Computing Education Research

  9. arXiv:2210.05506  [pdf, other

    cs.SE cs.AI cs.HC cs.LG

    Follow-up Attention: An Empirical Study of Developer and Neural Model Code Exploration

    Authors: Matteo Paltenghi, Rahul Pandita, Austin Z. Henley, Albert Ziegler

    Abstract: Recent neural models of code, such as OpenAI Codex and AlphaCode, have demonstrated remarkable proficiency at code generation due to the underlying attention mechanism. However, it often remains unclear how the models actually process code, and to what extent their reasoning and the way their attention mechanism scans the code matches the patterns of developers. A poor understanding of the model r… ▽ More

    Submitted 29 August, 2024; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Published at IEEE Transactions on Software Engineering

  10. arXiv:2206.06260  [pdf, other

    cs.SE

    OpenCBS: An Open-Source COBOL Defects Benchmark Suite

    Authors: Dylan Lee, Austin Henley, Bill Hinshaw, Rahul Pandita

    Abstract: As the current COBOL workforce retires, entry-level developers are left to keep complex legacy systems maintained and operational. This creates a massive gap in knowledge and ability as companies are having their veteran developers replaced with a new, inexperienced workforce. Additionally, the lack of COBOL and mainframe technology in the current academic curriculum further increases the learning… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  11. arXiv:2204.08108  [pdf, other

    cs.SE

    How are Software Repositories Mined? A Systematic Literature Review of Workflows, Methodologies, Reproducibility, and Tools

    Authors: Adam Tutko, Austin Z. Henley, Audris Mockus

    Abstract: With the advent of open source software, a veritable treasure trove of previously proprietary software development data was made available. This opened the field of empirical software engineering research to anyone in academia. Data that is mined from software projects, however, requires extensive processing and needs to be handled with utmost care to ensure valid conclusions. Since the software d… ▽ More

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: 11 Pages

    MSC Class: 68N99

  12. arXiv:2102.06098  [pdf, other

    cs.SE

    An Inquisitive Code Editor for Addressing Novice Programmers' Misconceptions of Program Behavior

    Authors: Austin Z. Henley, Julian Ball, Benjamin Klein, Aiden Rutter, Dylan Lee

    Abstract: Novice programmers face numerous barriers while attempting to learn how to code that may deter them from pursuing a computer science degree or career in software development. In this work, we propose a tool concept to address the particularly challenging barrier of novice programmers holding misconceptions about how their code behaves. Specifically, the concept involves an inquisitive code editor… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Accepted to ICSE-JSEET'21

  13. arXiv:2011.06244  [pdf, other

    cs.SE

    A Fine-grained Data Set and Analysis of Tangling in Bug Fixing Commits

    Authors: Steffen Herbold, Alexander Trautsch, Benjamin Ledel, Alireza Aghamohammadi, Taher Ahmed Ghaleb, Kuljit Kaur Chahal, Tim Bossenmaier, Bhaveet Nagaria, Philip Makedonski, Matin Nili Ahmadabadi, Kristof Szabados, Helge Spieker, Matej Madeja, Nathaniel Hoy, Valentina Lenarduzzi, Shangwen Wang, Gema Rodríguez-Pérez, Ricardo Colomo-Palacios, Roberto Verdecchia, Paramvir Singh, Yihao Qin, Debasish Chakroborti, Willard Davis, Vijay Walunj, Hongjun Wu , et al. (23 additional authors not shown)

    Abstract: Context: Tangled commits are changes to software that address multiple concerns at once. For researchers interested in bugs, tangled commits mean that they actually study not only bugs, but also other concerns irrelevant for the study of bugs. Objective: We want to improve our understanding of the prevalence of tangling and the types of changes that are tangled within bug fixing commits. Metho… ▽ More

    Submitted 13 October, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: Status: Accepted at Empirical Software Engineering

  14. arXiv:2008.03439  [pdf, ps, other

    cs.SE

    More Effective Software Repository Mining

    Authors: Adam Tutko, Austin Henley, Audris Mockus

    Abstract: Background: Data mining and analyzing of public Git software repositories is a growing research field. The tools used for studies that investigate a single project or a group of projects have been refined, but it is not clear whether the results obtained on such ``convenience samples'' generalize. Aims: This paper aims to elucidate the difficulties faced by researchers who would like to ascertain… ▽ More

    Submitted 16 August, 2020; v1 submitted 8 August, 2020; originally announced August 2020.

    Comments: 5 pages, 3 figures, Submitted to ESEM2020 Emerging Results track