Showing 1–7 of 7 results for author: Grunde-McLaughlin, M

  1. arXiv:2312.11681

    cs.HC cs.AI cs.CL

    Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows

    Authors: Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer

    Abstract: LLM chains enable complex tasks by decomposing work into a sequence of subtasks. Similarly, the more established techniques of crowdsourcing workflows decompose complex tasks into smaller tasks for human crowdworkers. Chains address LLM errors analogously to the way crowdsourcing workflows address human error. To characterize opportunities for LLM chaining, we survey 107 papers across the crowdsou…

    Submitted 6 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2309.10108

    cs.HC

    How Do Data Analysts Respond to AI Assistance? A Wizard-of-Oz Study

    Authors: Ken Gu, Madeleine Grunde-McLaughlin, Andrew M. McNutt, Jeffrey Heer, Tim Althoff

    Abstract: Data analysis is challenging as analysts must navigate nuanced decisions that may yield divergent conclusions. AI assistants have the potential to support analysts in planning their analyses, enabling more robust decision making. Though AI-based assistants that target code execution (e.g., Github Copilot) have received significant attention, limited research addresses assistance for both analysis…

    Submitted 4 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted to CHI 2024

  3. arXiv:2212.06823

    cs.HC cs.AI

    Explanations Can Reduce Overreliance on AI Systems During Decision-Making

    Authors: Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael Bernstein, Ranjay Krishna

    Abstract: Prior work has identified a resilient phenomenon that threatens the performance of human-AI decision-making teams: overreliance, when people agree with an AI, even when it is incorrect. Surprisingly, overreliance does not reduce when the AI produces explanations for its predictions, compared to only providing predictions. Some have argued that overreliance results from cognitive biases or uncalibr…

    Submitted 26 January, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: CSCW 2023

  4. arXiv:2204.07190

    cs.CV

    Measuring Compositional Consistency for Video Question Answering

    Authors: Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala

    Abstract: Recent video question answering benchmarks indicate that state-of-the-art models struggle to answer compositional questions. However, it remains unclear which types of compositional reasoning cause models to mispredict. Furthermore, it is difficult to discern whether models arrive at answers using compositional reasoning or by leveraging data biases. In this paper, we develop a question decomposit…

    Submitted 24 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: To appear in CVPR 2022. 23 pages, 12 figures and 12 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

  5. arXiv:2204.06105

    cs.CV

    AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning

    Authors: Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala

    Abstract: Prior benchmarks have analyzed models' answers to questions about videos in order to measure visual compositional reasoning. Action Genome Question Answering (AGQA) is one such benchmark. AGQA provides a training/test split with balanced answer distributions to reduce the effect of linguistic biases. However, some biases remain in several AGQA categories. We introduce AGQA 2.0, a version of this b…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 7 pages, 2 figures, 7 tables, update to AGQA arXiv:2103.16002

  6. arXiv:2103.16002

    cs.CV cs.CL

    AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning

    Authors: Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala

    Abstract: Visual events are a composition of temporal actions involving actors spatially interacting with objects. When developing computer vision models that can reason about compositional spatio-temporal events, we need benchmarks that can analyze progress and uncover shortcomings. Existing video question answering benchmarks are useful, but they often conflate multiple sources of error into one accuracy…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 8 pages, 15 pages supplementary, 12 figures. To be published in CVPR 2021

  7. arXiv:2008.00142

    cs.HC stat.ME

    Bayesian-Assisted Inference from Visualized Data

    Authors: Yea-Seul Kim, Paula Kayongo, Madeleine Grunde-McLaughlin, Jessica Hullman

    Abstract: A Bayesian view of data interpretation suggests that a visualization user should update their existing beliefs about a parameter's value in accordance with the amount of information about the parameter value captured by the new observations. Extending recent work applying Bayesian models to understand and evaluate belief updating from visualizations, we show how the predictions of Bayesian inferen…

    Submitted 8 August, 2020; v1 submitted 31 July, 2020; originally announced August 2020.