Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Pacheco, M L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09030  [pdf, other

    cs.CL cs.HC

    Studying the Effects of Collaboration in Interactive Theme Discovery Systems

    Authors: Alvin Po-Chun Chen, Dananjay Srinivas, Alexandra Barry, Maksim Seniw, Maria Leonor Pacheco

    Abstract: NLP-assisted solutions have gained considerable traction to support qualitative data analysis. However, there does not exist a unified evaluation framework that can account for the many different settings in which qualitative researchers may employ them. In this paper, we take a first step in this direction by proposing an evaluation framework to study the way in which different tools may result i… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  2. arXiv:2402.14224  [pdf, other

    cs.CL

    Framing in the Presence of Supporting Data: A Case Study in U.S. Economic News

    Authors: Alexandria Leto, Elliot Pickens, Coen D. Needell, David Rothschild, Maria Leonor Pacheco

    Abstract: The mainstream media has much leeway in what it chooses to cover and how it covers it. These choices have real-world consequences on what people know and their subsequent behaviors. However, the lack of objective measures to evaluate editorial choices makes research in this area particularly difficult. In this paper, we argue that there are newsworthy topics where objective measures exist in the f… ▽ More

    Submitted 22 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: total pages: 19; main body pages: 8; total figures: 19

  3. arXiv:2311.11979  [pdf, other

    cs.SE cs.CL

    On the Potential and Limitations of Few-Shot In-Context Learning to Generate Metamorphic Specifications for Tax Preparation Software

    Authors: Dananjay Srinivas, Rohan Das, Saeid Tizpaz-Niari, Ashutosh Trivedi, Maria Leonor Pacheco

    Abstract: Due to the ever-increasing complexity of income tax laws in the United States, the number of US taxpayers filing their taxes using tax preparation software (henceforth, tax software) continues to increase. According to the U.S. Internal Revenue Service (IRS), in FY22, nearly 50% of taxpayers filed their individual income taxes using tax software. Given the legal consequences of incorrectly filing… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to the Proceedings of the Natural Legal Language Processing Workshop, EMNLP 2023

  4. arXiv:2305.05094  [pdf, other

    cs.CL cs.HC

    Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections

    Authors: Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar, Ming Yin, Dan Goldwasser

    Abstract: Experts across diverse disciplines are often interested in making sense of large text collections. Traditionally, this challenge is approached either by noisy unsupervised techniques such as topic models, or by following a manual theme discovery process. In this paper, we expand the definition of a theme to account for more than just a word distribution, and include generalized concepts deemed rel… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL: ACL 2023

  5. Queer In AI: A Case Study in Community-Led Participatory AI

    Authors: Organizers Of QueerInAI, :, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav , et al. (26 additional authors not shown)

    Abstract: We present Queer in AI as a case study for community-led participatory design in AI. We examine how participatory design and intersectional tenets started and shaped this community's programs over the years. We discuss different challenges that emerged in the process, look at ways this organization has fallen short of operationalizing participatory and intersectional principles, and then assess th… ▽ More

    Submitted 8 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: To appear at FAccT 2023

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency

  6. A Holistic Framework for Analyzing the COVID-19 Vaccine Debate

    Authors: Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser

    Abstract: The Covid-19 pandemic has led to infodemic of low quality information leading to poor health decisions. Combating the outcomes of this infodemic is not only a question of identifying false claims, but also reasoning about the decisions individuals make. In this work we propose a holistic analysis framework connecting stance and reason analysis, and fine-grained entity level moral sentiment analysi… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022

    Journal ref: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  7. arXiv:2202.09470  [pdf, other

    cs.CR cs.CL cs.FL cs.LG

    Automated Attack Synthesis by Extracting Finite State Machines from Protocol Specification Documents

    Authors: Maria Leonor Pacheco, Max von Hippel, Ben Weintraub, Dan Goldwasser, Cristina Nita-Rotaru

    Abstract: Automated attack discovery techniques, such as attacker synthesis or model-based fuzzing, provide powerful ways to ensure network protocols operate correctly and securely. Such techniques, in general, require a formal representation of the protocol, often in the form of a finite state machine (FSM). Unfortunately, many protocols are only described in English prose, and implementing even a simple n… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: To appear in IEEE Security and Privacy, 2022

  8. arXiv:2109.04535  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Identifying Morality Frames in Political Tweets using Relational Learning

    Authors: Shamik Roy, Maria Leonor Pacheco, Dan Goldwasser

    Abstract: Extracting moral sentiment from text is a vital component in understanding public opinion, social movements, and policy decisions. The Moral Foundation Theory identifies five moral foundations, each associated with a positive and negative polarity. However, moral sentiment is often motivated by its targets, which can correspond to individuals or collective entities. In this paper, we introduce mor… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021

  9. Modeling Human Mental States with an Entity-based Narrative Graph

    Authors: I-Ta Lee, Maria Leonor Pacheco, Dan Goldwasser

    Abstract: Understanding narrative text requires capturing characters' motivations, goals, and mental states. This paper proposes an Entity-based Narrative Graph (ENG) to model the internal-states of characters in a story. We explicitly model entities, their interactions and the context in which they appear, and learn rich representations for them. We experiment with different task-adaptive pre-training obje… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted at NAACL 2021

    Journal ref: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  10. arXiv:2101.10435  [pdf, other

    cs.CL cs.AI cs.LG

    Randomized Deep Structured Prediction for Discourse-Level Processing

    Authors: Manuel Widmoser, Maria Leonor Pacheco, Jean Honorio, Dan Goldwasser

    Abstract: Expressive text encoders such as RNNs and Transformer Networks have been at the center of NLP models in recent work. Most of the effort has focused on sentence-level tasks, capturing the dependencies between words in a single sentence, or pairs of sentences. However, certain tasks, such as argumentation mining, require accounting for longer texts and complicated structural dependencies between the… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted to EACL 2021

    Journal ref: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

  11. Modeling Content and Context with Deep Relational Learning

    Authors: Maria Leonor Pacheco, Dan Goldwasser

    Abstract: Building models for realistic natural language tasks requires dealing with long texts and accounting for complicated structural dependencies. Neural-symbolic representations have emerged as a way to combine the reasoning capabilities of symbolic methods, with the expressiveness of neural networks. However, most of the existing frameworks for combining neural and symbolic representations have been… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: TACL pre-MIT Press version

    Journal ref: Transactions of the Association for Computational Linguistics, 2021

  12. Leveraging Textual Specifications for Grammar-based Fuzzing of Network Protocols

    Authors: Samuel Jero, Maria Leonor Pacheco, Dan Goldwasser, Cristina Nita-Rotaru

    Abstract: Grammar-based fuzzing is a technique used to find software vulnerabilities by injecting well-formed inputs generated following rules that encode application semantics. Most grammar-based fuzzers for network protocols rely on human experts to manually specify these rules. In this work we study automated learning of protocol rules from textual specifications (i.e. RFCs). We evaluate the automaticall… ▽ More

    Submitted 10 October, 2018; originally announced October 2018.

    Journal ref: The Thirty-First AAAI Conference on Innovative Applications of Artificial Intelligence, IAAI 2019