Skip to main content

Showing 1–7 of 7 results for author: Amayuelas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06426  [pdf, other

    cs.CL cs.AI cs.MA

    DebUnc: Mitigating Hallucinations in Large Language Model Agent Communication with Uncertainty Estimations

    Authors: Luke Yoffe, Alfonso Amayuelas, William Yang Wang

    Abstract: To enhance Large Language Model (LLM) capabilities, multi-agent debates have been introduced, where multiple LLMs discuss solutions to a problem over several rounds of debate. However, LLMs often produce incorrect responses that appear deceptively confident, which can mislead other agents. This is partly because agents do not express their confidence levels during standard debates. To address this… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2406.14867  [pdf, other

    cs.LG cs.AI cs.CL

    DistiLRR: Transferring Code Repair for Low-Resource Programming Languages

    Authors: Kyle Wong, Alfonso Amayuelas, Liangming Pan, William Yang Wang

    Abstract: Large language models (LLMs) have shown remarkable performance on code generation tasks. A recent application of LLMs for code generation is iterative code repair, where a model fixes an incorrect program by rationalizing about errors and generating a new program. However, code repair is primarily studied on high-resource languages like Python, and the framework's efficacy is under-explored on low… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.14711  [pdf, other

    cs.CL cs.AI cs.MA

    MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate

    Authors: Alfonso Amayuelas, Xianjun Yang, Antonis Antoniades, Wenyue Hua, Liangming Pan, William Wang

    Abstract: Large Language Models (LLMs) have shown exceptional results on current benchmarks when working individually. The advancement in their capabilities, along with a reduction in parameter size and inference times, has facilitated the use of these models as agents, enabling interactions among multiple models to execute complex tasks. Such collaborations offer several advantages, including the use of sp… ▽ More

    Submitted 26 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2402.03268  [pdf, other

    cs.LG cs.AI cs.CL

    Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

    Authors: Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang

    Abstract: Pre-trained language models (LMs) are able to perform complex reasoning without explicit fine-tuning. To understand how pre-training with a next-token prediction objective contributes to the emergence of such reasoning capability, we propose that we can view an LM as deriving new conclusions by aggregating indirect reasoning paths seen at pre-training time. We found this perspective effective in t… ▽ More

    Submitted 20 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  5. arXiv:2305.13712  [pdf, other

    cs.CL cs.AI

    Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models

    Authors: Alfonso Amayuelas, Kyle Wong, Liangming Pan, Wenhu Chen, William Wang

    Abstract: This paper investigates the capabilities of Large Language Models (LLMs) in the context of understanding their knowledge and uncertainty over questions. Specifically, we focus on addressing known-unknown questions, characterized by high uncertainty due to the absence of definitive answers. To facilitate our study, we collect a new dataset with Known-Unknown Questions (KUQ) and establish a categori… ▽ More

    Submitted 1 July, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  6. arXiv:2209.14464  [pdf, other

    cs.AI cs.LG

    Neural Methods for Logical Reasoning Over Knowledge Graphs

    Authors: Alfonso Amayuelas, Shuai Zhang, Susie Xi Rao, Ce Zhang

    Abstract: Reasoning is a fundamental problem for computers and deeply studied in Artificial Intelligence. In this paper, we specifically focus on answering multi-hop logical queries on Knowledge Graphs (KGs). This is a complicated task because, in real-world scenarios, the graphs tend to be large and incomplete. Most previous works have been unable to create models that accept full First-Order Logical (FOL)… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 14 pages, 5 figures, 11 tables

    Journal ref: International Conference on Learning Representations, 2022

  7. arXiv:2012.11448  [pdf, other

    cs.LG cs.AI

    The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

    Authors: Naman Goel, Alfonso Amayuelas, Amit Deshpande, Amit Sharma

    Abstract: Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we charact… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: To appear in the Proceedings of AAAI 2021