Showing 1–9 of 9 results for author: Bonadiman, D

  1. arXiv:2403.05766  [pdf, other]

    cs.CL

    FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs

    Authors: Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, Arshit Gupta

    Abstract: Planning is a crucial task for agents in task oriented dialogs (TODs). Human agents typically resolve user issues by following predefined workflows, decomposing workflow steps into actionable items, and performing actions by executing APIs in order; all of which require reasoning and planning. With the recent advances in LLMs, there have been increasing attempts to use them for task planning and A…

    Submitted 4 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 (Camera Ready) (fixed minor typos)

  2. arXiv:2403.02567  [pdf, other]

    cs.CL cs.AI

    Eliciting Better Multilingual Structured Reasoning from LLMs through Code

    Authors: Bryan Li, Tamer Alkhouli, Daniele Bonadiman, Nikolaos Pappas, Saab Mansour

    Abstract: The development of large language models (LLM) has shown progress on reasoning, though studies have largely considered either English or simple reasoning tasks. To address this, we introduce a multilingual structured reasoning and explanation dataset, termed xSTREET, that covers four tasks across six languages. xSTREET exposes a gap in base LLM performance between English and non-English reasoning…

    Submitted 12 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.06147  [pdf, other]

    cs.AI cs.CL

    DeAL: Decoding-time Alignment for Large Language Models

    Authors: James Y. Huang, Sailik Sengupta, Daniele Bonadiman, Yi-an Lai, Arshit Gupta, Nikolaos Pappas, Saab Mansour, Katrin Kirchhoff, Dan Roth

    Abstract: Large Language Models (LLMs) are nowadays expected to generate content aligned with human preferences. Current work focuses on alignment at model training time, through techniques such as Reinforcement Learning with Human Feedback (RLHF). However, it is unclear if such methods are an effective choice to teach alignment objectives to the model. First, the inability to incorporate multiple, custom r…

    Submitted 20 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: The appendix contains data that is offensive / disturbing in nature

  4. arXiv:2212.08120  [pdf, other]

    cs.CL cs.AI

    Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems

    Authors: Denis Emelin, Daniele Bonadiman, Sawsan Alqahtani, Yi Zhang, Saab Mansour

    Abstract: Pre-trained language models (PLM) have advanced the state-of-the-art across NLP applications, but lack domain-specific knowledge that does not naturally occur in pre-training data. Previous studies augmented PLMs with symbolic knowledge for different downstream NLP tasks. However, knowledge bases (KBs) utilized in these studies are usually large-scale and static, in contrast to small, domain-speci…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Published at EMNLP 2022 (main conference)

  5. arXiv:2212.08099  [pdf, other]

    cs.CL cs.AI

    DFEE: Interactive DataFlow Execution and Evaluation Kit

    Authors: Han He, Song Feng, Daniele Bonadiman, Yi Zhang, Saab Mansour

    Abstract: DataFlow has been emerging as a new paradigm for building task-oriented chatbots due to its expressive semantic representations of the dialogue tasks. Despite the availability of a large dataset SMCalFlow and a simplified syntax, the development and evaluation of DataFlow-based chatbots remain challenging due to the system complexity and the lack of downstream toolchains. In this demonstration, we…

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI-23: the Thirty-Seventh AAAI Conference on Artificial Intelligence

  6. arXiv:2003.02349  [pdf, ps, other]

    cs.CL

    A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection

    Authors: Daniele Bonadiman, Alessandro Moschitti

    Abstract: An essential task of most Question Answering (QA) systems is to re-rank the set of answer candidates, i.e., Answer Sentence Selection (A2S). These candidates are typically sentences either extracted from one or more documents preserving their natural order or retrieved by a search engine. Most state-of-the-art approaches to the task use huge neural models, such as BERT, or complex attentive archit…

    Submitted 4 March, 2020; originally announced March 2020.

  7. arXiv:1905.12786  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Large Scale Question Paraphrase Retrieval with Smoothed Deep Metric Learning

    Authors: Daniele Bonadiman, Anjishnu Kumar, Arpit Mittal

    Abstract: The goal of a Question Paraphrase Retrieval (QPR) system is to retrieve equivalent questions that result in the same answer as the original question. Such a system can be used to understand and answer rare and noisy reformulations of common questions by mapping them to a set of canonical forms. This has large-scale applications for community Question Answering (cQA) and open-domain spoken language…

    Submitted 29 May, 2019; originally announced May 2019.

  8. arXiv:1806.08009  [pdf, other]

    cs.CL

    Injecting Relational Structural Representation in Neural Networks for Question Similarity

    Authors: Antonio Uva, Daniele Bonadiman, Alessandro Moschitti

    Abstract: Effectively using full syntactic parsing information in Neural Networks (NNs) to solve relational tasks, e.g., question similarity, is still an open problem. In this paper, we propose to inject structural representations in NNs by (i) learning an SVM model using Tree Kernels (TKs) on relatively few pairs of questions (few thousands) as gold standard (GS) training data is typically scarce, (ii) pre…

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: ACL 2018

  9. arXiv:1702.03706  [pdf, other]

    cs.CL

    Multitask Learning with Deep Neural Networks for Community Question Answering

    Authors: Daniele Bonadiman, Antonio Uva, Alessandro Moschitti

    Abstract: In this paper, we developed a deep neural network (DNN) that learns to solve simultaneously the three tasks of the cQA challenge proposed by the SemEval-2016 Task 3, i.e., question-comment similarity, question-question similarity and new question-comment similarity. The latter is the main task, which can exploit the previous two for achieving better results. Our DNN is trained jointly on all the t…

    Submitted 13 February, 2017; originally announced February 2017.