Zum Hauptinhalt springen

Showing 1–46 of 46 results for author: Galley, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01761  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking Interpretability in the Era of Large Language Models

    Authors: Chandan Singh, Jeevana Priya Inala, Michel Galley, Rich Caruana, Jianfeng Gao

    Abstract: Interpretable machine learning has exploded as an area of interest over the last decade, sparked by the rise of increasingly large datasets and deep neural networks. Simultaneously, large language models (LLMs) have demonstrated remarkable capabilities across a wide array of tasks, offering a chance to rethink opportunities in interpretable machine learning. Notably, the capability to explain in n… ▽ More

    Submitted 30 January, 2024; originally announced February 2024.

    Comments: 7 pages

  2. arXiv:2310.13522  [pdf, other

    cs.CL

    Teaching Language Models to Self-Improve through Interactive Demonstrations

    Authors: Xiao Yu, Baolin Peng, Michel Galley, Jianfeng Gao, Zhou Yu

    Abstract: The self-improving ability of large language models (LLMs), enabled by prompting them to analyze and revise their own outputs, has garnered significant interest in recent research. However, this ability has been shown to be absent and difficult to learn for smaller models, thus widening the performance gap between state-of-the-art LLMs and more cost-effective and faster ones. To reduce this gap, w… ▽ More

    Submitted 31 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: NAACL 2024 main

  3. arXiv:2310.02255  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

    Authors: Pan Lu, Hritik Bansal, Tony Xia, Jiacheng Liu, Chunyuan Li, Hannaneh Hajishirzi, Hao Cheng, Kai-Wei Chang, Michel Galley, Jianfeng Gao

    Abstract: Large Language Models (LLMs) and Large Multimodal Models (LMMs) exhibit impressive problem-solving skills in many tasks and domains, but their ability in mathematical reasoning in visual contexts has not been systematically studied. To bridge this gap, we present MathVista, a benchmark designed to combine challenges from diverse mathematical and visual tasks. It consists of 6,141 examples, derived… ▽ More

    Submitted 20 January, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 116 pages, 120 figures. Accepted to ICLR 2024

  4. arXiv:2306.00024  [pdf, other

    cs.CL cs.LG

    Self-Verification Improves Few-Shot Clinical Information Extraction

    Authors: Zelalem Gero, Chandan Singh, Hao Cheng, Tristan Naumann, Michel Galley, Jianfeng Gao, Hoifung Poon

    Abstract: Extracting patient information from unstructured text is a critical task in health decision-support and clinical research. Large language models (LLMs) have shown the potential to accelerate clinical curation via few-shot in-context learning, in contrast to supervised learning which requires much more costly human annotations. However, despite drastic advances in modern LLMs such as GPT-4, they st… ▽ More

    Submitted 30 May, 2023; originally announced June 2023.

    Journal ref: IMLH 2023

  5. arXiv:2305.14623  [pdf, other

    cs.CL

    Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models

    Authors: Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang

    Abstract: Fact-checking is an essential task in NLP that is commonly utilized for validating the factual accuracy of claims. Prior work has mainly focused on fine-tuning pre-trained languages models on specific datasets, which can be computationally intensive and time-consuming. With the rapid development of large language models (LLMs), such as ChatGPT and GPT-3, researchers are now exploring their in-cont… ▽ More

    Submitted 31 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by NAACL 2024 Findings

  6. arXiv:2304.09842  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models

    Authors: Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Jianfeng Gao

    Abstract: Large language models (LLMs) have achieved remarkable progress in solving various natural language processing tasks due to emergent reasoning abilities. However, LLMs have inherent limitations as they are incapable of accessing up-to-date information (stored on the Web or in task-specific knowledge bases), using external tools, and performing precise mathematical and logical reasoning. In this pap… ▽ More

    Submitted 31 October, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 32 pages, 10 figures, 24 tables. Accepted to NeurIPS 2023

  7. arXiv:2304.03277  [pdf, other

    cs.CL cs.AI

    Instruction Tuning with GPT-4

    Authors: Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, Jianfeng Gao

    Abstract: Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed. In this paper, we present the first attempt to use GPT-4 to generate instruction-following data for LLM finetuning. Our early experiments on instruction-tuned… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 8 pages. Work in progress. Project page: https://instruction-tuning-with-gpt-4.github.io

  8. arXiv:2303.00908  [pdf, other

    cs.CL

    Interactive Text Generation

    Authors: Felix Faltings, Michel Galley, Baolin Peng, Kianté Brantley, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan

    Abstract: Users interact with text, image, code, or other editors on a daily basis. However, machine learning models are rarely trained in the settings that reflect the interactivity between users and their editor. This is understandable as training AI models with real users is not only slow and costly, but what these models learn may be specific to user interface design choices. Unfortunately, this means m… ▽ More

    Submitted 11 November, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023

  9. arXiv:2302.12813  [pdf, other

    cs.CL cs.AI

    Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback

    Authors: Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao

    Abstract: Large language models (LLMs), such as ChatGPT, are able to generate human-like, fluent responses for many downstream tasks, e.g., task-oriented dialog and question answering. However, applying LLMs to real-world, mission-critical applications remains challenging mainly due to their tendency to generate hallucinations and their inability to use external knowledge. This paper proposes a LLM-Augmente… ▽ More

    Submitted 8 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 15 pages

  10. arXiv:2302.11520  [pdf, other

    cs.CL

    Guiding Large Language Models via Directional Stimulus Prompting

    Authors: Zekun Li, Baolin Peng, Pengcheng He, Michel Galley, Jianfeng Gao, Xifeng Yan

    Abstract: We introduce Directional Stimulus Prompting, a novel framework for guiding black-box large language models (LLMs) toward specific desired outputs. Instead of directly adjusting LLMs, our method employs a small tunable policy model (e.g., T5) to generate an auxiliary directional stimulus prompt for each input instance. These directional stimulus prompts act as nuanced, instance-specific hints and c… ▽ More

    Submitted 9 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted by NeurIPS2023. The code and data are available at https://github.com/Leezekun/Directional-Stimulus-Prompting

  11. arXiv:2212.10018  [pdf, other

    cs.CL

    DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization

    Authors: Yu Li, Baolin Peng, Pengcheng He, Michel Galley, Zhou Yu, Jianfeng Gao

    Abstract: Dialogue summarization has recently garnered significant attention due to its wide range of applications. However, existing methods for summarizing dialogues have limitations because they do not take into account the inherent structure of dialogue and rely heavily on labeled data, which can lead to poor performance in new domains. In this work, we propose DIONYSUS (dynamic input optimization in pr… ▽ More

    Submitted 26 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted for ACL 2023

  12. arXiv:2212.10008  [pdf, other

    cs.CL cs.AI

    Enhancing Task Bot Engagement with Synthesized Open-Domain Dialog

    Authors: Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang

    Abstract: Many efforts have been made to construct dialog systems for different types of conversations, such as task-oriented dialog (TOD) and open-domain dialog (ODD). To better mimic human-level conversations that usually fuse various dialog modes, it is essential to build a system that can effectively handle both TOD and ODD and access different knowledge sources. To address the lack of available data fo… ▽ More

    Submitted 30 July, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: SIGDIAL 2023

  13. arXiv:2212.01956  [pdf, other

    cs.CL

    Grounded Keys-to-Text Generation: Towards Factual Open-Ended Generation

    Authors: Faeze Brahman, Baolin Peng, Michel Galley, Sudha Rao, Bill Dolan, Snigdha Chaturvedi, Jianfeng Gao

    Abstract: Large pre-trained language models have recently enabled open-ended generation frameworks (e.g., prompt-to-text NLG) to tackle a variety of tasks going beyond the traditional data-to-text generation. While this framework is more general, it is under-specified and often leads to a lack of controllability restricting their real-world usage. We propose a new grounded keys-to-text generation task: the… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: EMNLP 2022 Findings camera-ready

  14. arXiv:2206.11309  [pdf, other

    cs.CL

    GODEL: Large-Scale Pre-Training for Goal-Directed Dialog

    Authors: Baolin Peng, Michel Galley, Pengcheng He, Chris Brockett, Lars Liden, Elnaz Nouri, Zhou Yu, Bill Dolan, Jianfeng Gao

    Abstract: We introduce GODEL (Grounded Open Dialogue Language Model), a large pre-trained language model for dialog. In contrast with earlier models such as DialoGPT, GODEL leverages a new phase of grounded pre-training designed to better support adapting GODEL to a wide range of downstream dialog tasks that require information external to the current conversation (e.g., a database or document) to produce g… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  15. arXiv:2203.10133  [pdf, other

    cs.CL cs.AI

    Probing Factually Grounded Content Transfer with Factual Ablation

    Authors: Peter West, Chris Quirk, Michel Galley, Yejin Choi

    Abstract: Despite recent success, large neural models often generate factually incorrect text. Compounding this is the lack of a standard automatic evaluation for factuality--it cannot be meaningfully improved if it cannot be measured. Grounded generation promises a path to solving both of these problems: models draw on a reliable external document (grounding) for factual information, simplifying the challe… ▽ More

    Submitted 28 March, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

  16. arXiv:2110.06536  [pdf, other

    cs.AI

    NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment

    Authors: Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Katja Hofmann, Michel Galley, Ahmed Awadallah

    Abstract: Human intelligence has the remarkable ability to adapt to new tasks and environments quickly. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose IGLU: Interactive Grounded Language Understanding in a Collabor… ▽ More

    Submitted 14 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

  17. arXiv:2106.07192  [pdf, other

    cs.CL

    Automatic Document Sketching: Generating Drafts from Analogous Texts

    Authors: Zeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang, Bill Dolan

    Abstract: The advent of large pre-trained language models has made it possible to make high-quality predictions on how to add or change a sentence in a document. However, the high branching factor inherent to text generation impedes the ability of even the strongest language models to offer useful editing suggestions at a more global or document level. We introduce a new task, document sketching, which invo… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  18. arXiv:2105.06597  [pdf, other

    cs.CL cs.AI

    RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling

    Authors: Yizhe Zhang, Siqi Sun, Xiang Gao, Yuwei Fang, Chris Brockett, Michel Galley, Jianfeng Gao, Bill Dolan

    Abstract: Recent advances in large-scale pre-training such as GPT-3 allow seemingly high quality text to be generated from a given prompt. However, such generation systems often suffer from problems of hallucinated facts, and are not inherently designed to incorporate useful external information. Grounded generation models appear to offer remedies, but their training typically relies on rarely-available par… ▽ More

    Submitted 24 February, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: accepted by AAAI-22, camera ready version

  19. arXiv:2104.08231  [pdf, other

    cs.CL

    An Adversarially-Learned Turing Test for Dialog Generation Models

    Authors: Xiang Gao, Yizhe Zhang, Michel Galley, Bill Dolan

    Abstract: The design of better automated dialogue evaluation metrics offers the potential of accelerate evaluation research on conversational AI. However, existing trainable dialogue evaluation models are generally restricted to classifiers trained in a purely supervised manner, which suffer a significant risk from adversarial attacking (e.g., a nonsensical response that enjoys a high classification score).… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 7 pages, 2 figures

  20. arXiv:2104.06828  [pdf, other

    cs.CL

    Ask what's missing and what's useful: Improving Clarification Question Generation using Global Knowledge

    Authors: Bodhisattwa Prasad Majumder, Sudha Rao, Michel Galley, Julian McAuley

    Abstract: The ability to generate clarification questions i.e., questions that identify useful missing information in a given context, is important in reducing ambiguity. Humans use previous experience with similar contexts to form a global view and compare it to the given context to ascertain what is missing and what is useful in the context. Inspired by this, we propose a model for clarification question… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted in NAACL 2021, Code is available at https://github.com/microsoft/clarification-qgen-globalinfo

  21. arXiv:2103.01863  [pdf, other

    cs.CL cs.AI

    Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

    Authors: Ramakanth Pasunuru, Asli Celikyilmaz, Michel Galley, Chenyan Xiong, Yizhe Zhang, Mohit Bansal, Jianfeng Gao

    Abstract: The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs t… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: AAAI 2021 (13 pages)

  22. arXiv:2010.12826  [pdf, other

    cs.CL

    Text Editing by Command

    Authors: Felix Faltings, Michel Galley, Gerold Hintz, Chris Brockett, Chris Quirk, Jianfeng Gao, Bill Dolan

    Abstract: A prevailing paradigm in neural text generation is one-shot generation, where text is produced in a single step. The one-shot setting is inadequate, however, when the constraints the user wishes to impose on the generated text are dynamic, especially when authoring longer documents. We address this limitation with an interactive text generation setting in which the user interacts with the system b… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

  23. arXiv:2009.06978  [pdf, other

    cs.CL

    Dialogue Response Ranking Training with Large-Scale Human Feedback Data

    Authors: Xiang Gao, Yizhe Zhang, Michel Galley, Chris Brockett, Bill Dolan

    Abstract: Existing open-domain dialog models are generally trained to minimize the perplexity of target human responses. However, some human replies are more engaging than others, spawning more followup interactions. Current conversational models are increasingly capable of producing turns that are context-relevant, but in order to produce compelling agents, these models need to be able to predict and optim… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: Accepted to appear at EMNLP 2020

  24. arXiv:2005.08365  [pdf, other

    cs.CL

    MixingBoard: a Knowledgeable Stylized Integrated Text Generation Platform

    Authors: Xiang Gao, Michel Galley, Bill Dolan

    Abstract: We present MixingBoard, a platform for quickly building demos with a focus on knowledge grounded stylized text generation. We unify existing text generation algorithms in a shared codebase and further adapt earlier algorithms for constrained generation. To borrow advantages from different models, we implement strategies for cross-model integration, from the token probability level to the latent sp… ▽ More

    Submitted 2 July, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: accepted at ACL 2020

  25. arXiv:2005.00613  [pdf, other

    cs.CL

    A Controllable Model of Grounded Response Generation

    Authors: Zeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang, Xiang Gao, Chris Quirk, Rik Koncel-Kedziorski, Jianfeng Gao, Hannaneh Hajishirzi, Mari Ostendorf, Bill Dolan

    Abstract: Current end-to-end neural conversation models inherently lack the flexibility to impose semantic control in the response generation process, often resulting in uninteresting responses. Attempts to boost informativeness alone come at the expense of factual accuracy, as attested by pretrained language models' propensity to "hallucinate" facts. While this may be mitigated by access to background know… ▽ More

    Submitted 14 June, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: AAAI 2021

  26. arXiv:1911.06394  [pdf, other

    cs.CL

    The Eighth Dialog System Technology Challenge

    Authors: Seokhwan Kim, Michel Galley, Chulaka Gunasekara, Sungjin Lee, Adam Atkinson, Baolin Peng, Hannes Schulz, Jianfeng Gao, Jinchao Li, Mahmoud Adada, Minlie Huang, Luis Lastras, Jonathan K. Kummerfeld, Walter S. Lasecki, Chiori Hori, Anoop Cherian, Tim K. Marks, Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta

    Abstract: This paper introduces the Eighth Dialog System Technology Challenge. In line with recent challenges, the eighth edition focuses on applying end-to-end dialog technologies in a pragmatic way for multi-domain task-completion, noetic response selection, audio visual scene-aware dialog, and schema-guided dialog state tracking tasks. This paper describes the task definition, provided datasets, and eval… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: Submitted to NeurIPS 2019 3rd Conversational AI Workshop

  27. arXiv:1911.00536  [pdf, other

    cs.CL cs.LG

    DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

    Authors: Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu, Bill Dolan

    Abstract: We present a large, tunable neural conversational response generation model, DialoGPT (dialogue generative pre-trained transformer). Trained on 147M conversation-like exchanges extracted from Reddit comment chains over a period spanning from 2005 through 2017, DialoGPT extends the Hugging Face PyTorch transformer to attain a performance close to human both in terms of automatic and human evaluatio… ▽ More

    Submitted 2 May, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

    Comments: Accepted by ACL 2020 system demonstration

  28. arXiv:1909.05361  [pdf, other

    cs.CL cs.AI

    Structuring Latent Spaces for Stylized Response Generation

    Authors: Xiang Gao, Yizhe Zhang, Sungjin Lee, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan

    Abstract: Generating responses in a targeted style is a useful yet challenging task, especially in the absence of parallel data. With limited data, existing methods tend to generate responses that are either less stylized or less context-relevant. We propose StyleFusion, which bridges conversation modeling and non-parallel style transfer by sharing a structured latent space. This structure allows the system… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: accepted to appear at EMNLP 2019 (long)

    Journal ref: EMNLP 2019

  29. arXiv:1906.02738  [pdf, other

    cs.CL cs.AI cs.LG

    Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading

    Authors: Lianhui Qin, Michel Galley, Chris Brockett, Xiaodong Liu, Xiang Gao, Bill Dolan, Yejin Choi, Jianfeng Gao

    Abstract: Although neural conversation models are effective in learning how to produce fluent responses, their primary challenge lies in knowing what to say to make the conversation contentful and non-vacuous. We present a new end-to-end approach to contentful neural conversation that jointly models response generation and on-demand machine reading. The key idea is to provide the conversation model with rel… ▽ More

    Submitted 6 June, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: ACL 2019 long paper

  30. arXiv:1905.05293  [pdf, other

    cs.CL

    Towards Content Transfer through Grounded Text Generation

    Authors: Shrimai Prabhumoye, Chris Quirk, Michel Galley

    Abstract: Recent work in neural generation has attracted significant interest in controlling the form of text, such as style, persona, and politeness. However, there has been less work on controlling neural text generation for content. This paper introduces the notion of Content Transfer for long-form text generation, where the task is to generate a next sentence in a document that both fits its context and… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

    Journal ref: Proc. NAACL 2019

  31. arXiv:1903.05759  [pdf, other

    cs.CL

    Consistent Dialogue Generation with Self-supervised Feature Learning

    Authors: Yizhe Zhang, Xiang Gao, Sungjin Lee, Chris Brockett, Michel Galley, Jianfeng Gao, Bill Dolan

    Abstract: Generating responses that are consistent with the dialogue context is one of the central challenges in building engaging conversational agents. We demonstrate that neural conversation models can be geared towards generating consistent responses by maintaining certain features related to topics and personas throughout the conversation. Past work has required external supervision that exploits featu… ▽ More

    Submitted 11 August, 2021; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: Accepted by SIGDIAL 2021. Eventually dropped off for non-technical reason

  32. arXiv:1902.11205  [pdf, other

    cs.CL cs.AI

    Jointly Optimizing Diversity and Relevance in Neural Response Generation

    Authors: Xiang Gao, Sungjin Lee, Yizhe Zhang, Chris Brockett, Michel Galley, Jianfeng Gao, Bill Dolan

    Abstract: Although recent neural conversation models have shown great potential, they often generate bland and generic responses. While various approaches have been explored to diversify the output of the conversation model, the improvement often comes at the cost of decreased relevance. In this paper, we propose a SpaceFusion model to jointly optimize diversity and relevance that essentially fuses the late… ▽ More

    Submitted 4 April, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: Long paper accepted at NAACL 2019

  33. arXiv:1901.03461  [pdf, ps, other

    cs.CL

    Dialog System Technology Challenge 7

    Authors: Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan, Xiang Gao, Huda Alamari, Tim K. Marks, Devi Parikh, Dhruv Batra

    Abstract: This paper introduces the Seventh Dialog System Technology Challenges (DSTC), which use shared datasets to explore the problem of building dialog systems. Recently, end-to-end dialog modeling approaches have been applied to various dialog tasks. The seventh DSTC (DSTC7) focuses on developing technologies related to end-to-end dialog systems for (1) sentence selection, (2) sentence generation and (… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

    Comments: This paper is presented at NIPS2018 2nd Conversational AI workshop

  34. arXiv:1811.00511  [pdf, other

    cs.CL

    Towards Coherent and Cohesive Long-form Text Generation

    Authors: Woon Sang Cho, Pengchuan Zhang, Yizhe Zhang, Xiujun Li, Michel Galley, Chris Brockett, Mengdi Wang, Jianfeng Gao

    Abstract: Generating coherent and cohesive long-form texts is a challenging task. Previous works relied on large amounts of human-generated texts to train neural language models. However, few attempted to explicitly improve neural language models from the perspectives of coherence and cohesion. In this work, we propose a new neural language model that is equipped with two neural discriminators which provide… ▽ More

    Submitted 29 May, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Selected for spotlight oral presentation at NAACL-HLT 2019 Workshop on Narrative Understanding

  35. arXiv:1809.08267  [pdf, other

    cs.CL

    Neural Approaches to Conversational AI

    Authors: Jianfeng Gao, Michel Galley, Lihong Li

    Abstract: The present paper surveys neural approaches to conversational AI that have been developed in the last few years. We group conversational systems into three categories: (1) question answering agents, (2) task-oriented dialogue agents, and (3) chatbots. For each category, we present a review of state-of-the-art neural approaches, draw the connection between them and traditional approaches, and discu… ▽ More

    Submitted 10 September, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

    Comments: Foundations and Trends in Information Retrieval (95 pages)

  36. arXiv:1809.05972  [pdf, other

    cs.CL cs.AI

    Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization

    Authors: Yizhe Zhang, Michel Galley, Jianfeng Gao, Zhe Gan, Xiujun Li, Chris Brockett, Bill Dolan

    Abstract: Responses generated by neural conversational models tend to lack informativeness and diversity. We present Adversarial Information Maximization (AIM), an adversarial learning strategy that addresses these two related but distinct problems. To foster response diversity, we leverage adversarial training that allows distributional matching of synthetic and real responses. To improve informativeness,… ▽ More

    Submitted 6 November, 2018; v1 submitted 16 September, 2018; originally announced September 2018.

    Comments: NIPS 2018

  37. arXiv:1710.07388  [pdf, other

    cs.CL

    Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models

    Authors: Yi Luan, Chris Brockett, Bill Dolan, Jianfeng Gao, Michel Galley

    Abstract: Building a persona-based conversation agent is challenging owing to the lack of large amounts of speaker-specific conversation data for model training. This paper addresses the problem by proposing a multi-task learning approach to training neural conversation models that leverages both conversation data across speakers and other types of data pertaining to the speaker and speaker roles to be mode… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

  38. arXiv:1702.01932  [pdf, other

    cs.CL

    A Knowledge-Grounded Neural Conversation Model

    Authors: Marjan Ghazvininejad, Chris Brockett, Ming-Wei Chang, Bill Dolan, Jianfeng Gao, Wen-tau Yih, Michel Galley

    Abstract: Neural network models are capable of generating extremely natural sounding conversational interactions. Nevertheless, these models have yet to demonstrate that they can incorporate content in the form of factual information or entity-grounded opinion that would enable them to serve in more task-oriented conversational applications. This paper presents a novel, fully data-driven, and knowledge-grou… ▽ More

    Submitted 15 November, 2018; v1 submitted 7 February, 2017; originally announced February 2017.

    Comments: AAAI 2018 (9 pages)

  39. arXiv:1701.08251  [pdf, other

    cs.CL cs.AI cs.CV

    Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation

    Authors: Nasrin Mostafazadeh, Chris Brockett, Bill Dolan, Michel Galley, Jianfeng Gao, Georgios P. Spithourakis, Lucy Vanderwende

    Abstract: The popularity of image sharing on social media and the engagement it creates between users reflects the important role that visual context plays in everyday conversations. We present a novel task, Image-Grounded Conversations (IGC), in which natural-sounding conversations are generated about a shared image. To benchmark progress, we introduce a new multiple-reference dataset of crowd-sourced, eve… ▽ More

    Submitted 19 April, 2017; v1 submitted 28 January, 2017; originally announced January 2017.

  40. arXiv:1606.01541  [pdf, other

    cs.CL

    Deep Reinforcement Learning for Dialogue Generation

    Authors: Jiwei Li, Will Monroe, Alan Ritter, Michel Galley, Jianfeng Gao, Dan Jurafsky

    Abstract: Recent neural models of dialogue generation offer great promise for generating responses for conversational agents, but tend to be shortsighted, predicting utterances one at a time while ignoring their influence on future outcomes. Modeling the future direction of a dialogue is crucial to generating coherent, interesting dialogues, a need which led traditional NLP models of dialogue to draw on rei… ▽ More

    Submitted 29 September, 2016; v1 submitted 5 June, 2016; originally announced June 2016.

  41. arXiv:1604.03968  [pdf, other

    cs.CL cs.AI cs.CV

    Visual Storytelling

    Authors: Ting-Hao, Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell

    Abstract: We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The first release of this dataset, SIND v.1, includes 81,743 unique photos in 20,211 sequences, aligned to both descriptive (caption) and story language. We establish several strong baselines for the storytelling task, and motivate an automatic metric to benc… ▽ More

    Submitted 13 April, 2016; originally announced April 2016.

    Comments: to appear in NAACL 2016

  42. arXiv:1603.06155  [pdf, other

    cs.CL

    A Persona-Based Neural Conversation Model

    Authors: Jiwei Li, Michel Galley, Chris Brockett, Georgios P. Spithourakis, Jianfeng Gao, Bill Dolan

    Abstract: We present persona-based models for handling the issue of speaker consistency in neural response generation. A speaker model encodes personas in distributed embeddings that capture individual characteristics such as background information and speaking style. A dyadic speaker-addressee model captures properties of interactions between two interlocutors. Our models yield qualitative performance impr… ▽ More

    Submitted 8 June, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: Accepted for publication at ACL 2016

  43. arXiv:1510.03055  [pdf, ps, other

    cs.CL

    A Diversity-Promoting Objective Function for Neural Conversation Models

    Authors: Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan

    Abstract: Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message) is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as th… ▽ More

    Submitted 10 June, 2016; v1 submitted 11 October, 2015; originally announced October 2015.

    Comments: In. Proc of NAACL 2016

  44. arXiv:1506.06863  [pdf, other

    cs.CL

    deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets

    Authors: Michel Galley, Chris Brockett, Alessandro Sordoni, Yangfeng Ji, Michael Auli, Chris Quirk, Margaret Mitchell, Jianfeng Gao, Bill Dolan

    Abstract: We introduce Discriminative BLEU (deltaBLEU), a novel metric for intrinsic evaluation of generated text in tasks that admit a diverse range of possible outputs. Reference strings are scored for quality by human raters on a scale of [-1, +1] to weight multi-reference BLEU. In tasks involving generation of conversational responses, deltaBLEU correlates reasonably with human judgments and outperforms… ▽ More

    Submitted 23 June, 2015; v1 submitted 23 June, 2015; originally announced June 2015.

    Comments: 6 pages, to appear at ACL 2015

  45. arXiv:1506.06833  [pdf, other

    cs.CL cs.AI cs.CV

    A Survey of Current Datasets for Vision and Language Research

    Authors: Francis Ferraro, Nasrin Mostafazadeh, Ting-Hao, Huang, Lucy Vanderwende, Jacob Devlin, Michel Galley, Margaret Mitchell

    Abstract: Integrating vision and language has long been a dream in work on artificial intelligence (AI). In the past two years, we have witnessed an explosion of work that brings together vision and language from images to videos and beyond. The available corpora have played a crucial role in advancing this area of research. In this paper, we propose a set of quality metrics for evaluating and analyzing the… ▽ More

    Submitted 19 August, 2015; v1 submitted 22 June, 2015; originally announced June 2015.

    Comments: To appear in EMNLP 2015, short proceedings. Dataset analysis and discussion expanded, including an initial examination into reporting bias for one of them. F.F. and N.M. contributed equally to this work

  46. arXiv:1506.06714  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    A Neural Network Approach to Context-Sensitive Generation of Conversational Responses

    Authors: Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Margaret Mitchell, Jian-Yun Nie, Jianfeng Gao, Bill Dolan

    Abstract: We present a novel response generation system that can be trained end to end on large quantities of unstructured Twitter conversations. A neural network architecture is used to address sparsity issues that arise when integrating contextual information into classic statistical models, allowing the system to take into account previous dialog utterances. Our dynamic-context generative models show con… ▽ More

    Submitted 22 June, 2015; originally announced June 2015.

    Comments: A. Sordoni, M. Galley, M. Auli, C. Brockett, Y. Ji, M. Mitchell, J.-Y. Nie, J. Gao, B. Dolan. 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses. In Proc. of NAACL-HLT. Pages 196-205