Zum Hauptinhalt springen

Showing 51–100 of 160 results for author: Joty, S

.
  1. A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets

    Authors: Iva Bojic, Josef Halim, Verena Suharman, Sreeja Tar, Qi Chwen Ong, Duy Phung, Mathieu Ravaut, Shafiq Joty, Josip Car

    Abstract: Low-quality data can cause downstream problems in high-stakes applications. Data-centric approach emphasizes on improving dataset quality to enhance model performance. High-quality datasets are needed for general-purpose Large Language Models (LLMs) training, as well as for domain-specific models, which are usually small in size as it is costly to engage a large number of domain experts for their… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Journal ref: 2023.In The Fourth Workshop on Insights from Negative Results in NLP, pages 19-32, Dubrovnik, Croatia. Association for Computational Linguistics

  2. arXiv:2303.10868  [pdf, other

    cs.CL

    Retrieving Multimodal Information for Augmented Generation: A Survey

    Authors: Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Xuan Long Do, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, Shafiq Joty

    Abstract: As Large Language Models (LLMs) become popular, there emerged an important trend of using multimodality to augment the LLMs' generation ability, which enables LLMs to better interact with the world. However, there lacks a unified perception of at which stage and how to incorporate different modalities. In this survey, we review methods that assist and augment generative models by retrieving multim… ▽ More

    Submitted 30 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  3. arXiv:2303.03608  [pdf, other

    cs.CL

    Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation

    Authors: Yixin Liu, Alexander R. Fabbri, Yilun Zhao, Pengfei Liu, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev

    Abstract: Interpretability and efficiency are two important considerations for the adoption of neural automatic metrics. In this work, we develop strong-performing automatic metrics for reference-based summarization evaluation, based on a two-stage evaluation pipeline that first extracts basic information units from one text sequence and then checks the extracted units in another sequence. The metrics we de… ▽ More

    Submitted 16 November, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: EMNLP 2023 Camera Ready Version

  4. arXiv:2303.03004  [pdf, other

    cs.CL

    xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

    Authors: Mohammad Abdullah Matin Khan, M Saiful Bari, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty

    Abstract: Recently, pre-trained large language models (LLMs) have shown impressive abilities in generating codes from natural language descriptions, repairing buggy codes, translating codes between languages, and retrieving relevant code segments. However, the evaluation of these models has often been performed in a scattered way on only one or two specific tasks, in a few languages, at a partial granularit… ▽ More

    Submitted 6 November, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Code & Data available at https://github.com/ntunlp/xCodeEval, https://huggingface.co/datasets/NTU-NLP-sg/xCodeEval. Evaluation framework available at https://github.com/ntunlp/execeval

  5. arXiv:2302.08143  [pdf, other

    cs.CL cs.AI

    Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning?

    Authors: Chengwei Qin, Qian Li, Ruochen Zhao, Shafiq Joty

    Abstract: Prompt tuning (PT) which only tunes the embeddings of an additional sequence of tokens per task, keeping the pre-trained language model (PLM) frozen, has shown remarkable performance in few-shot learning. Despite this, PT has been shown to rely heavily on good initialization of the prompt embeddings. In this work, we study meta prompt tuning (MPT) to systematically explore how meta-learning can he… ▽ More

    Submitted 19 November, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: ACL2023

  6. arXiv:2301.13753  [pdf, ps, other

    cs.CL

    Dynamic Scheduled Sampling with Imitation Loss for Neural Text Generation

    Authors: Xiang Lin, Prathyusha Jwalapuram, Shafiq Joty

    Abstract: State-of-the-art neural text generation models are typically trained to maximize the likelihood of each token in the ground-truth sequence conditioned on the previous target tokens. However, during inference, the model needs to make a prediction conditioned on the tokens generated by itself. This train-test discrepancy is referred to as exposure bias. Scheduled sampling is a curriculum learning st… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  7. arXiv:2212.10929  [pdf, other

    cs.CL cs.AI cs.LG

    SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning

    Authors: M Saiful Bari, Aston Zhang, Shuai Zheng, Xingjian Shi, Yi Zhu, Shafiq Joty, Mu Li

    Abstract: Pre-trained large language models can efficiently interpolate human-written prompts in a natural way. Multitask prompted learning can help generalization through a diverse set of tasks at once, thus enhancing the potential for more effective downstream fine-tuning. To perform efficient multitask-inference in the same batch, parameter-efficient fine-tuning methods such as prompt tuning have been pr… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  8. arXiv:2212.10529  [pdf, other

    cs.CL cs.AI cs.CY

    Evaluating Psychological Safety of Large Language Models

    Authors: Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing

    Abstract: In this work, we designed unbiased prompts to systematically evaluate the psychological safety of large language models (LLMs). First, we tested five different LLMs by using two personality tests: Short Dark Triad (SD-3) and Big Five Inventory (BFI). All models scored higher than the human average on SD-3, suggesting a relatively darker personality pattern. Despite being instruction fine-tuned wit… ▽ More

    Submitted 29 February, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Preprint. Under review

  9. arXiv:2212.10450  [pdf, other

    cs.CL

    Is GPT-3 a Good Data Annotator?

    Authors: Bosheng Ding, Chengwei Qin, Linlin Liu, Yew Ken Chia, Shafiq Joty, Boyang Li, Lidong Bing

    Abstract: Data annotation is the process of labeling data that could be used to train machine learning models. Having high-quality annotation is crucial, as it allows the model to learn the relationship between the input data and the desired output. GPT-3, a large-scale language model developed by OpenAI, has demonstrated impressive zero- and few-shot performance on a wide range of NLP tasks. It is therefor… ▽ More

    Submitted 14 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted by ACL 2023

  10. arXiv:2212.09593  [pdf, other

    cs.CL

    Unsupervised Summarization Re-ranking

    Authors: Mathieu Ravaut, Shafiq Joty, Nancy Chen

    Abstract: With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind their supervised counterparts. Similarly to the supervised setup, we notice a very high variance in quality among summary candidates from th… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages, 1 figure, 10 tables, 23 appendix pages, ACL Findings 2023

  11. arXiv:2212.07981  [pdf, other

    cs.CL

    Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation

    Authors: Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev

    Abstract: Human evaluation is the foundation upon which the evaluation of both summarization systems and automatic metrics rests. However, existing human evaluation studies for summarization either exhibit a low inter-annotator agreement or have insufficient scale, and an in-depth analysis of human evaluation is lacking. Therefore, we address the shortcomings of existing summarization evaluation along the f… ▽ More

    Submitted 6 June, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Camera Ready

  12. arXiv:2211.17142  [pdf, other

    cs.LG cs.AI

    Learning Label Modular Prompts for Text Classification in the Wild

    Authors: Hailin Chen, Amrita Saha, Shafiq Joty, Steven C. H. Hoi

    Abstract: Machine learning models usually assume i.i.d data during training and testing, but data and tasks in real world often change over time. To emulate the transient nature of real world, we propose a challenging but practical task: text classification in-the-wild, which introduces different non-stationary training/testing stages. Decomposing a complex task into modular components can enable robust gen… ▽ More

    Submitted 5 December, 2022; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: accepted to EMNLP 2022

  13. arXiv:2211.15916  [pdf, other

    cs.CL

    BotSIM: An End-to-End Bot Simulation Toolkit for Commercial Task-Oriented Dialog Systems

    Authors: Guangsen Wang, Shafiq Joty, Junnan Li, Steven Hoi

    Abstract: We introduce BotSIM, a modular, open-source Bot SIMulation environment with dialog generation, user simulation and conversation analytics capabilities. BotSIM aims to serve as a one-stop solution for large-scale data-efficient end-to-end evaluation, diagnosis and remediation of commercial task-oriented dialog (TOD) systems to significantly accelerate commercial bot development and evaluation, redu… ▽ More

    Submitted 30 November, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accompanying code documentation at https://opensource.salesforce.com/botsim/latest/index.html. arXiv admin note: text overlap with arXiv:2211.11982

  14. arXiv:2211.11982  [pdf, other

    cs.CL

    BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems

    Authors: Guangsen Wang, Samson Tan, Shafiq Joty, Gang Wu, Jimmy Au, Steven Hoi

    Abstract: We present BotSIM, a data-efficient end-to-end Bot SIMulation toolkit for commercial text-based task-oriented dialog (TOD) systems. BotSIM consists of three major components: 1) a Generator that can infer semantic-level dialog acts and entities from bot definitions and generate user queries via model-based paraphrasing; 2) an agenda-based dialog user Simulator (ABUS) to simulate conversations with… ▽ More

    Submitted 30 November, 2022; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Paper accepted by the EMNLP 2022 System Demo Track; We have open-sourced the toolkit at https://github.com/salesforce/botsim

  15. arXiv:2211.08794  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations

    Authors: Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si, Lidong Bing

    Abstract: Due to the huge amount of parameters, fine-tuning of pretrained language models (PLMs) is prone to overfitting in the low resource scenarios. In this work, we present a novel method that operates on the hidden representations of a PLM to reduce overfitting. During fine-tuning, our method inserts random autoencoders between the hidden layers of a PLM, which transform activations from the previous l… ▽ More

    Submitted 26 May, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by ACL 2023

  16. arXiv:2211.03950  [pdf, other

    cs.AI

    Alleviating Sparsity of Open Knowledge Graphs with Ternary Contrastive Learning

    Authors: Qian Li, Shafiq Joty, Daling Wang, Shi Feng, Yifei Zhang

    Abstract: Sparsity of formal knowledge and roughness of non-ontological construction make sparsity problem particularly prominent in Open Knowledge Graphs (OpenKGs). Due to sparse links, learning effective representation for few-shot entities becomes difficult. We hypothesize that by introducing negative samples, a contrastive learning (CL) formulation could be beneficial in such scenarios. However, existin… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Journal ref: EMNLP Findings 2022

  17. arXiv:2210.08779  [pdf, other

    cs.CL

    Towards Summary Candidates Fusion

    Authors: Mathieu Ravaut, Shafiq Joty, Nancy F. Chen

    Abstract: Sequence-to-sequence deep neural models fine-tuned for abstractive summarization can achieve great performance on datasets with enough human annotations. Yet, it has been shown that they have not reached their full potential, with a wide gap between the top beam search output and the oracle beam. Recently, re-ranking methods have been proposed, to learn to select a better summary candidate. Howeve… ▽ More

    Submitted 26 May, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 4 Figures, 9 Tables, EMNLP 2022

  18. arXiv:2210.06628  [pdf, other

    cs.LG cs.CL

    OpenCQA: Open-ended Question Answering with Charts

    Authors: Shankar Kantharaj, Xuan Long Do, Rixie Tiffany Ko Leong, Jia Qing Tan, Enamul Hoque, Shafiq Joty

    Abstract: Charts are very popular to analyze data and convey important insights. People often analyze visualizations to answer open-ended questions that require explanatory answers. Answering such questions are often difficult and time-consuming as it requires a lot of cognitive and perceptual efforts. To address this challenge, we introduce a new task called OpenCQA, where the goal is to answer an open-end… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  19. arXiv:2209.11386  [pdf, other

    cs.CL cs.AI

    Improving Conversational Recommender System via Contextual and Time-Aware Modeling with Less Domain-Specific Knowledge

    Authors: Lingzhi Wang, Shafiq Joty, Wei Gao, Xingshan Zeng, Kam-Fai Wong

    Abstract: Conversational Recommender Systems (CRS) has become an emerging research topic seeking to perform recommendations through interactive conversations, which generally consist of generation and recommendation modules. Prior work on CRS tends to incorporate more external and domain-specific knowledge like item reviews to enhance performance. Despite the fact that the collection and annotation of the e… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  20. arXiv:2209.06652  [pdf, other

    cs.CL

    CoHS-CQG: Context and History Selection for Conversational Question Generation

    Authors: Xuan Long Do, Bowei Zou, Liangming Pan, Nancy F. Chen, Shafiq Joty, Ai Ti Aw

    Abstract: Conversational question generation (CQG) serves as a vital task for machines to assist humans, such as interactive reading comprehension, through conversations. Compared to traditional single-turn question generation (SQG), CQG is more challenging in the sense that the generated question is required not only to be meaningful, but also to align with the occurred conversation history. While previous… ▽ More

    Submitted 10 October, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted by 29th International Conference on Computational Linguistics (COLING 2022)

  21. arXiv:2209.00840  [pdf, other

    cs.CL

    FOLIO: Natural Language Reasoning with First-Order Logic

    Authors: Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alex Wardle-Solano, Hannah Szabo, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri , et al. (10 additional authors not shown)

    Abstract: Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FO… ▽ More

    Submitted 17 May, 2024; v1 submitted 2 September, 2022; originally announced September 2022.

  22. arXiv:2205.15544  [pdf, other

    cs.CL cs.AI

    Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Model

    Authors: Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

    Abstract: Numerous recent work on unsupervised machine translation (UMT) implies that competent unsupervised translations of low-resource and unrelated languages, such as Nepali or Sinhala, are only possible if the model is trained in a massive multilingual environment, where these low-resource languages are mixed with high-resource counterparts. Nonetheless, while the high-resource languages greatly help k… ▽ More

    Submitted 1 October, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: Published in NeurIPS 2022

  23. arXiv:2203.13867  [pdf, other

    cs.CL cs.LG

    Data Selection Curriculum for Neural Machine Translation

    Authors: Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq Joty

    Abstract: Neural Machine Translation (NMT) models are typically trained on heterogeneous data that are concatenated and randomly shuffled. However, not all of the training data are equally useful to the model. Curriculum training aims to present the data to the NMT models in a meaningful order. In this work, we introduce a two-stage curriculum training framework for NMT where we fine-tune a base NMT model o… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

  24. arXiv:2203.10244  [pdf, other

    cs.CL

    ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

    Authors: Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque

    Abstract: Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in their questions. However, most existing datasets do not focus on such complex reasoning questions as their questions are template-based and answers come from a f… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022 Findings

  25. arXiv:2203.06569  [pdf, other

    cs.CL

    SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization

    Authors: Mathieu Ravaut, Shafiq Joty, Nancy F. Chen

    Abstract: Sequence-to-sequence neural networks have recently achieved great success in abstractive summarization, especially through fine-tuning large pre-trained language models on the downstream dataset. These models are typically decoded with beam search to generate a unique summary. However, the search space is very large, and with the exposure bias, such decoding is not optimal. In this paper, we show… ▽ More

    Submitted 26 May, 2023; v1 submitted 13 March, 2022; originally announced March 2022.

    Comments: 9 pages, 6 figures, 6 tables, 9 appendix pages, ACL 2022

  26. arXiv:2203.06486  [pdf, other

    cs.CL

    Chart-to-Text: A Large-Scale Benchmark for Chart Summarization

    Authors: Shankar Kantharaj, Rixie Tiffany Ko Leong, Xiang Lin, Ahmed Masry, Megh Thakkar, Enamul Hoque, Shafiq Joty

    Abstract: Charts are commonly used for exploring data and communicating insights. Generating natural language summaries from charts can be very helpful for people in inferring key insights that would otherwise require a lot of cognitive and perceptual efforts. We present Chart-to-text, a large-scale benchmark with two datasets and a total of 44,096 charts covering a wide range of topics and chart types. We… ▽ More

    Submitted 14 April, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022 Main Conference

  27. arXiv:2203.02135  [pdf, other

    cs.CL cs.LG

    Continual Few-shot Relation Learning via Embedding Space Regularization and Data Augmentation

    Authors: Chengwei Qin, Shafiq Joty

    Abstract: Existing continual relation learning (CRL) methods rely on plenty of labeled training data for learning a new task, which can be hard to acquire in real scenario as getting large and representative labeled data is often expensive and time-consuming. It is therefore necessary for the model to learn novel relational patterns with very few labeled data while avoiding catastrophic forgetting of previo… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  28. arXiv:2111.10962  [pdf, other

    cs.CL cs.AI

    Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples

    Authors: Linlin Liu, Xin Li, Ruidan He, Lidong Bing, Shafiq Joty, Luo Si

    Abstract: Knowledge-enhanced language representation learning has shown promising results across various knowledge-intensive NLP tasks. However, prior methods are limited in efficient utilization of multilingual knowledge graph (KG) data for language model (LM) pretraining. They often train LMs with KGs in indirect ways, relying on extra entity/relation embeddings to facilitate knowledge injection. In this… ▽ More

    Submitted 18 October, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

    Comments: Accepted by EMNLP 2022

  29. arXiv:2110.08545  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    A Unified Speaker Adaptation Approach for ASR

    Authors: Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq Joty, Eng Siong Chng, Bin Ma

    Abstract: Transformer models have been used in automatic speech recognition (ASR) successfully and yields state-of-the-art results. However, its performance is still affected by speaker mismatch between training and test data. Further finetuning a trained model with target speaker data is the most natural approach for adaptation, but it takes a lot of compute and may cause catastrophic forgetting to the exi… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted by EMNLP 2021

  30. arXiv:2110.07811  [pdf, other

    cs.CL cs.PL

    Cascaded Fast and Slow Models for Efficient Semantic Code Search

    Authors: Akhilesh Deepak Gotmare, Junnan Li, Shafiq Joty, Steven C. H. Hoi

    Abstract: The goal of natural language semantic code search is to retrieve a semantically relevant code snippet from a fixed set of candidates using a natural language query. Existing approaches are neither effective nor efficient enough towards a practical semantic code search system. In this paper, we propose an efficient and accurate semantic code search framework with cascaded fast and slow models, in w… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 12 pages

  31. arXiv:2110.07679  [pdf, other

    cs.CL cs.AI

    GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems

    Authors: Bosheng Ding, Junjie Hu, Lidong Bing, Sharifah Mahani Aljunied, Shafiq Joty, Luo Si, Chunyan Miao

    Abstract: Much recent progress in task-oriented dialogue (ToD) systems has been driven by available annotation data across multiple domains for training. Over the last few years, there has been a move towards data curation for multilingual ToD systems that are applicable to serve people speaking different languages. However, existing multilingual ToD datasets either have a limited coverage of languages due… ▽ More

    Submitted 1 April, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  32. arXiv:2110.07298  [pdf, other

    cs.CL

    LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5

    Authors: Chengwei Qin, Shafiq Joty

    Abstract: Existing approaches to lifelong language learning rely on plenty of labeled data for learning a new task, which is hard to obtain in most real scenarios. Considering that humans can continually learn new tasks from a handful of examples, we expect the models also to be able to generalize well on new few-shot tasks without forgetting the previous ones. In this work, we define this more challenging… ▽ More

    Submitted 30 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: ICLR 2022. Code is available at https://github.com/qcwthu/Lifelong-Fewshot-Language-Learning

  33. arXiv:2110.07198  [pdf, other

    cs.CL

    Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling

    Authors: Prathyusha Jwalapuram, Shafiq Joty, Xiang Lin

    Abstract: Given the claims of improved text generation quality across various pre-trained neural models, we consider the coherence evaluation of machine generated text to be one of the principal applications of coherence models that needs to be investigated. Prior work in neural coherence modeling has primarily focused on devising new architectures for solving the permuted document task. We instead use a ba… ▽ More

    Submitted 21 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted at ACL 2022

  34. arXiv:2109.00859  [pdf, other

    cs.CL cs.PL

    CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation

    Authors: Yue Wang, Weishi Wang, Shafiq Joty, Steven C. H. Hoi

    Abstract: Pre-trained models for Natural Languages (NL) like BERT and GPT have been recently shown to transfer well to Programming Languages (PL) and largely benefit a broad set of code-related tasks. Despite their success, most current methods either rely on an encoder-only (or decoder-only) pre-training that is suboptimal for generation (resp. understanding) tasks or process the code snippet in the same w… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021. 13 pages

  35. arXiv:2107.12579  [pdf, other

    cs.CV

    Remember What You have drawn: Semantic Image Manipulation with Memory

    Authors: Xiangxi Shi, Zhonghua Wu, Guosheng Lin, Jianfei Cai, Shafiq Joty

    Abstract: Image manipulation with natural language, which aims to manipulate images with the guidance of language descriptions, has been a challenging problem in the fields of computer vision and natural language processing (NLP). Currently, a number of efforts have been made for this task, but their performances are still distant away from generating realistic and text-conformed manipulated images. Therefo… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  36. arXiv:2107.07651  [pdf, other

    cs.CV cs.AI

    Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

    Authors: Junnan Li, Ramprasaath R. Selvaraju, Akhilesh Deepak Gotmare, Shafiq Joty, Caiming Xiong, Steven Hoi

    Abstract: Large-scale vision and language representation learning has shown promising improvements on various vision-language tasks. Most existing methods employ a transformer-based multimodal encoder to jointly model visual tokens (region-based image features) and word tokens. Because the visual tokens and word tokens are unaligned, it is challenging for the multimodal encoder to learn image-text interacti… ▽ More

    Submitted 7 October, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

  37. arXiv:2106.15760  [pdf, other

    cs.CL cs.AI

    A Conditional Splitting Framework for Efficient Constituency Parsing

    Authors: Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty, Xiaoli Li

    Abstract: We introduce a generic seq2seq parsing framework that casts constituency parsing problems (syntactic and discourse parsing) into a series of conditional splitting decisions. Our parsing model estimates the conditional probability distribution of possible splitting points in a given text span and supports efficient top-down decoding, which is linear in number of nodes. The conditional splitting for… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL2021

  38. arXiv:2106.07207  [pdf, other

    cs.CL

    Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation

    Authors: Xiang Lin, Simeng Han, Shafiq Joty

    Abstract: Advanced large-scale neural language models have led to significant success in many language generation tasks. However, the most commonly used training objective, Maximum Likelihood Estimation (MLE), has been shown problematic, where the trained model prefers using dull and repetitive phrases. In this work, we introduce ScaleGrad, a modification straight to the gradient of the loss function, to re… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  39. arXiv:2106.05141  [pdf, other

    cs.CL

    AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

    Authors: Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

    Abstract: The success of Neural Machine Translation (NMT) largely depends on the availability of large bitext training corpora. Due to the lack of such large corpora in low-resource language pairs, NMT systems often exhibit poor performance. Extra relevant monolingual data often helps, but acquiring it could be quite expensive, especially for low-resource languages. Moreover, domain mismatch between bitext… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: ACL-2021 accepted paper

  40. arXiv:2105.10861  [pdf, other

    cs.CL cs.AI

    RST Parsing from Scratch

    Authors: Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty, Xiaoli Li

    Abstract: We introduce a novel top-down end-to-end formulation of document-level discourse parsing in the Rhetorical Structure Theory (RST) framework. In this formulation, we consider discourse parsing as a sequence of splitting decisions at token boundaries and use a seq2seq network to model the splitting decisions. Our framework facilitates discourse parsing from scratch without requiring discourse segmen… ▽ More

    Submitted 23 May, 2021; originally announced May 2021.

    Comments: Accepted to NAACL 2021

  41. arXiv:2105.02590  [pdf, other

    cs.LG cs.AI cs.CL cs.CY cs.NE

    Reliability Testing for Natural Language Processing Systems

    Authors: Samson Tan, Shafiq Joty, Kathy Baxter, Araz Taeihagh, Gregory A. Bennett, Min-Yen Kan

    Abstract: Questions of fairness, robustness, and transparency are paramount to address before deploying NLP systems. Central to these concerns is the question of reliability: Can NLP systems reliably treat different demographics fairly and function correctly in diverse and noisy environments? To address this, we argue for the need for reliability testing and contextualize it among existing work on improving… ▽ More

    Submitted 31 May, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted to ACL-IJCNLP 2021 (main conference). Camera-ready version

  42. arXiv:2104.09810  [pdf, other

    cs.CL

    Addressing the Vulnerability of NMT in Input Perturbations

    Authors: Weiwen Xu, Ai Ti Aw, Yang Ding, Kui Wu, Shafiq Joty

    Abstract: Neural Machine Translation (NMT) has achieved significant breakthrough in performance but is known to suffer vulnerability to input perturbations. As real input noise is difficult to predict during training, robustness is a big issue for system deployment. In this paper, we improve the robustness of NMT models by reducing the effect of noisy words through a Context-Enhanced Reconstruction (CER) ap… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Accepted by NAACL 2021 Industry Track

  43. arXiv:2103.09593  [pdf, other

    cs.CL cs.AI cs.CY cs.LG cs.NE

    Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots

    Authors: Samson Tan, Shafiq Joty

    Abstract: Multilingual models have demonstrated impressive cross-lingual transfer performance. However, test sets like XNLI are monolingual at the example level. In multilingual communities, it is common for polyglots to code-mix when conversing with each other. Inspired by this phenomenon, we present two strong black-box adversarial attacks (one word-level, one phrase-level) for multilingual models that pu… ▽ More

    Submitted 4 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: To be presented at NAACL-HLT 2021. Abstract also published in the Rising Stars Track of the Workshop on Computational Approaches to Linguistic Code-Switching (CALCS 2021)

    Journal ref: 2021.naacl-main.282

  44. arXiv:2103.06459  [pdf, other

    cs.CL cs.AI

    Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings

    Authors: Linlin Liu, Thien Hai Nguyen, Shafiq Joty, Lidong Bing, Luo Si

    Abstract: Cross-lingual word embeddings (CLWE) have been proven useful in many cross-lingual tasks. However, most existing approaches to learn CLWE including the ones with contextual embeddings are sense agnostic. In this work, we propose a novel framework to align contextual embeddings at the sense level by leveraging cross-lingual signal from bilingual dictionaries only. We operationalize our framework by… ▽ More

    Submitted 15 September, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Accepted by COLING 2022

  45. arXiv:2103.00740  [pdf, other

    cs.DB

    Towards Enhancing Database Education: Natural Language Generation Meets Query Execution Plans

    Authors: Weiguo Wang, Sourav S Bhowmick, Hui Li, Shafiq R Joty, Siyuan Liu, Peng Chen

    Abstract: The database systems course is offered as part of an undergraduate computer science degree program in many major universities. A key learning goal of learners taking such a course is to understand how SQL queries are processed in a RDBMS in practice. Since a query execution plan (QEP) describes the execution steps of a query, learners can acquire the understanding by perusing the QEPs generated by… ▽ More

    Submitted 2 March, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    Comments: 16 pages, 10 figures, SIGMOD2021

  46. arXiv:2101.11802  [pdf, other

    cs.CL

    Weakly Supervised Neuro-Symbolic Module Networks for Numerical Reasoning

    Authors: Amrita Saha, Shafiq Joty, Steven C. H. Hoi

    Abstract: Neural Module Networks (NMNs) have been quite successful in incorporating explicit reasoning as learnable modules in various question answering tasks, including the most generic form of numerical reasoning over text in Machine Reading Comprehension (MRC). However, to achieve this, contemporary NMNs need strong supervision in executing the query as a specialized program over reasoning modules and f… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

  47. arXiv:2011.01549  [pdf, other

    cs.CL cs.AI

    DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging Tasks

    Authors: Bosheng Ding, Linlin Liu, Lidong Bing, Canasai Kruengkrai, Thien Hai Nguyen, Shafiq Joty, Luo Si, Chunyan Miao

    Abstract: Data augmentation techniques have been widely used to improve machine learning performance as they enhance the generalization capability of models. In this work, to generate high quality synthetic data for low-resource tagging tasks, we propose a novel augmentation method with language models trained on the linearized labeled sentences. Our method is applicable to both supervised and semi-supervis… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted by EMNLP 2020

  48. arXiv:2010.12836  [pdf, other

    cs.CL

    Improving Zero and Few-Shot Abstractive Summarization with Intermediate Fine-tuning and Data Augmentation

    Authors: Alexander R. Fabbri, Simeng Han, Haoyuan Li, Haoran Li, Marjan Ghazvininejad, Shafiq Joty, Dragomir Radev, Yashar Mehdad

    Abstract: Models pretrained with self-supervised objectives on large text corpora achieve state-of-the-art performance on English text summarization tasks. However, these models are typically fine-tuned on hundreds of thousands of data points, an infeasible requirement when applying summarization to new, niche domains. In this work, we introduce a novel and generalizable method, called WikiTransfer, for fin… ▽ More

    Submitted 11 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: NAACL 2021

  49. arXiv:2010.11080  [pdf, other

    cs.CL

    Online Conversation Disentanglement with Pointer Networks

    Authors: Tao Yu, Shafiq Joty

    Abstract: Huge amounts of textual conversations occur online every day, where multiple conversations take place concurrently. Interleaved conversations lead to difficulties in not only following the ongoing discussions but also extracting relevant information from simultaneous messages. Conversation disentanglement aims to separate intermingled messages into detached conversations. However, existing disenta… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  50. arXiv:2010.07785  [pdf, other

    cs.CL cs.AI

    Response Selection for Multi-Party Conversations with Dynamic Topic Tracking

    Authors: Weishi Wang, Shafiq Joty, Steven C. H. Hoi

    Abstract: While participants in a multi-party multi-turn conversation simultaneously engage in multiple conversation topics, existing response selection methods are developed mainly focusing on a two-party single-conversation scenario. Hence, the prolongation and transition of conversation topics are ignored by current methods. In this work, we frame response selection as a dynamic topic tracking task to ma… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 9 pages, EMNLP2020