Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Krone, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.04782  [pdf, other

    cs.CL cs.AI cs.LG

    Robustification of Multilingual Language Models to Real-world Noise in Crosslingual Zero-shot Settings with Robust Contrastive Pretraining

    Authors: Asa Cooper Stickland, Sailik Sengupta, Jason Krone, Saab Mansour, He He

    Abstract: Advances in neural modeling have achieved state-of-the-art (SOTA) results on public natural language processing (NLP) benchmarks, at times surpassing human performance. However, there is a gap between public benchmarks and real-world applications where noise, such as typographical or grammatical mistakes, is abundant and can result in degraded performance. Unfortunately, works which evaluate the r… ▽ More

    Submitted 10 February, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted and to be presented at the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

  2. arXiv:2204.07128  [pdf, other

    cs.CL

    Label Semantic Aware Pre-training for Few-shot Text Classification

    Authors: Aaron Mueller, Jason Krone, Salvatore Romeo, Saab Mansour, Elman Mansimov, Yi Zhang, Dan Roth

    Abstract: In text classification tasks, useful information is encoded in the label names. Label semantic aware systems have leveraged this information for improved text classification performance during fine-tuning and prediction. However, use of label-semantics during pre-training has not been extensively explored. We therefore propose Label Semantic Aware Pre-training (LSAP) to improve the generalization… ▽ More

    Submitted 29 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted at ACL 2022

  3. arXiv:2107.09840  [pdf, other

    cs.CL

    Soft Layer Selection with Meta-Learning for Zero-Shot Cross-Lingual Transfer

    Authors: Weijia Xu, Batool Haider, Jason Krone, Saab Mansour

    Abstract: Multilingual pre-trained contextual embedding models (Devlin et al., 2019) have achieved impressive performance on zero-shot cross-lingual transfer tasks. Finding the most effective fine-tuning strategy to fine-tune these models on high-resource languages so that it transfers well to the zero-shot languages is a non-trivial task. In this paper, we propose a novel meta-optimizer to soft-select whic… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: MetaNLP at ACL 2021

  4. arXiv:2104.07149  [pdf, other

    cs.CL cs.AI

    On the Robustness of Intent Classification and Slot Labeling in Goal-oriented Dialog Systems to Real-world Noise

    Authors: Sailik Sengupta, Jason Krone, Saab Mansour

    Abstract: Intent Classification (IC) and Slot Labeling (SL) models, which form the basis of dialogue systems, often encounter noisy data in real-word environments. In this work, we investigate how robust IC/SL models are to noisy data. We collect and publicly release a test-suite for seven common noise types found in production human-to-bot conversations (abbreviations, casing, misspellings, morphological v… ▽ More

    Submitted 1 November, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: To be presented at NLP for Conversational AI, EMNLP 2021

  5. arXiv:2101.05779  [pdf, other

    cs.LG cs.CL

    Structured Prediction as Translation between Augmented Natural Languages

    Authors: Giovanni Paolini, Ben Athiwaratkun, Jason Krone, Jie Ma, Alessandro Achille, Rishita Anubhai, Cicero Nogueira dos Santos, Bing Xiang, Stefano Soatto

    Abstract: We propose a new framework, Translation between Augmented Natural Languages (TANL), to solve many structured prediction language tasks including joint entity and relation extraction, nested named entity recognition, relation classification, semantic role labeling, event extraction, coreference resolution, and dialogue state tracking. Instead of tackling the problem by training task-specific discri… ▽ More

    Submitted 2 December, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Journal ref: International Conference on Learning Representations (ICLR) 2021

  6. arXiv:2012.07516  [pdf, other

    cs.CL

    Meta learning to classify intent and slot labels with noisy few shot examples

    Authors: Shang-Wen Li, Jason Krone, Shuyan Dong, Yi Zhang, Yaser Al-onaizan

    Abstract: Recently deep learning has dominated many machine learning areas, including spoken language understanding (SLU). However, deep learning models are notorious for being data-hungry, and the heavily optimized models are usually sensitive to the quality of the training examples provided and the consistency between training and inference conditions. To improve the performance of SLU models on tasks wit… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: accepted by IEEE Spoken Language Technology Workshop, 2021

  7. arXiv:2009.13272  [pdf, other

    cs.CL cs.LG stat.ML

    Augmented Natural Language for Generative Sequence Labeling

    Authors: Ben Athiwaratkun, Cicero Nogueira dos Santos, Jason Krone, Bing Xiang

    Abstract: We propose a generative framework for joint sequence labeling and sentence-level classification. Our model performs multiple sequence labeling tasks at once using a single, shared natural language output space. Unlike prior discriminative methods, our model naturally incorporates label semantics and shares knowledge across tasks. Our framework is general purpose, performing well on few-shot, low-r… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: To appear at EMNLP 2020

  8. arXiv:2004.10793  [pdf, other

    cs.CL

    Learning to Classify Intents and Slot Labels Given a Handful of Examples

    Authors: Jason Krone, Yi Zhang, Mona Diab

    Abstract: Intent classification (IC) and slot filling (SF) are core components in most goal-oriented dialogue systems. Current IC/SF models perform poorly when the number of training examples per class is small. We propose a new few-shot learning task, few-shot IC/SF, to study and improve the performance of IC and SF models on classes not seen at training time in ultra low resource scenarios. We establish a… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: 8 pages, 2 figures

  9. arXiv:1802.01034  [pdf, other

    cs.LG stat.ML

    Multi-task Learning for Continuous Control

    Authors: Himani Arora, Rajath Kumar, Jason Krone, Chong Li

    Abstract: Reliable and effective multi-task learning is a prerequisite for the development of robotic agents that can quickly learn to accomplish related, everyday tasks. However, in the reinforcement learning domain, multi-task learning has not exhibited the same level of success as in other domains, such as computer vision. In addition, most reinforcement learning research on multi-task learning has been… ▽ More

    Submitted 3 February, 2018; originally announced February 2018.

  10. A Typeful Integration of SQL into Curry

    Authors: Michael Hanus, Julia Krone

    Abstract: We present an extension of the declarative programming language Curry to support the access to data stored in relational databases via SQL. Since Curry is statically typed, our emphasis on this SQL integration is on type safety. Our extension respects the type system of Curry so that run-time errors due to ill-typed data are avoided. This is obtained by preprocessing SQL statements at compile ti… ▽ More

    Submitted 3 January, 2017; originally announced January 2017.

    Comments: In Proceedings WLP'15/'16/WFLP'16, arXiv:1701.00148

    Journal ref: EPTCS 234, 2017, pp. 104-119