Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Shin, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14739  [pdf, other

    cs.CL

    Learning to Retrieve Iteratively for In-Context Learning

    Authors: Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme

    Abstract: We introduce iterative retrieval, a novel framework that empowers retrievers to make iterative decisions through policy optimization. Finding an optimal portfolio of retrieved items is a combinatorial optimization problem, generally considered NP-hard. This approach provides a learned approximation to such a solution, meeting specific task requirements under a given family of large language models… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2311.10775  [pdf, other

    cs.CL cs.AI cs.LG

    ToolTalk: Evaluating Tool-Usage in a Conversational Setting

    Authors: Nicholas Farn, Richard Shin

    Abstract: Large language models (LLMs) have displayed massive improvements in reasoning and decision-making skills and can hold natural conversations with users. Many recent works seek to augment LLM-based assistants with external tools so they can access private or up-to-date information and carry out actions on behalf of users. To better measure the performance of these assistants, this paper introduces T… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, ICLR 2024 Submission, https://github.com/microsoft/ToolTalk

  3. arXiv:2309.11765  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

    Authors: Xinyu Tang, Richard Shin, Huseyin A. Inan, Andre Manoel, Fatemehsadat Mireshghallah, Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni, Robert Sim

    Abstract: We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that… ▽ More

    Submitted 27 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  4. arXiv:2212.10520  [pdf, other

    cs.CL

    Privacy-Preserving Domain Adaptation of Semantic Parsers

    Authors: Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner, Richard Shin

    Abstract: Task-oriented dialogue systems often assist users with personal or confidential matters. For this reason, the developers of such a system are generally prohibited from observing actual usage. So how can they know where the system is failing and needs more training data or new functionality? In this work, we study ways in which realistic user utterances can be generated synthetically, to help incre… ▽ More

    Submitted 8 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  5. SoK: Modeling Explainability in Security Analytics for Interpretability, Trustworthiness, and Usability

    Authors: Dipkamal Bhusal, Rosalyn Shin, Ajay Ashok Shewale, Monish Kumar Manikya Veerabhadran, Michael Clifford, Sara Rampazzi, Nidhi Rastogi

    Abstract: Interpretability, trustworthiness, and usability are key considerations in high-stake security applications, especially when utilizing deep learning models. While these models are known for their high accuracy, they behave as black boxes in which identifying important features and factors that led to a classification or a prediction is difficult. This can lead to uncertainty and distrust, especial… ▽ More

    Submitted 12 June, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 12 pages, 4 figures

  6. arXiv:2206.10668  [pdf, ps, other

    cs.CL

    BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing

    Authors: Subhro Roy, Sam Thomson, Tongfei Chen, Richard Shin, Adam Pauls, Jason Eisner, Benjamin Van Durme

    Abstract: Recent work has shown that generation from a prompted or fine-tuned language model can perform well at semantic parsing when the output is constrained to be a valid semantic representation. We introduce BenchCLAMP, a Benchmark to evaluate Constrained LAnguage Model Parsing, that includes context-free grammars for seven semantic parsing datasets and two syntactic parsing datasets with varied output… ▽ More

    Submitted 10 January, 2024; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  7. arXiv:2205.08675  [pdf, other

    cs.CL cs.AI

    Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

    Authors: Kevin Yang, Olivia Deng, Charles Chen, Richard Shin, Subhro Roy, Benjamin Van Durme

    Abstract: We introduce a novel setup for low-resource task-oriented semantic parsing which incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: ACL Findings 2022

  8. arXiv:2112.08696  [pdf, other

    cs.CL

    Few-Shot Semantic Parsing with Language Models Trained On Code

    Authors: Richard Shin, Benjamin Van Durme

    Abstract: Large language models can perform semantic parsing with little training data, when prompted with in-context examples. It has been shown that this can be improved by formulating the problem as paraphrasing into canonical utterances, which casts the underlying meaning representation into a controlled natural language-like representation. Intuitively, such models can more easily output canonical utte… ▽ More

    Submitted 29 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  9. arXiv:2112.05705  [pdf, other

    cs.CL cs.LG

    Pruning Pretrained Encoders with a Multitask Objective

    Authors: Patrick Xia, Richard Shin

    Abstract: The sizes of pretrained language models make them challenging and expensive to use when there are multiple desired downstream tasks. In this work, we adopt recent strategies for model pruning during finetuning to explore the question of whether it is possible to prune a single encoder so that it can be used for multiple tasks. We allocate a fixed parameter budget and compare pruning a single model… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: ENLSP NeurIPS 2021

  10. arXiv:2105.05601  [pdf, other

    cs.CL cs.LG

    OutFlip: Generating Out-of-Domain Samples for Unknown Intent Detection with Natural Language Attack

    Authors: DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: Out-of-domain (OOD) input detection is vital in a task-oriented dialogue system since the acceptance of unsupported inputs could lead to an incorrect response of the system. This paper proposes OutFlip, a method to generate out-of-domain samples using only in-domain training dataset automatically. A white-box natural language attack method HotFlip is revised to generate out-of-domain samples inste… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 9 pages, 3 figures; to be appear in ACL Findings of ACL-IJCNLP 2021

  11. arXiv:2104.08768  [pdf, other

    cs.CL

    Constrained Language Models Yield Few-Shot Semantic Parsers

    Authors: Richard Shin, Christopher H. Lin, Sam Thomson, Charles Chen, Subhro Roy, Emmanouil Antonios Platanios, Adam Pauls, Dan Klein, Jason Eisner, Benjamin Van Durme

    Abstract: We explore the use of large pretrained language models as few-shot semantic parsers. The goal in semantic parsing is to generate a structured meaning representation given a natural language input. However, language models are trained to generate natural language. To bridge the gap, we use language models to paraphrase inputs into a controlled sublanguage resembling English that can be automaticall… ▽ More

    Submitted 16 November, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021. Code is available at https://github.com/microsoft/semantic_parsing_with_constrained_lm

  12. Integrated Eojeol Embedding for Erroneous Sentence Classification in Korean Chatbots

    Authors: DongHyun Choi, IlNam Park, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: This paper attempts to analyze the Korean sentence classification system for a chatbot. Sentence classification is the task of classifying an input sentence based on predefined categories. However, spelling or space error contained in the input sentence causes problems in morphological analysis and tokenization. This paper proposes a novel approach of Integrated Eojeol (Korean syntactic word separ… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 9 pages, 2 figures

    Journal ref: IEEE Access, 2021

  13. arXiv:2004.03125  [pdf, other

    cs.CL

    RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

    Authors: DongHyun Choi, Myeong Cheol Shin, EungGyun Kim, Dong Ryeol Shin

    Abstract: Text-to-SQL is the problem of converting a user question into an SQL query, when the question and database are given. In this paper, we present a neural network approach called RYANSQL (Recursively Yielding Annotation Network for SQL) to solve complex Text-to-SQL tasks for cross-domain databases. State-ment Position Code (SPC) is defined to trans-form a nested SQL query into a set of non-nested SE… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 10 pages, 1 figure

  14. arXiv:1912.12612  [pdf, other

    cs.LG stat.ML

    Hierarchical Variational Imitation Learning of Control Programs

    Authors: Roy Fox, Richard Shin, William Paul, Yitian Zou, Dawn Song, Ken Goldberg, Pieter Abbeel, Ion Stoica

    Abstract: Autonomous agents can learn by imitating teacher demonstrations of the intended behavior. Hierarchical control policies are ubiquitously useful for such learning, having the potential to break down structured tasks into simpler sub-tasks, thereby improving data efficiency and generalization. In this paper, we propose a variational inference method for imitation learning of a control policy represe… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  15. arXiv:1912.12345  [pdf, other

    cs.LG cs.AI cs.PL stat.ML

    Synthetic Datasets for Neural Program Synthesis

    Authors: Richard Shin, Neel Kant, Kavi Gupta, Christopher Bender, Brandon Trabucco, Rishabh Singh, Dawn Song

    Abstract: The goal of program synthesis is to automatically generate programs in a particular language from corresponding specifications, e.g. input-output behavior. Many current approaches achieve impressive results after training on randomly generated I/O examples in limited domain-specific languages (DSLs), as with string transformations in RobustFill. However, we empirically discover that applying test… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: ICLR 2019

  16. arXiv:1911.04942  [pdf, other

    cs.CL cs.AI

    RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers

    Authors: Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov, Matthew Richardson

    Abstract: When translating natural language questions into SQL queries to answer questions from a database, contemporary semantic parsing models struggle to generalize to unseen database schemas. The generalization challenge lies in (a) encoding the database relations in an accessible way for the semantic parser, and (b) modeling alignment between database columns and their mentions in a given query. We pre… ▽ More

    Submitted 24 August, 2021; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Fix some errors of ACL 2020 camera-ready; 12 pages, 5 figures, 7 tables. arXiv admin note: text overlap with arXiv:1906.11790

  17. arXiv:1906.11790  [pdf, other

    cs.LG cs.CL stat.ML

    Encoding Database Schemas with Relation-Aware Self-Attention for Text-to-SQL Parsers

    Authors: Richard Shin

    Abstract: When translating natural language questions into SQL queries to answer questions from a database, we would like our methods to generalize to domains and database schemas outside of the training set. To handle complex questions and database schemas with a neural encoder-decoder paradigm, it is critical to properly encode the schema as part of the input with the question. In this paper, we use relat… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  18. arXiv:1906.10816  [pdf, other

    cs.LG cs.AI cs.CL cs.PL stat.ML

    Program Synthesis and Semantic Parsing with Learned Code Idioms

    Authors: Richard Shin, Miltiadis Allamanis, Marc Brockschmidt, Oleksandr Polozov

    Abstract: Program synthesis of general-purpose source code from natural language specifications is challenging due to the need to reason about high-level patterns in the target program and low-level implementation details at the same time. In this work, we present PATOIS, a system that allows a neural program synthesizer to explicitly interleave high-level and low-level reasoning at every generation step. I… ▽ More

    Submitted 4 November, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS) 2019. 13 pages total, 9 pages of main text

  19. arXiv:1704.06611  [pdf, other

    cs.LG cs.NE cs.PL

    Making Neural Programming Architectures Generalize via Recursion

    Authors: Jonathon Cai, Richard Shin, Dawn Song

    Abstract: Empirically, neural networks that attempt to learn programs from data have exhibited poor generalizability. Moreover, it has traditionally been difficult to reason about the behavior of these models beyond a certain level of input complexity. In order to address these issues, we propose augmenting neural architectures with a key abstraction: recursion. As an application, we implement recursion in… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: Published in ICLR 2017

  20. arXiv:1704.03118  [pdf, other

    cs.CR

    PIANO: Proximity-based User Authentication on Voice-Powered Internet-of-Things Devices

    Authors: Neil Zhenqiang Gong, Altay Ozen, Yu Wu, Xiaoyu Cao, Richard Shin, Dawn Song, Hongxia Jin, Xuan Bao

    Abstract: Voice is envisioned to be a popular way for humans to interact with Internet-of-Things (IoT) devices. We propose a proximity-based user authentication method (called PIANO) for access control on such voice-powered IoT devices. PIANO leverages the built-in speaker, microphone, and Bluetooth that voice-powered IoT devices often already have. Specifically, we assume that a user carries a personal voi… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: To appear in ICDCS'17

  21. arXiv:1611.01867  [pdf, other

    cs.CL

    Latent Attention For If-Then Program Synthesis

    Authors: Xinyun Chen, Chang Liu, Richard Shin, Dawn Song, Mingcheng Chen

    Abstract: Automatic translation from natural language descriptions into programs is a longstanding challenging problem. In this work, we consider a simple yet important sub-problem: translation from textual descriptions to If-Then programs. We devise a novel neural network architecture for this task which we train end-to-end. Specifically, we introduce Latent Attention, which computes multiplicative weights… ▽ More

    Submitted 6 November, 2016; originally announced November 2016.

    Comments: Accepted by NIPS 2016

  22. arXiv:1507.08309  [pdf, other

    cs.CR

    Exploring Privacy Preservation in Outsourced K-Nearest Neighbors with Multiple Data Owners

    Authors: Frank Li, Richard Shin, Vern Paxson

    Abstract: The k-nearest neighbors (k-NN) algorithm is a popular and effective classification algorithm. Due to its large storage and computational requirements, it is suitable for cloud outsourcing. However, k-NN is often run on sensitive data such as medical records, user images, or personal information. It is important to protect the privacy of data in an outsourced k-NN system. Prior works have all ass… ▽ More

    Submitted 29 July, 2015; originally announced July 2015.

  23. arXiv:1112.3265  [pdf, other

    cs.SI physics.soc-ph

    Jointly Predicting Links and Inferring Attributes using a Social-Attribute Network (SAN)

    Authors: Neil Zhenqiang Gong, Ameet Talwalkar, Lester Mackey, Ling Huang, Eui Chul Richard Shin, Emil Stefanov, Elaine, Shi, Dawn Song

    Abstract: The effects of social influence and homophily suggest that both network structure and node attribute information should inform the tasks of link prediction and node attribute inference. Recently, Yin et al. proposed Social-Attribute Network (SAN), an attribute-augmented social network, to integrate network structure and node attributes to perform both link prediction and attribute inference. They… ▽ More

    Submitted 22 June, 2012; v1 submitted 14 December, 2011; originally announced December 2011.

    Comments: 9 pages, 4 figures and 4 tables