Showing 1–26 of 26 results for author: Misra, K

  1. arXiv:2408.05086  [pdf, other]

    cs.CL cs.AI

    Generating novel experimental hypotheses from language models: A case study on cross-dative generalization

    Authors: Kanishka Misra, Najoung Kim

    Abstract: Neural network language models (LMs) have been shown to successfully capture complex linguistic knowledge. However, their utility for understanding language acquisition is still debated. We contribute to this debate by presenting a case study where we use LMs as simulated learners to derive novel experimental hypotheses to be tested with humans. We apply this paradigm to study cross-dative general…

    Submitted 9 August, 2024; originally announced August 2024.

  2. arXiv:2403.19827  [pdf, other]

    cs.CL

    Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs

    Authors: Kanishka Misra, Kyle Mahowald

    Abstract: Language models learn rare syntactic phenomena, but the extent to which this is attributable to generalization vs. memorization is a major open question. To that end, we iteratively trained transformer language models on systematically manipulated corpora which were human-scale in size, and then evaluated their learning of a rare grammatical phenomenon: the English Article+Adjective+Numeral+Noun (…

    Submitted 10 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Updated version

  3. arXiv:2401.06640  [pdf, other]

    cs.CL cs.AI

    Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently

    Authors: Kanishka Misra, Allyson Ettinger, Kyle Mahowald

    Abstract: Recent zero-shot evaluations have highlighted important limitations in the abilities of language models (LMs) to perform meaning extraction. However, it is now well known that LMs can demonstrate radical improvements in the presence of experimental contexts such as in-context examples and instructions. How well does this translate to previously studied meaning-sensitive tasks? We present a case-st…

    Submitted 12 January, 2024; originally announced January 2024.

  4. arXiv:2312.03708  [pdf, other]

    cs.CL cs.AI

    Abstraction via exemplars? A representational case study on lexical category inference in BERT

    Authors: Kanishka Misra, Najoung Kim

    Abstract: Exemplar based accounts are often considered to be in direct opposition to pure linguistic abstraction in explaining language learners' ability to generalize to novel expressions. However, the recent success of neural network language models on linguistically sensitive tasks suggests that perhaps abstractions can arise via the encoding of exemplars. We provide empirical evidence for this claim by…

    Submitted 3 November, 2023; originally announced December 2023.

    Comments: 2-page abstract, to appear in BUCLD48

  5. arXiv:2310.18736  [pdf, other]

    cs.GT econ.TH

    A Gale-Shapley View of Unique Stable Marriages

    Authors: Kartik Gokhale, Amit Kumar Mallik, Ankit Kumar Misra, Swaprava Nath

    Abstract: Stable marriage of a two-sided market with unit demand is a classic problem that arises in many real-world scenarios. In addition, a unique stable marriage in this market simplifies a host of downstream desiderata. In this paper, we explore a new set of sufficient conditions for unique stable matching (USM) under this setup. Unlike other approaches that also address this question using the structu…

    Submitted 2 August, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 20 pages, 1 figure, In Proceedings, ECAI 2024

  6. arXiv:2306.04009  [pdf, other]

    cs.CL cs.AI

    Triggering Multi-Hop Reasoning for Question Answering in Language Models using Soft Prompts and Random Walks

    Authors: Kanishka Misra, Cicero Nogueira dos Santos, Siamak Shakeri

    Abstract: Despite readily memorizing world knowledge about entities, pre-trained language models (LMs) struggle to compose together two or more facts to perform multi-hop reasoning in question-answering tasks. In this work, we propose techniques that improve upon this limitation by relying on random walks over structured knowledge graphs. Specifically, we use soft prompts to guide LMs to chain together thei…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023

  7. arXiv:2302.00093  [pdf, other]

    cs.CL cs.AI

    Large Language Models Can Be Easily Distracted by Irrelevant Context

    Authors: Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou

    Abstract: Large language models have achieved impressive performance on various natural language processing tasks. However, so far they have been evaluated primarily on benchmarks where all information in the input context is relevant for solving the task. In this work, we investigate the distractibility of large language models, i.e., how the model problem-solving accuracy can be influenced by irrelevant c…

    Submitted 6 June, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Published in ICML 2023

  8. arXiv:2212.08979  [pdf, other]

    cs.CL cs.LG

    Language model acceptability judgements are not always robust to context

    Authors: Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes, Roger Levy, Adina Williams

    Abstract: Targeted syntactic evaluations of language models ask whether models show stable preferences for syntactically acceptable content over minimal-pair unacceptable inputs. Most targeted syntactic evaluation datasets ask models to make these judgements with just a single context-free sentence as input. This does not match language models' training regime, in which input sentences are always highly con…

    Submitted 17 December, 2022; originally announced December 2022.

  9. arXiv:2210.01963  [pdf, other]

    cs.CL cs.AI

    COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models

    Authors: Kanishka Misra, Julia Taylor Rayz, Allyson Ettinger

    Abstract: A characteristic feature of human semantic cognition is its ability to not only store and retrieve the properties of concepts observed through experience, but to also facilitate the inheritance of properties (can breathe) from superordinate concepts (animal) to their subordinates (dog) -- i.e. demonstrate property inheritance. In this paper, we present COMPS, a collection of minimal pair sentences…

    Submitted 8 February, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: EACL 2023 Camera Ready version. Code can be found at https://github.com/kanishkamisra/comps

  10. arXiv:2205.06910  [pdf, other]

    cs.CL

    A Property Induction Framework for Neural Language Models

    Authors: Kanishka Misra, Julia Taylor Rayz, Allyson Ettinger

    Abstract: To what extent can experience from language contribute to our conceptual knowledge? Computational explorations of this question have shed light on the ability of powerful neural language models (LMs) -- informed solely through text input -- to encode and elicit information about concepts and properties. To extend this line of research, we present a framework that uses neural-network language model…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: CogSci 2022 camera ready version, with hyperref-compatible citations. Code and Supplemental Material can be found in https://github.com/kanishkamisra/lm-induction

  11. arXiv:2203.13112  [pdf, other]

    cs.CL

    minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language Models

    Authors: Kanishka Misra

    Abstract: We present minicons, an open source library that provides a standard API for researchers interested in conducting behavioral and representational analyses of transformer-based language models (LMs). Specifically, minicons enables researchers to apply analysis methods at two levels: (1) at the prediction level -- by providing functions to efficiently extract word/sentence level probabilities; and (…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: To be submitted; Code to reproduce experiments can be found on https://github.com/kanishkamisra/minicons-experiments

  12. arXiv:2203.12606  [pdf]

    q-fin.GN cs.CR

    Journey of Cryptocurrency in India In View of Financial Budget 2022-23

    Authors: Varun Shukla, Manoj Kumar Misra, Atul Chaturvedi

    Abstract: Recently, Indian Finance minister Nirmala Sitharaman announced in Union budget 2022-23 that Indian government will put 30% tax (the highest tax slab in India) on income generated from cryptocurrencies. Big financial institutions, experts and academicians have different opinions in this regard. They claim that it would be the end of cryptocurrency market in India or it would be possible that RBI (R…

    Submitted 4 February, 2022; originally announced March 2022.

  13. arXiv:2111.02603  [pdf, ps, other]

    cs.CL cs.AI

    On Semantic Cognition, Inductive Generalization, and Language Models

    Authors: Kanishka Misra

    Abstract: My doctoral research focuses on understanding semantic knowledge in neural network models trained solely to predict natural language (referred to as language models, or LMs), by drawing on insights from the study of concepts and categories grounded in cognitive science. I propose a framework inspired by 'inductive reasoning,' a phenomenon that sheds light on how humans utilize background knowledge…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: Accepted at AAAI 2022 Doctoral Consortium

  14. arXiv:2105.02987  [pdf, other]

    cs.CL

    Do language models learn typicality judgments from text?

    Authors: Kanishka Misra, Allyson Ettinger, Julia Taylor Rayz

    Abstract: Building on research arguing for the possibility of conceptual and categorical knowledge acquisition through statistics contained in language, we evaluate predictive language models (LMs) -- informed solely by textual input -- on a prevalent phenomenon in cognitive science: typicality. Inspired by experiments that involve language processing and show robust typicality effects in humans, we propose…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted as a talk to CogSci 2021

  15. arXiv:2104.10813  [pdf, other]

    cs.CL

    Finding Fuzziness in Neural Network Models of Language Processing

    Authors: Kanishka Misra, Julia Taylor Rayz

    Abstract: Humans often communicate by using imprecise language, suggesting that fuzzy concepts with unclear boundaries are prevalent in language use. In this paper, we test the extent to which models trained to capture the distributional statistics of language show correspondence to fuzzy-membership patterns. Using the task of natural language inference, we test a recent state of the art model on the classi…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: To appear at NAFIPS 2021

  16. arXiv:2101.07397  [pdf, ps, other]

    cs.CL

    Exploring Lexical Irregularities in Hypothesis-Only Models of Natural Language Inference

    Authors: Qingyuan Hu, Yi Zhang, Kanishka Misra, Julia Rayz

    Abstract: Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE) is the task of predicting the entailment relation between a pair of sentences (premise and hypothesis). This task has been described as a valuable testing ground for the development of semantic representations, and is a key component in natural language understanding evaluation benchmarks. Models that understand entailment sh…

    Submitted 21 January, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted by 2020 IEEE 19th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC). IEEE

  17. arXiv:2101.01693  [pdf, other]

    cs.CY

    COVID-19 Tests Gone Rogue: Privacy, Efficacy, Mismanagement and Misunderstandings

    Authors: Manuel Morales, Rachel Barbar, Darshan Gandhi, Sanskruti Landage, Joseph Bae, Arpita Vats, Jil Kothari, Sheshank Shankar, Rohan Sukumaran, Himi Mathur, Krutika Misra, Aishwarya Saxena, Parth Patwa, Sethuraman T. V., Maurizio Arseni, Shailesh Advani, Kasia Jakimowicz, Sunaina Anand, Priyanshi Katiyar, Ashley Mehra, Rohan Iyer, Srinidhi Murali, Aryan Mahindra, Mikhail Dmitrienko, Saurish Srivastava, et al. (5 additional authors not shown)

    Abstract: COVID-19 testing, the cornerstone for effective screening and identification of COVID-19 cases, remains paramount as an intervention tool to curb the spread of COVID-19 both at local and national levels. However, the speed at which the pandemic struck and the response was rolled out, the widespread impact on healthcare infrastructure, the lack of sufficient preparation within the public health sys…

    Submitted 7 May, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: 22 pages, 2 figures

  18. arXiv:2012.01772  [pdf, other]

    cs.CY

    Digital Landscape of COVID-19 Testing: Challenges and Opportunities

    Authors: Darshan Gandhi, Rohan Sukumaran, Priyanshi Katiyar, Alex Radunsky, Sunaina Anand, Shailesh Advani, Jil Kothari, Kasia Jakimowicz, Sheshank Shankar, Sethuraman T. V., Krutika Misra, Aishwarya Saxena, Sanskruti Landage, Richa Sonker, Parth Patwa, Aryan Mahindra, Mikhail Dmitrienko, Kanishka Vaish, Ashley Mehra, Srinidhi Murali, Rohan Iyer, Joseph Bae, Vivek Sharma, Abhishek Singh, Rachel Barbar, et al. (1 additional author not shown)

    Abstract: The COVID-19 Pandemic has left a devastating trail all over the world, in terms of loss of lives, economic decline, travel restrictions, trade deficit, and collapsing economy including real-estate, job loss, loss of health benefits, the decline in quality of access to care and services and overall quality of life. Immunization from the anticipated vaccines will not be the stand-alone guideline tha…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 28 pages, 4 figures

  19. Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming

    Authors: Kanishka Misra, Allyson Ettinger, Julia Taylor Rayz

    Abstract: Models trained to estimate word probabilities in context have become ubiquitous in natural language processing. How do these models use lexical cues in context to inform their word probabilities? To answer this question, we present a case study analyzing the pre-trained BERT model with tests informed by semantic priming. Using English lexical stimuli that show priming in humans, we find that BERT…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted for publication in Findings of ACL: EMNLP 2020

  20. arXiv:2010.01666  [pdf, other]

    cs.IR cs.LG

    Multi-Modal Retrieval using Graph Neural Networks

    Authors: Aashish Kumar Misraa, Ajinkya Kale, Pranav Aggarwal, Ali Aminian

    Abstract: Most real world applications of image retrieval such as Adobe Stock, which is a marketplace for stock photography and illustrations, need a way for users to find images which are both visually (i.e. aesthetically) and conceptually (i.e. containing the same salient objects) as a query image. Learning visual-semantic representations from images is a well studied problem for image retrieval. Filterin…

    Submitted 4 October, 2020; originally announced October 2020.

  21. arXiv:1809.00367  [pdf, ps, other]

    cs.RO eess.SY

    Momentum Model-based Minimal Parameter Identification of a Space Robot

    Authors: B. Naveen, Suril V. Shah, Arun K. Misra

    Abstract: Accurate information of inertial parameters is critical to motion planning and control of space robots. Before the launch, only a rudimentary estimate of the inertial parameters is available from experiments and computer-aided design (CAD) models. After the launch, on-orbit operations substantially alter the value of inertial parameters. In this work, we propose a new momentum model-based method f…

    Submitted 2 September, 2018; originally announced September 2018.

    Comments: Accepted for publication in AIAA Journal of Guidance, Control, and Dynamics

  22. arXiv:1509.04618  [pdf]

    cs.AR

    Cost Efficient Design of Reversible Adder Circuits for Low Power Applications

    Authors: Neeraj Kumar Misra, Mukesh Kumar Kushwaha, Subodh Wairya, Amit Kumar

    Abstract: A large amount of research is currently going on in the field of reversible logic, which have low heat dissipation, low power consumption, which is the main factor to apply reversible in digital VLSI circuit design. This paper introduces reversible gate named as Inventive0 gate. The novel gate is synthesis the efficient adder modules with minimum garbage output and gate count. The Inventive0 gate…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: 9 pages, 12 figures, journal

  23. Evolution of structure of some binary group based n bit comparator, n-to-2n decoder by reversible technique

    Authors: Neeraj Kumar Misra, Subodh Wairya, Vinod Kumar Singh

    Abstract: Reversible logic has attracted substantial interest due to its low power consumption which is the main concern of low power VLSI circuit design. In this paper, a novel 4x4 reversible gate called inventive gate has been introduced and using this gate 1-bit, 2-bit, 8-bit, 32-bit and n-bit group-based reversible comparator have been constructed with low value of reversible parameters. The MOS transis…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: 22 pages, 19 figure, journal

    Journal ref: International Journal of VLSI design & Communication Systems (VLSICS) Vol.5, No.5, October 2014

  24. Feasible methodology for optimization of a novel reversible binary compressor

    Authors: Neeraj Kumar Misra, Mukesh Kumar Kushwaha, Subodh Wairya, Amit Kumar

    Abstract: Now a day reversible logic is an attractive research area due to its low power consumption in the area of VLSI circuit design. The reversible logic gate is utilized to optimize power consumption by a feature of retrieving input logic from an output logic because of bijective mapping between input and output. In this manuscript, we design 4:2 and 5:2 reversible compressor circuits using a new type…

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: 13 pages, 9 figures

    Journal ref: International Journal of VLSI design & Communication Systems (VLSICS) Vol.6, No.4, August 2015

  25. arXiv:1412.0691  [pdf, other]

    cs.AI cs.RO

    RoboBrain: Large-Scale Knowledge Engine for Robots

    Authors: Ashutosh Saxena, Ashesh Jain, Ozan Sener, Aditya Jami, Dipendra K. Misra, Hema S. Koppula

    Abstract: In this paper we introduce a knowledge engine, which learns and shares knowledge representations, for robots to carry out a variety of tasks. Building such an engine brings with it the challenge of dealing with multiple data modalities including symbols, natural language, haptic senses, robot trajectories, visual features and many others. The knowledge stored in the engine comes from mult…

    Submitted 12 April, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: 10 pages, 9 figures

  26. arXiv:1001.2270  [pdf]

    cs.DB cs.IR

    An Improved Approach to High Level Privacy Preserving Itemset Mining

    Authors: Rajesh Kumar Boora, Ruchi Shukla, A. K. Misra

    Abstract: Privacy preserving association rule mining has triggered the development of many privacy preserving data mining techniques. A large fraction of them use randomized data distortion techniques to mask the data for preserving. This paper proposes a new transaction randomization method which is a combination of the fake transaction randomization method and a new per transaction randomization method.…

    Submitted 13 January, 2010; originally announced January 2010.

    Comments: 8 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS December 2009, ISSN 1947 5500, http://sites.google.com/site/ijcsis/

    Report number: Volume 6, No. 3, ISSN 1947 5500

    Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 6, No. 3, pp. 216-223, December 2009, USA