Showing 1–26 of 26 results for author: Krishnaswamy, N

  1. arXiv:2406.12147

    cs.AI

    Metacognitive AI: Framework and the Case for a Neurosymbolic Approach

    Authors: Hua Wei, Paulo Shakarian, Christian Lebiere, Bruce Draper, Nikhil Krishnaswamy, Sergei Nirenburg

    Abstract: Metacognition is the concept of reasoning about an agent's own internal processes and was originally introduced in the field of developmental psychology. In this position paper, we examine the concept of applying metacognition to artificial intelligence. We introduce a framework for understanding metacognitive artificial intelligence (AI) that we call TRAP: transparency, reasoning, adaptation, and…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.08304

    cs.CL cs.AI q-bio.NC

    Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind

    Authors: Iris Oved, Nikhil Krishnaswamy, James Pustejovsky, Joshua Hartshorne

    Abstract: We offer philosophical motivations for a method we call Virtual World Cognitive Science (VW CogSci), in which researchers use virtual embodied agents that are embedded in virtual worlds to explore questions in the field of Cognitive Science. We focus on questions about mental and linguistic representation and the ways that such computational modeling can add rigor to philosophical thought experime…

    Submitted 14 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures, to appear at CogSci 2024

  3. arXiv:2404.08949

    cs.CL

    Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

    Authors: Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard, Nikhil Krishnaswamy

    Abstract: Event coreference resolution (ECR) is the task of determining whether distinct mentions of events within a multi-document corpus are actually linked to the same underlying occurrence. Images of the events can help facilitate resolution when language is ambiguous. Here, we propose a multimodal cross-document event coreference resolution method that integrates visual and textual cues with a simple l…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: To appear at LREC-COLING 2024

  4. arXiv:2404.03196

    cs.CL

    Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation

    Authors: Abhijnan Nath, Shadi Manafi, Avyakta Chelle, Nikhil Krishnaswamy

    Abstract: In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision of smaller student models for cross-document coreference (CDCR) of events. We implement novel rationa…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: To be published in NAACL 2024 Main

  5. arXiv:2403.20056

    cs.CL

    Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets

    Authors: Shadi Manafi, Nikhil Krishnaswamy

    Abstract: Multilingual Language Models (MLLMs) exhibit robust cross-lingual transfer capabilities, or the ability to leverage information acquired in a source language and apply it to a target language. These capabilities find practical applications in well-established Natural Language Processing (NLP) tasks such as Named Entity Recognition (NER). This study aims to investigate the effectiveness of a source…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  6. arXiv:2403.17284

    cs.CL

    Common Ground Tracking in Multimodal Dialogue

    Authors: Ibrahim Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard Brutti, Christopher Tam, Jingxuan Tu, Benjamin Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy, James Pustejovsky

    Abstract: Within Dialogue Modeling research in AI and NLP, considerable attention has been spent on "dialogue state tracking" (DST), which is the ability to update the representations of the speaker's needs at each turn in the dialogue by taking into account the past dialogue moves and history. Less studied but just as important to dialogue modeling, however, is "common ground tracking" (CGT), which ide…

    Submitted 25 March, 2024; originally announced March 2024.

  7. arXiv:2402.15654

    cs.CL

    Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics

    Authors: Sadaf Ghaffari, Nikhil Krishnaswamy

    Abstract: In this paper, we present an exploration of LLMs' abilities to problem solve with physical reasoning in situated environments. We construct a simple simulated environment and demonstrate examples of where, in a zero-shot setting, both text and multimodal LLMs display atomic world knowledge about various objects but fail to compose this knowledge in correct solutions for an object manipulation and…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 10 pages, 10 figures, Proceedings of AAAI Spring Symposium: Empowering Machine Learning and Large Language Models with Domain and Commonsense Knowledge (MAKE). AAAI (2024)

  8. arXiv:2306.05434

    cs.CL

    How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, Michael Regan, Adam Pollins, Nikhil Krishnaswamy, James H. Martin

    Abstract: Annotating cross-document event coreference links is a time-consuming and cognitively demanding task that can compromise annotation quality and efficiency. To address this, we propose a model-in-the-loop annotation approach for event coreference resolution, where a machine learning model suggests likely coreferring event pairs only. We evaluate the effectiveness of this approach by first simulating…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: The 17th Linguistic Annotation Workshop, 2023 (LAW-XVII) short paper. 10 pages, 6 figures, 1 table

  9. arXiv:2305.17350

    cs.CL

    How Good is Automatic Segmentation as a Multimodal Discourse Annotation Aid?

    Authors: Corbyn Terpstra, Ibrahim Khebour, Mariah Bradford, Brett Wisniewski, Nikhil Krishnaswamy, Nathaniel Blanchard

    Abstract: Collaborative problem solving (CPS) in teams is tightly coupled with the creation of shared meaning between participants in a situated, collaborative task. In this work, we assess the quality of different utterance segmentation techniques as an aid in annotating CPS. We (1) manually transcribe utterances in a dataset of triads collaboratively solving a problem involving dialogue and physical objec…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 7 pages, 1 figure, 2 tables, Proceedings of 19th Joint ISO-ACL Workshop on Interoperable Semantic Annotation (ISA 2023)

  10. arXiv:2305.13668

    cs.CL cs.LG

    Grounding and Distinguishing Conceptual Vocabulary Through Similarity Learning in Embodied Simulations

    Authors: Sadaf Ghaffari, Nikhil Krishnaswamy

    Abstract: We present a novel method for using agent experiences gathered through an embodied simulation to ground contextualized word vectors to object representations. We use similarity learning to make comparisons between different object types based on their properties when interacted with, and to extract common features pertaining to the objects' behavior. We then use an affine transformation to calcula…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at IWCS Conference

  11. arXiv:2305.13641

    cs.CL

    AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese

    Authors: Abhijnan Nath, Sheikh Mannan, Nikhil Krishnaswamy

    Abstract: Despite their successes in NLP, Transformer-based language models still require extensive computing resources and suffer in low-resource or low-compute settings. In this paper, we present AxomiyaBERTa, a novel BERT model for Assamese, a morphologically-rich low-resource language (LRL) of Eastern India. AxomiyaBERTa is trained only on the masked language modeling (MLM) task, without the typical add…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 16 pages, 6 figures, 8 tables, appearing in Findings of the ACL: ACL 2023. This version compiled using pdfLaTeX-compatible Assamese script font. Assamese text may appear differently here than in official ACL 2023 proceedings

  12. arXiv:2305.13076

    cs.CL

    An Abstract Specification of VoxML as an Annotation Language

    Authors: Kiyong Lee, Nikhil Krishnaswamy, James Pustejovsky

    Abstract: VoxML is a modeling language used to map natural language expressions into real-time visualizations using commonsense semantic knowledge of objects and events. Its utility has been demonstrated in embodied simulation environments and in agent-object interactions in situated multimodal human-agent collaboration and communication. It introduces the notion of object affordance (both Gibsonian and Tel…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, 4 figures, Proceedings of 19th Joint ISO-ACL Workshop on Interoperable Semantic Annotation (ISA 2023)

  13. arXiv:2305.05672

    cs.CL

    $2 * n$ is better than $n^2$: Decomposing Event Coreference Resolution into Two Tractable Problems

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, James H. Martin, Nikhil Krishnaswamy

    Abstract: Event Coreference Resolution (ECR) is the task of linking mentions of the same event either within or across documents. Most mention pairs are not coreferent, yet many that are coreferent can be identified through simple techniques such as lemma matching of the event triggers or the sentences in which they appear. Existing methods for training coreference systems sample from a largely skewed distr…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of the Association for Computational Linguistics, ACL 2023. 13 pages, 7 figures, 6 tables

  14. arXiv:2211.04555

    cs.LG cs.AI

    Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment

    Authors: Sadaf Ghaffari, Nikhil Krishnaswamy

    Abstract: In this paper, we present methods for two types of metacognitive tasks in an AI system: rapidly expanding a neural classification model to accommodate a new category of object, and recognizing when a novel object type is observed instead of misclassifying the observation as a known class. Our methods take numerical data drawn from an embodied simulation environment, which describes the motion and…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.08107

  15. arXiv:2204.08107

    cs.AI cs.CV cs.LG

    Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction

    Authors: Nikhil Krishnaswamy, Sadaf Ghaffari

    Abstract: In this paper we present a novel method for a naive agent to detect novel objects it encounters in an interaction. We train a reinforcement learning policy on a stacking task given a known object type, and then observe the results of the agent attempting to stack various other objects based on the same trained policy. By extracting embedding vectors from a convolutional neural net trained over the…

    Submitted 17 April, 2022; originally announced April 2022.

  16. arXiv:2012.02947

    cs.AI

    Neurosymbolic AI for Situated Language Understanding

    Authors: Nikhil Krishnaswamy, James Pustejovsky

    Abstract: In recent years, data-intensive AI, particularly the domain of natural language processing and understanding, has seen significant progress driven by the advent of large datasets and deep neural networks that have sidelined more classic AI approaches to the field. These systems can apparently demonstrate sophisticated linguistic understanding or generation capabilities, but often fail to transfer…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: 18 pages + refs, 16 figures, presented at the 8th Annual Conference on Advances in Cognitive Systems (ACS), 2020

  17. arXiv:2007.09053

    cs.RO cs.AI cs.CL

    Situated Multimodal Control of a Mobile Robot: Navigation through a Virtual Environment

    Authors: Katherine Krajovic, Nikhil Krishnaswamy, Nathaniel J. Dimick, R. Pito Salas, James Pustejovsky

    Abstract: We present a new interface for controlling a navigation robot in novel environments using coordinated gesture and language. We use a TurtleBot3 robot with a LIDAR and a camera, an embodied simulation of what the robot has encountered while exploring, and a cross-platform bridge facilitating generic communication. A human partner can deliver instructions to the robot using spoken English and gestur…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: 4 pages, 1 table, 4 figures, proceedings of RoboDIAL special session at SigDIAL 2020

  18. arXiv:2003.07385

    cs.CL cs.AI

    A Formal Analysis of Multimodal Referring Strategies Under Common Ground

    Authors: Nikhil Krishnaswamy, James Pustejovsky

    Abstract: In this paper, we present an analysis of computationally generated mixed-modality definite referring expressions using combinations of gesture and linguistic descriptions. In doing so, we expose some striking formal semantic properties of the interactions between gesture and language, conditioned on the introduction of content into the common ground between the (computational) speaker and (human)…

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 9 pages (incl refs), 7 figures, 3 tables, proceedings of LREC 2020 (postponed due to COVID-19)

  19. arXiv:1909.08161

    cs.HC cs.AI cs.RO

    Multimodal Continuation-style Architectures for Human-Robot Interaction

    Authors: Nikhil Krishnaswamy, James Pustejovsky

    Abstract: We present an architecture for integrating real-time, multimodal input into a computational agent's contextual model. Using a human-avatar interaction in a virtual world, we treat aligned gesture and speech as an ensemble where content may be communicated by either modality. With a modified nondeterministic pushdown automaton architecture, the computer system: (1) consumes input incrementally usin…

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: Advances in Cognitive Systems Cognitive Vision Workshop (2019), 8 pages, 5 figures

  20. arXiv:1902.01886

    cs.AI

    Situational Grounding within Multimodal Simulations

    Authors: James Pustejovsky, Nikhil Krishnaswamy

    Abstract: In this paper, we argue that simulation platforms enable a novel type of embodied spatial reasoning, one facilitated by a formal model of object and event semantics that renders the continuous quantitative search space of an open-world, real-time environment tractable. We provide examples for how a semantically-informed AI system can exploit the precise, numerical information provided by a game en…

    Submitted 5 February, 2019; originally announced February 2019.

    Comments: AAAI-19 Workshop on Games and Simulations for Artificial Intelligence

  21. arXiv:1811.11064

    cs.AI

    Combining Deep Learning and Qualitative Spatial Reasoning to Learn Complex Structures from Sparse Examples with Noise

    Authors: Nikhil Krishnaswamy, Scott Friedman, James Pustejovsky

    Abstract: Many modern machine learning approaches require vast amounts of training data to learn new concepts; conversely, human learning often requires few examples--sometimes only one--from which the learner can abstract structural concepts. We present a novel approach to introducing new spatial structures to an AI agent, combining deep learning over qualitative spatial relations with various heuristic se…

    Submitted 27 November, 2018; originally announced November 2018.

  22. arXiv:1810.00838

    cs.RO

    Multimodal Interactive Learning of Primitive Actions

    Authors: Tuan Do, Nikhil Krishnaswamy, Kyeongmin Rim, James Pustejovsky

    Abstract: We describe an ongoing project in learning to perform primitive actions from demonstrations using an interactive interface. In our previous work, we have used demonstrations captured from humans performing actions as training samples for a neural network-based trajectory model of actions to be performed by a computational agent in novel setups. We found that our original framework had some limitat…

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

    Report number: AI-HRI/2018/02

  23. arXiv:1610.01713

    cs.CL

    Generating Simulations of Motion Events from Verbal Descriptions

    Authors: James Pustejovsky, Nikhil Krishnaswamy

    Abstract: In this paper, we describe a computational model for motion events in natural language that maps from linguistic expressions, through a dynamic event interpretation, into three-dimensional temporal simulations in a model. Starting with the model from (Pustejovsky and Moszkowicz, 2011), we analyze motion events using temporally-traced Labelled Transition Systems. We model the distinction between pa…

    Submitted 5 October, 2016; originally announced October 2016.

    Comments: 11 pages, 5 figures, *SEM workshop, COLING 2014

  24. arXiv:1610.01508

    cs.CL

    VoxML: A Visualization Modeling Language

    Authors: James Pustejovsky, Nikhil Krishnaswamy

    Abstract: We present the specification for a modeling language, VoxML, which encodes semantic knowledge of real-world objects represented as three-dimensional models, and of events and attributes related to and enacted over these objects. VoxML is intended to overcome the limitations of existing 3D visual markup languages by allowing for the encoding of a broad range of semantic knowledge that can be exploi…

    Submitted 5 October, 2016; originally announced October 2016.

    Comments: 8 pages, 9 figures, proceedings of LREC 2016

  25. arXiv:1610.01247

    cs.CL cs.CV

    ECAT: Event Capture Annotation Tool

    Authors: Tuan Do, Nikhil Krishnaswamy, James Pustejovsky

    Abstract: This paper introduces the Event Capture Annotation Tool (ECAT), a user-friendly, open-source interface tool for annotating events and their participants in video, capable of extracting the 3D positions and orientations of objects in video captured by Microsoft's Kinect(R) hardware. The modeling language VoxML (Pustejovsky and Krishnaswamy, 2016) underlies ECAT's object, program, and attribute repr…

    Submitted 4 October, 2016; originally announced October 2016.

    Comments: 4 pages, 4 figures, ISA workshop 2015

  26. arXiv:1610.00602

    cs.CL

    Multimodal Semantic Simulations of Linguistically Underspecified Motion Events

    Authors: Nikhil Krishnaswamy, James Pustejovsky

    Abstract: In this paper, we describe a system for generating three-dimensional visual simulations of natural language motion expressions. We use a rich formal model of events and their participants to generate simulations that satisfy the minimal constraints entailed by the associated utterance, relying on semantic knowledge of physical objects and motion events. This paper outlines technical considerations…

    Submitted 3 October, 2016; originally announced October 2016.