Search | arXiv e-print repository

arXiv:2005.08516 [pdf, other]

Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks

Authors: Swaroop Mishra, Arindam Mitra, Neeraj Varshney, Bhavdeep Sachdeva, Chitta Baral

Abstract: Numerical reasoning is often important to accurately understand the world. Recently, several format-specific datasets have been proposed, such as numerical reasoning in the settings of Natural Language Inference (NLI), Reading Comprehension (RC), and Question Answering (QA). Several format-specific models and architectures in response to those datasets have also been proposed. However, there exist… ▽ More Numerical reasoning is often important to accurately understand the world. Recently, several format-specific datasets have been proposed, such as numerical reasoning in the settings of Natural Language Inference (NLI), Reading Comprehension (RC), and Question Answering (QA). Several format-specific models and architectures in response to those datasets have also been proposed. However, there exists a strong need for a benchmark which can evaluate the abilities of models, in performing question format independent numerical reasoning, as (i) the numerical reasoning capabilities we want to teach are not controlled by question formats, (ii) for numerical reasoning technology to have the best possible application, it must be able to process language and reason in a way that is not exclusive to a single format, task, dataset or domain. In pursuit of this goal, we introduce NUMBERGAME, a multifaceted benchmark to evaluate model performance across numerical reasoning tasks of eight diverse formats. We add four existing question types in our compilation. Two of the new types we add are about questions that require external numerical knowledge, commonsense knowledge and domain knowledge. For building a more practical numerical reasoning system, NUMBERGAME demands four capabilities beyond numerical reasoning: (i) detecting question format directly from data (ii) finding intermediate common format to which every format can be converted (iii) incorporating commonsense knowledge (iv) handling data imbalance across formats. We build several baselines, including a new model based on knowledge hunting using a cheatsheet. However, all baselines perform poorly in contrast to the human baselines, indicating the hardness of our benchmark. Our work takes forward the recent progress in generic system development, demonstrating the scope of these under-explored tasks. △ Less

Submitted 18 May, 2020; originally announced May 2020.

Comments: 10 pages

arXiv:2005.00816 [pdf, other]

DQI: Measuring Data Quality in NLP

Authors: Swaroop Mishra, Anjana Arunkumar, Bhavdeep Sachdeva, Chris Bryan, Chitta Baral

Abstract: Neural language models have achieved human level performance across several NLP datasets. However, recent studies have shown that these models are not truly learning the desired task; rather, their high performance is attributed to overfitting using spurious biases, which suggests that the capabilities of AI systems have been over-estimated. We introduce a generic formula for Data Quality Index (D… ▽ More Neural language models have achieved human level performance across several NLP datasets. However, recent studies have shown that these models are not truly learning the desired task; rather, their high performance is attributed to overfitting using spurious biases, which suggests that the capabilities of AI systems have been over-estimated. We introduce a generic formula for Data Quality Index (DQI) to help dataset creators create datasets free of such unwanted biases. We evaluate this formula using a recently proposed approach for adversarial filtering, AFLite. We propose a new data creation paradigm using DQI to create higher quality data. The data creation paradigm consists of several data visualizations to help data creators (i) understand the quality of data and (ii) visualize the impact of the created data instance on the overall quality. It also has a couple of automation methods to (i) assist data creators and (ii) make the model more robust to adversarial attacks. We use DQI along with these automation methods to renovate biased examples in SNLI. We show that models trained on the renovated SNLI dataset generalize better to out of distribution tasks. Renovation results in reduced model performance, exposing a large gap with respect to human performance. DQI systematically helps in creating harder benchmarks using active learning. Our work takes the process of dynamic dataset creation forward, wherein datasets evolve together with the evolving state of the art, therefore serving as a means of benchmarking the true progress of AI. △ Less

Submitted 2 May, 2020; originally announced May 2020.

Comments: 63 pages

arXiv:2005.00330 [pdf, other]

Visuo-Linguistic Question Answering (VLQA) Challenge

Authors: Shailaja Keyur Sampat, Yezhou Yang, Chitta Baral

Abstract: Understanding images and text together is an important aspect of cognition and building advanced Artificial Intelligence (AI) systems. As a community, we have achieved good benchmarks over language and vision domains separately, however joint reasoning is still a challenge for state-of-the-art computer vision and natural language processing (NLP) systems. We propose a novel task to derive joint in… ▽ More Understanding images and text together is an important aspect of cognition and building advanced Artificial Intelligence (AI) systems. As a community, we have achieved good benchmarks over language and vision domains separately, however joint reasoning is still a challenge for state-of-the-art computer vision and natural language processing (NLP) systems. We propose a novel task to derive joint inference about a given image-text modality and compile the Visuo-Linguistic Question Answering (VLQA) challenge corpus in a question answering setting. Each dataset item consists of an image and a reading passage, where questions are designed to combine both visual and textual information i.e., ignoring either modality would make the question unanswerable. We first explore the best existing vision-language architectures to solve VLQA subsets and show that they are unable to reason well. We then develop a modular method with slightly better baseline performance, but it is still far behind human performance. We believe that VLQA will be a good benchmark for reasoning over a visuo-linguistic context. The dataset, code and leaderboard is available at https://shailaja183.github.io/vlqa/. △ Less

Submitted 18 November, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

Comments: Findings of EMNLP 2020 (22 pages, 13 figures)

arXiv:2005.00316 [pdf, other]

Self-supervised Knowledge Triplet Learning for Zero-shot Question Answering

Authors: Pratyay Banerjee, Chitta Baral

Abstract: The aim of all Question Answering (QA) systems is to be able to generalize to unseen questions. Current supervised methods are reliant on expensive data annotation. Moreover, such annotations can introduce unintended annotator bias which makes systems focus more on the bias than the actual task. In this work, we propose Knowledge Triplet Learning (KTL), a self-supervised task over knowledge graphs… ▽ More The aim of all Question Answering (QA) systems is to be able to generalize to unseen questions. Current supervised methods are reliant on expensive data annotation. Moreover, such annotations can introduce unintended annotator bias which makes systems focus more on the bias than the actual task. In this work, we propose Knowledge Triplet Learning (KTL), a self-supervised task over knowledge graphs. We propose heuristics to create synthetic graphs for commonsense and scientific knowledge. We propose methods of how to use KTL to perform zero-shot QA and our experiments show considerable improvements over large pre-trained transformer models. △ Less

Submitted 17 September, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

Comments: Accepted to EMNLP 2020 Long Papers

arXiv:2004.03101 [pdf, other]

Knowledge Fusion and Semantic Knowledge Ranking for Open Domain Question Answering

Authors: Pratyay Banerjee, Chitta Baral

Abstract: Open Domain Question Answering requires systems to retrieve external knowledge and perform multi-hop reasoning by composing knowledge spread over multiple sentences. In the recently introduced open domain question answering challenge datasets, QASC and OpenBookQA, we need to perform retrieval of facts and compose facts to correctly answer questions. In our work, we learn a semantic knowledge ranki… ▽ More Open Domain Question Answering requires systems to retrieve external knowledge and perform multi-hop reasoning by composing knowledge spread over multiple sentences. In the recently introduced open domain question answering challenge datasets, QASC and OpenBookQA, we need to perform retrieval of facts and compose facts to correctly answer questions. In our work, we learn a semantic knowledge ranking model to re-rank knowledge retrieved through Lucene based information retrieval systems. We further propose a "knowledge fusion model" which leverages knowledge in BERT-based language models with externally retrieved knowledge and improves the knowledge understanding of the BERT-based language models. On both OpenBookQA and QASC datasets, the knowledge fusion model with semantically re-ranked knowledge outperforms previous attempts. △ Less

Submitted 17 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

Comments: 9 pages. 4 figures, 4 tables

arXiv:2003.05162 [pdf, other]

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

Authors: Zhiyuan Fang, Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang

Abstract: Captioning is a crucial and challenging task for video understanding. In videos that involve active agents such as humans, the agent's actions can bring about myriad changes in the scene. Observable changes such as movements, manipulations, and transformations of the objects in the scene, are reflected in conventional video captioning. Unlike images, actions in videos are also inherently linked to… ▽ More Captioning is a crucial and challenging task for video understanding. In videos that involve active agents such as humans, the agent's actions can bring about myriad changes in the scene. Observable changes such as movements, manipulations, and transformations of the objects in the scene, are reflected in conventional video captioning. Unlike images, actions in videos are also inherently linked to social aspects such as intentions (why the action is taking place), effects (what changes due to the action), and attributes that describe the agent. Thus for video understanding, such as when captioning videos or when answering questions about videos, one must have an understanding of these commonsense aspects. We present the first work on generating commonsense captions directly from videos, to describe latent aspects such as intentions, effects, and attributes. We present a new dataset "Video-to-Commonsense (V2C)" that contains $\sim9k$ videos of human agents performing various actions, annotated with 3 types of commonsense descriptions. Additionally we explore the use of open-ended video-based commonsense question answering (V2C-QA) as a way to enrich our captions. Both the generation task and the QA task can be used to enrich video captions. △ Less

Submitted 7 January, 2023; v1 submitted 11 March, 2020; originally announced March 2020.

Comments: EMNLP 2020. V2C Website: https://asu-apg.github.io/Video2Commonsense/

arXiv:2003.03446 [pdf, other]

Natural Language QA Approaches using Reasoning with External Knowledge

Authors: Chitta Baral, Pratyay Banerjee, Kuntal Kumar Pal, Arindam Mitra

Abstract: Question answering (QA) in natural language (NL) has been an important aspect of AI from its early days. Winograd's ``councilmen'' example in his 1972 paper and McCarthy's Mr. Hug example of 1976 highlights the role of external knowledge in NL understanding. While Machine Learning has been the go-to approach in NL processing as well as NL question answering (NLQA) for the last 30 years, recently t… ▽ More Question answering (QA) in natural language (NL) has been an important aspect of AI from its early days. Winograd's ``councilmen'' example in his 1972 paper and McCarthy's Mr. Hug example of 1976 highlights the role of external knowledge in NL understanding. While Machine Learning has been the go-to approach in NL processing as well as NL question answering (NLQA) for the last 30 years, recently there has been an increasingly emphasized thread on NLQA where external knowledge plays an important role. The challenges inspired by Winograd's councilmen example, and recent developments such as the Rebooting AI book, various NLQA datasets, research on knowledge acquisition in the NLQA context, and their use in various NLQA models have brought the issue of NLQA using ``reasoning'' with external knowledge to the forefront. In this paper, we present a survey of the recent work on them. We believe our survey will help establish a bridge between multiple fields of AI, especially between (a) the traditional fields of knowledge representation and reasoning and (b) the field of NL understanding and NLQA. △ Less

Submitted 6 March, 2020; originally announced March 2020.

Comments: 6 pages, 3 figures, Work in Progress

arXiv:2002.08325 [pdf, other]

VQA-LOL: Visual Question Answering under the Lens of Logic

Authors: Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang

Abstract: Logical connectives and their implications on the meaning of a natural language sentence are a fundamental aspect of understanding. In this paper, we investigate whether visual question answering (VQA) systems trained to answer a question about an image, are able to answer the logical composition of multiple such questions. When put under this \textit{Lens of Logic}, state-of-the-art VQA models ha… ▽ More Logical connectives and their implications on the meaning of a natural language sentence are a fundamental aspect of understanding. In this paper, we investigate whether visual question answering (VQA) systems trained to answer a question about an image, are able to answer the logical composition of multiple such questions. When put under this \textit{Lens of Logic}, state-of-the-art VQA models have difficulty in correctly answering these logically composed questions. We construct an augmentation of the VQA dataset as a benchmark, with questions containing logical compositions and linguistic transformations (negation, disjunction, conjunction, and antonyms). We propose our {Lens of Logic (LOL)} model which uses question-attention and logic-attention to understand logical connectives in the question, and a novel Fréchet-Compatibility Loss, which ensures that the answers of the component questions and the composed question are consistent with the inferred logical operation. Our model shows substantial improvement in learning logical compositions while retaining performance on VQA. We suggest this work as a move towards robustness by embedding logical connectives in visual understanding. △ Less

Submitted 15 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: Accepted to ECCV 2020

arXiv:1911.11744 [pdf, other]

Imitation Learning of Robot Policies by Combining Language, Vision and Demonstration

Authors: Simon Stepputtis, Joseph Campbell, Mariano Phielipp, Chitta Baral, Heni Ben Amor

Abstract: In this work we propose a novel end-to-end imitation learning approach which combines natural language, vision, and motion information to produce an abstract representation of a task, which in turn is used to synthesize specific motion controllers at run-time. This multimodal approach enables generalization to a wide variety of environmental conditions and allows an end-user to direct a robot poli… ▽ More In this work we propose a novel end-to-end imitation learning approach which combines natural language, vision, and motion information to produce an abstract representation of a task, which in turn is used to synthesize specific motion controllers at run-time. This multimodal approach enables generalization to a wide variety of environmental conditions and allows an end-user to direct a robot policy through verbal communication. We empirically validate our approach with an extensive set of simulations and show that it achieves a high task success rate over a variety of conditions while remaining amenable to probabilistic interpretability. △ Less

Submitted 26 November, 2019; originally announced November 2019.

Comments: Accepted to the NeurIPS 2019 Workshop on Robot Learning: Control and Interaction in the Real World, Vancouver, Canada

arXiv:1911.03869 [pdf, other]

Knowledge Guided Named Entity Recognition for BioMedical Text

Authors: Pratyay Banerjee, Kuntal Kumar Pal, Murthy Devarakonda, Chitta Baral

Abstract: In this work, we formulate the NER task as a multi-answer knowledge guided QA task (KGQA) which helps to predict entities only by assigning B, I and O tags without associating entity types with the tags. We provide different knowledge contexts, such as, entity types, questions, definitions and examples along with the text and train on a combined dataset of 18 biomedical corpora. This formulation (… ▽ More In this work, we formulate the NER task as a multi-answer knowledge guided QA task (KGQA) which helps to predict entities only by assigning B, I and O tags without associating entity types with the tags. We provide different knowledge contexts, such as, entity types, questions, definitions and examples along with the text and train on a combined dataset of 18 biomedical corpora. This formulation (a) enables systems to jointly learn NER specific features from varied NER datasets, (b) can use knowledge-text attention to identify words having higher similarity to provided knowledge, improving performance, (c) reduces system confusion by reducing the prediction classes to B, I, O only, and (d) makes detection of nested entities easier. We perform extensive experiments of this KGQA formulation on 18 biomedical NER datasets, and through experiments we note that knowledge helps in achieving better performance. Our problem formulation is able to achieve state-of-the-art results in 12 datasets. △ Less

Submitted 18 September, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

Comments: 6 pages, 2 figures, 5 tables, WIP

arXiv:1910.02947 [pdf, other]

Hadron production in pp and p-Pb collisions: A mass dependent phenomenon

Authors: S. Sahoo, R. C. Baral, P. K. Sahu, M. K. Parida

Abstract: The mass dependence plays a significant role in the yield enhancement or suppression of hadrons in pp and p-Pb collisions at the LHC energies. This has been observed by parameterizing the variation of yield ratios between any two hadrons with event charged-particle multiplicity using a single empirical function. We notice that this variation is independent of all quantum numbers and solely depends… ▽ More The mass dependence plays a significant role in the yield enhancement or suppression of hadrons in pp and p-Pb collisions at the LHC energies. This has been observed by parameterizing the variation of yield ratios between any two hadrons with event charged-particle multiplicity using a single empirical function. We notice that this variation is independent of all quantum numbers and solely depends on masses of hadrons and masses of their valence quarks. The function shows that the amount of quark deconfinement increases with event multiplicity, and the quark coalescence favours more the production of heavier hadrons compared to lighter ones. △ Less

Submitted 25 November, 2019; v1 submitted 6 October, 2019; originally announced October 2019.

Comments: Yield enhancement, strange hadron, mass, pp 7 TeV, p-Pb 5.02 TeV

arXiv:1909.08855 [pdf, other]

How Additional Knowledge can Improve Natural Language Commonsense Question Answering?

Authors: Arindam Mitra, Pratyay Banerjee, Kuntal Kumar Pal, Swaroop Mishra, Chitta Baral

Abstract: Recently several datasets have been proposed to encourage research in Question Answering domains where commonsense knowledge is expected to play an important role. Recent language models such as ROBERTA, BERT and GPT that have been pre-trained on Wikipedia articles and books have shown reasonable performance with little fine-tuning on several such Multiple Choice Question-Answering (MCQ) datasets.… ▽ More Recently several datasets have been proposed to encourage research in Question Answering domains where commonsense knowledge is expected to play an important role. Recent language models such as ROBERTA, BERT and GPT that have been pre-trained on Wikipedia articles and books have shown reasonable performance with little fine-tuning on several such Multiple Choice Question-Answering (MCQ) datasets. Our goal in this work is to develop methods to incorporate additional (commonsense) knowledge into language model-based approaches for better question-answering in such domains. In this work, we first categorize external knowledge sources, and show performance does improve on using such sources. We then explore three different strategies for knowledge incorporation and four different models for question-answering using external commonsense knowledge. We analyze our predictions to explore the scope of further improvements. △ Less

Submitted 17 April, 2020; v1 submitted 19 September, 2019; originally announced September 2019.

Comments: 14 pages, 14 figures, 3 tables

arXiv:1908.03645 [pdf, ps, other]

A Generate-Validate Approach to Answering Questions about Qualitative Relationships

Authors: Arindam Mitra, Chitta Baral, Aurgho Bhattacharjee, Ishan Shrivastava

Abstract: Qualitative relationships describe how increasing or decreasing one property (e.g. altitude) affects another (e.g. temperature). They are an important aspect of natural language question answering and are crucial for building chatbots or voice agents where one may enquire about qualitative relationships. Recently a dataset about question answering involving qualitative relationships has been propo… ▽ More Qualitative relationships describe how increasing or decreasing one property (e.g. altitude) affects another (e.g. temperature). They are an important aspect of natural language question answering and are crucial for building chatbots or voice agents where one may enquire about qualitative relationships. Recently a dataset about question answering involving qualitative relationships has been proposed, and a few approaches to answer such questions have been explored, in the heart of which lies a semantic parser that converts the natural language input to a suitable logical form. A problem with existing semantic parsers is that they try to directly convert the input sentences to a logical form. Since the output language varies with each application, it forces the semantic parser to learn almost everything from scratch. In this paper, we show that instead of using a semantic parser to produce the logical form, if we apply the generate-validate framework i.e. generate a natural language description of the logical form and validate if the natural language description is followed from the input text, we get a better scope for transfer learning and our method outperforms the state-of-the-art by a large margin of 7.93%. △ Less

Submitted 9 August, 2019; originally announced August 2019.

arXiv:1907.10738 [pdf, other]

Careful Selection of Knowledge to solve Open Book Question Answering

Authors: Pratyay Banerjee, Kuntal Kumar Pal, Arindam Mitra, Chitta Baral

Abstract: Open book question answering is a type of natural language based QA (NLQA) where questions are expected to be answered with respect to a given set of open book facts, and common knowledge about a topic. Recently a challenge involving such QA, OpenBookQA, has been proposed. Unlike most other NLQA tasks that focus on linguistic understanding, OpenBookQA requires deeper reasoning involving linguistic… ▽ More Open book question answering is a type of natural language based QA (NLQA) where questions are expected to be answered with respect to a given set of open book facts, and common knowledge about a topic. Recently a challenge involving such QA, OpenBookQA, has been proposed. Unlike most other NLQA tasks that focus on linguistic understanding, OpenBookQA requires deeper reasoning involving linguistic understanding as well as reasoning with common knowledge. In this paper we address QA with respect to the OpenBookQA dataset and combine state of the art language models with abductive information retrieval (IR), information gain based re-ranking, passage selection and weighted scoring to achieve 72.0% accuracy, an 11.6% improvement over the current state of the art. △ Less

Submitted 24 July, 2019; originally announced July 2019.

Comments: Accepted to ACL 2019

arXiv:1906.09954 [pdf, other]

Integrating Knowledge and Reasoning in Image Understanding

Authors: Somak Aditya, Yezhou Yang, Chitta Baral

Abstract: Deep learning based data-driven approaches have been successfully applied in various image understanding applications ranging from object recognition, semantic segmentation to visual question answering. However, the lack of knowledge integration as well as higher-level reasoning capabilities with the methods still pose a hindrance. In this work, we present a brief survey of a few representative re… ▽ More Deep learning based data-driven approaches have been successfully applied in various image understanding applications ranging from object recognition, semantic segmentation to visual question answering. However, the lack of knowledge integration as well as higher-level reasoning capabilities with the methods still pose a hindrance. In this work, we present a brief survey of a few representative reasoning mechanisms, knowledge integration methods and their corresponding image understanding applications developed by various groups of researchers, approaching the problem from a variety of angles. Furthermore, we discuss upon key efforts on integrating external knowledge with neural networks. Taking cues from these efforts, we conclude by discussing potential pathways to improve reasoning capabilities. △ Less

Submitted 24 June, 2019; originally announced June 2019.

Comments: 8 pages, 2 figures

Journal ref: IJCAI 2019

arXiv:1905.12042 [pdf, other]

Blocksworld Revisited: Learning and Reasoning to Generate Event-Sequences from Image Pairs

Authors: Tejas Gokhale, Shailaja Sampat, Zhiyuan Fang, Yezhou Yang, Chitta Baral

Abstract: The process of identifying changes or transformations in a scene along with the ability of reasoning about their causes and effects, is a key aspect of intelligence. In this work we go beyond recent advances in computational perception, and introduce a more challenging task, Image-based Event-Sequencing (IES). In IES, the task is to predict a sequence of actions required to rearrange objects from… ▽ More The process of identifying changes or transformations in a scene along with the ability of reasoning about their causes and effects, is a key aspect of intelligence. In this work we go beyond recent advances in computational perception, and introduce a more challenging task, Image-based Event-Sequencing (IES). In IES, the task is to predict a sequence of actions required to rearrange objects from the configuration in an input source image to the one in the target image. IES also requires systems to possess inductive generalizability. Motivated from evidence in cognitive development, we compile the first IES dataset, the Blocksworld Image Reasoning Dataset (BIRD) which contains images of wooden blocks in different configurations, and the sequence of moves to rearrange one configuration to the other. We first explore the use of existing deep learning architectures and show that these end-to-end methods under-perform in inferring temporal event-sequences and fail at inductive generalization. We then propose a modular two-step approach: Visual Perception followed by Event-Sequencing, and demonstrate improved performance by combining learning and reasoning. Finally, by showing an extension of our approach on natural images, we seek to pave the way for future research on event sequencing for real world scenes. △ Less

Submitted 28 May, 2019; originally announced May 2019.

Comments: 10 pages, 5 figures, for associated dataset, see https://asu-active-perception-group.github.io/bird_dataset_web/

arXiv:1905.00198 [pdf, ps, other]

Declarative Question Answering over Knowledge Bases containing Natural Language Text with Answer Set Programming

Authors: Arindam Mitra, Peter Clark, Oyvind Tafjord, Chitta Baral

Abstract: While in recent years machine learning (ML) based approaches have been the popular approach in developing end-to-end question answering systems, such systems often struggle when additional knowledge is needed to correctly answer the questions. Proposed alternatives involve translating the question and the natural language text to a logical representation and then use logical reasoning. However, th… ▽ More While in recent years machine learning (ML) based approaches have been the popular approach in developing end-to-end question answering systems, such systems often struggle when additional knowledge is needed to correctly answer the questions. Proposed alternatives involve translating the question and the natural language text to a logical representation and then use logical reasoning. However, this alternative falters when the size of the text gets bigger. To address this we propose an approach that does logical reasoning over premises written in natural language text. The proposed method uses recent features of Answer Set Programming (ASP) to call external NLP modules (which may be based on ML) which perform simple textual entailment. To test our approach we develop a corpus based on the life cycle questions and showed that Our system achieves up to $18\%$ performance gain when compared to standard MCQ solvers. △ Less

Submitted 1 May, 2019; originally announced May 2019.

arXiv:1904.09720 [pdf, other]

Understanding Roles and Entities: Datasets and Models for Natural Language Inference

Authors: Arindam Mitra, Ishan Shrivastava, Chitta Baral

Abstract: We present two new datasets and a novel attention mechanism for Natural Language Inference (NLI). Existing neural NLI models, even though when trained on existing large datasets, do not capture the notion of entity and role well and often end up making mistakes such as "Peter signed a deal" can be inferred from "John signed a deal". The two datasets have been developed to mitigate such issues and… ▽ More We present two new datasets and a novel attention mechanism for Natural Language Inference (NLI). Existing neural NLI models, even though when trained on existing large datasets, do not capture the notion of entity and role well and often end up making mistakes such as "Peter signed a deal" can be inferred from "John signed a deal". The two datasets have been developed to mitigate such issues and make the systems better at understanding the notion of "entities" and "roles". After training the existing architectures on the new dataset we observe that the existing architectures does not perform well on one of the new benchmark. We then propose a modification to the "word-to-word" attention function which has been uniformly reused across several popular NLI architectures. The resulting architectures perform as well as their unmodified counterparts on the existing benchmarks and perform significantly well on the new benchmark for "roles" and "entities". △ Less

Submitted 22 April, 2019; originally announced April 2019.

arXiv:1902.09674 [pdf]

Developing and Using Special-Purpose Lexicons for Cohort Selection from Clinical Notes

Authors: Samarth Rawal, Ashok Prakash, Soumya Adhya, Sidharth Kulkarni, Saadat Anwar, Chitta Baral, Murthy Devarakonda

Abstract: Background and Significance: Selecting cohorts for a clinical trial typically requires costly and time-consuming manual chart reviews resulting in poor participation. To help automate the process, National NLP Clinical Challenges (N2C2) conducted a shared challenge by defining 13 criteria for clinical trial cohort selection and by providing training and test datasets. This research was motivated b… ▽ More Background and Significance: Selecting cohorts for a clinical trial typically requires costly and time-consuming manual chart reviews resulting in poor participation. To help automate the process, National NLP Clinical Challenges (N2C2) conducted a shared challenge by defining 13 criteria for clinical trial cohort selection and by providing training and test datasets. This research was motivated by the N2C2 challenge. Methods: We broke down the task into 13 independent subtasks corresponding to each criterion and implemented subtasks using rules or a supervised machine learning model. Each task critically depended on knowledge resources in the form of task-specific lexicons, for which we developed a novel model-driven approach. The approach allowed us to first expand the lexicon from a seed set and then remove noise from the list, thus improving the accuracy. Results: Our system achieved an overall F measure of 0.9003 at the challenge, and was statistically tied for the first place out of 45 participants. The model-driven lexicon development and further debugging the rules/code on the training set improved overall F measure to 0.9140, overtaking the best numerical result at the challenge. Discussion: Cohort selection, like phenotype extraction and classification, is amenable to rule-based or simple machine learning methods, however, the lexicons involved, such as medication names or medical terms referring to a medical problem, critically determine the overall accuracy. Automated lexicon development has the potential for scalability and accuracy. △ Less

Submitted 25 February, 2019; originally announced February 2019.

Comments: 13 pages, paper describing the NLP system built for N2C2 Task 1 2018 shared challenge in biomedical NLP

arXiv:1812.03631 [pdf, other]

Spatial Knowledge Distillation to aid Visual Reasoning

Authors: Somak Aditya, Rudra Saha, Yezhou Yang, Chitta Baral

Abstract: For tasks involving language and vision, the current state-of-the-art methods tend not to leverage any additional information that might be present to gather relevant (commonsense) knowledge. A representative task is Visual Question Answering where large diagnostic datasets have been proposed to test a system's capability of answering questions about images. The training data is often accompanied… ▽ More For tasks involving language and vision, the current state-of-the-art methods tend not to leverage any additional information that might be present to gather relevant (commonsense) knowledge. A representative task is Visual Question Answering where large diagnostic datasets have been proposed to test a system's capability of answering questions about images. The training data is often accompanied by annotations of individual object properties and spatial locations. In this work, we take a step towards integrating this additional privileged information in the form of spatial knowledge to aid in visual reasoning. We propose a framework that combines recent advances in knowledge distillation (teacher-student framework), relational reasoning and probabilistic logical languages to incorporate such knowledge in existing neural networks for the task of Visual Question Answering. Specifically, for a question posed against an image, we use a probabilistic logical language to encode the spatial knowledge and the spatial understanding about the question in the form of a mask that is directly provided to the teacher network. The student network learns from the ground-truth information as well as the teachers prediction via distillation. We also demonstrate the impact of predicting such a mask inside the teachers network using attention. Empirically, we show that both the methods improve the test accuracy over a state-of-the-art approach on a publicly available dataset. △ Less

Submitted 11 December, 2018; v1 submitted 10 December, 2018; originally announced December 2018.

Comments: Equal contribution by first two authors. Accepted in WACV 2019

arXiv:1803.08896 [pdf, other]

Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering

Authors: Somak Aditya, Yezhou Yang, Chitta Baral

Abstract: Many vision and language tasks require commonsense reasoning beyond data-driven image and natural language processing. Here we adopt Visual Question Answering (VQA) as an example task, where a system is expected to answer a question in natural language about an image. Current state-of-the-art systems attempted to solve the task using deep neural architectures and achieved promising performance. Ho… ▽ More Many vision and language tasks require commonsense reasoning beyond data-driven image and natural language processing. Here we adopt Visual Question Answering (VQA) as an example task, where a system is expected to answer a question in natural language about an image. Current state-of-the-art systems attempted to solve the task using deep neural architectures and achieved promising performance. However, the resulting systems are generally opaque and they struggle in understanding questions for which extra knowledge is required. In this paper, we present an explicit reasoning layer on top of a set of penultimate neural network based systems. The reasoning layer enables reasoning and answering questions where additional knowledge is required, and at the same time provides an interpretable interface to the end users. Specifically, the reasoning layer adopts a Probabilistic Soft Logic (PSL) based engine to reason over a basket of inputs: visual relations, the semantic parse of the question, and background ontological knowledge from word2vec and ConceptNet. Experimental analysis of the answers and the key evidential predicates generated on the VQA dataset validate our approach. △ Less

Submitted 23 March, 2018; originally announced March 2018.

Comments: 9 pages, 3 figures, AAAI 2018

arXiv:1802.07966 [pdf, other]

Incremental and Iterative Learning of Answer Set Programs from Mutually Distinct Examples

Authors: Arindam Mitra, Chitta Baral

Abstract: Over the years the Artificial Intelligence (AI) community has produced several datasets which have given the machine learning algorithms the opportunity to learn various skills across various domains. However, a subclass of these machine learning algorithms that aimed at learning logic programs, namely the Inductive Logic Programming algorithms, have often failed at the task due to the vastness of… ▽ More Over the years the Artificial Intelligence (AI) community has produced several datasets which have given the machine learning algorithms the opportunity to learn various skills across various domains. However, a subclass of these machine learning algorithms that aimed at learning logic programs, namely the Inductive Logic Programming algorithms, have often failed at the task due to the vastness of these datasets. This has impacted the usability of knowledge representation and reasoning techniques in the development of AI systems. In this research, we try to address this scalability issue for the algorithms that learn answer set programs. We present a sound and complete algorithm which takes the input in a slightly different manner and performs an efficient and more user controlled search for a solution. We show via experiments that our algorithm can learn from two popular datasets from machine learning community, namely bAbl (a question answering dataset) and MNIST (a dataset for handwritten digit recognition), which to the best of our knowledge was not previously possible. The system is publicly available at https://goo.gl/KdWAcV. This paper is under consideration for acceptance in TPLP. △ Less

Submitted 1 May, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

arXiv:1611.06631 [pdf, ps, other]

On Selecting a Conjunction Operation in Probabilistic Soft Logic

Authors: Vladik Kreinovich, Chitta Baral

Abstract: Probabilistic Soft Logic has been proposed and used in several applications as an efficient way to deal with inconsistency, uncertainty and relational representation. In several applications, this approach has led to an adequate description of the corresponding human reasoning. In this paper, we provide a theoretical explanation for one of the semi-heuristic choices made in this approach: namely,… ▽ More Probabilistic Soft Logic has been proposed and used in several applications as an efficient way to deal with inconsistency, uncertainty and relational representation. In several applications, this approach has led to an adequate description of the corresponding human reasoning. In this paper, we provide a theoretical explanation for one of the semi-heuristic choices made in this approach: namely, we explain the choice of the corresponding conjunction operations. Our explanation leads to a more general family of operations which may be used in future applications of probabilistic soft logic. △ Less

Submitted 20 November, 2016; originally announced November 2016.

arXiv:1611.05896 [pdf, other]

Answering Image Riddles using Vision and Reasoning through Probabilistic Soft Logic

Authors: Somak Aditya, Yezhou Yang, Chitta Baral, Yiannis Aloimonos

Abstract: In this work, we explore a genre of puzzles ("image riddles") which involves a set of images and a question. Answering these puzzles require both capabilities involving visual detection (including object, activity recognition) and, knowledge-based or commonsense reasoning. We compile a dataset of over 3k riddles where each riddle consists of 4 images and a groundtruth answer. The annotations are v… ▽ More In this work, we explore a genre of puzzles ("image riddles") which involves a set of images and a question. Answering these puzzles require both capabilities involving visual detection (including object, activity recognition) and, knowledge-based or commonsense reasoning. We compile a dataset of over 3k riddles where each riddle consists of 4 images and a groundtruth answer. The annotations are validated using crowd-sourced evaluation. We also define an automatic evaluation metric to track future progress. Our task bears similarity with the commonly known IQ tasks such as analogy solving, sequence filling that are often used to test intelligence. We develop a Probabilistic Reasoning-based approach that utilizes probabilistic commonsense knowledge to answer these riddles with a reasonable accuracy. We demonstrate the results of our approach using both automatic and human evaluations. Our approach achieves some promising results for these riddles and provides a strong baseline for future attempts. We make the entire dataset and related materials publicly available to the community in ImageRiddle Website (http://bit.ly/22f9Ala). △ Less

Submitted 17 November, 2016; originally announced November 2016.

Comments: 14 pages, 10 figures

arXiv:1606.07971 [pdf, other]

doi 10.1142/S0218301316500920

Production of D-mesons in $p$+$p$ and $p$+Pb collisions at LHC energies

Authors: R. C. Baral, S. K. Tripathy, M. Younus, Z. Naik, P. K. Sahu

Abstract: We present theoretical model comparison with published ALICE results for D-mesons (D$^0$, D$^+$ and D$^{*+}$) in $p$+$p$ collisions at $\sqrt{s}$ = 7 TeV and $p$+Pb collisions at $\sqrt{s_{NN}}$ = 5.02 TeV. Event generator HIJING, transport calculation of AMPT and calculations from NLO(MNR) and FONLL have been used for this study. We found that HIJING and AMPT model predictions are matching with p… ▽ More We present theoretical model comparison with published ALICE results for D-mesons (D$^0$, D$^+$ and D$^{*+}$) in $p$+$p$ collisions at $\sqrt{s}$ = 7 TeV and $p$+Pb collisions at $\sqrt{s_{NN}}$ = 5.02 TeV. Event generator HIJING, transport calculation of AMPT and calculations from NLO(MNR) and FONLL have been used for this study. We found that HIJING and AMPT model predictions are matching with published D-meson cross-sections in $p$+$p$ collisions, while both under-predict the same in $p$+Pb collisions. Attempts were made to explain the $R_{pPb}$ data using NLO-pQCD(MNR), FONLL and other above mentioned models. △ Less

Submitted 24 October, 2016; v1 submitted 25 June, 2016; originally announced June 2016.

Comments: 11 pages, 3 figures (Accepted in IJMP-E)

Journal ref: IJMP E, Vol. 25, No. 11 (2016)

arXiv:1511.03292 [pdf, other]

From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge

Authors: Somak Aditya, Yezhou Yang, Chitta Baral, Cornelia Fermuller, Yiannis Aloimonos

Abstract: In this paper we propose the construction of linguistic descriptions of images. This is achieved through the extraction of scene description graphs (SDGs) from visual scenes using an automatically constructed knowledge base. SDGs are constructed using both vision and reasoning. Specifically, commonsense reasoning is applied on (a) detections obtained from existing perception methods on given image… ▽ More In this paper we propose the construction of linguistic descriptions of images. This is achieved through the extraction of scene description graphs (SDGs) from visual scenes using an automatically constructed knowledge base. SDGs are constructed using both vision and reasoning. Specifically, commonsense reasoning is applied on (a) detections obtained from existing perception methods on given images, (b) a "commonsense" knowledge base constructed using natural language processing of image annotations and (c) lexical ontological knowledge from resources such as WordNet. Amazon Mechanical Turk(AMT)-based evaluations on Flickr8k, Flickr30k and MS-COCO datasets show that in most cases, sentences auto-constructed from SDGs obtained by our method give a more relevant and thorough description of an image than a recent state-of-the-art image caption based approach. Our Image-Sentence Alignment Evaluation results are also comparable to that of the recent state-of-the art approaches. △ Less

Submitted 10 November, 2015; originally announced November 2015.

ACM Class: I.2.10

arXiv:1511.01960 [pdf, other]

An Action Language for Multi-Agent Domains: Foundations

Authors: Chitta Baral, Gregory Gelfond, Enrico Pontelli, Tran Cao Son

Abstract: In multi-agent domains (MADs), an agent's action may not just change the world and the agent's knowledge and beliefs about the world, but also may change other agents' knowledge and beliefs about the world and their knowledge and beliefs about other agents' knowledge and beliefs about the world. The goals of an agent in a multi-agent world may involve manipulating the knowledge and beliefs of othe… ▽ More In multi-agent domains (MADs), an agent's action may not just change the world and the agent's knowledge and beliefs about the world, but also may change other agents' knowledge and beliefs about the world and their knowledge and beliefs about other agents' knowledge and beliefs about the world. The goals of an agent in a multi-agent world may involve manipulating the knowledge and beliefs of other agents' and again, not just their knowledge/belief about the world, but also their knowledge about other agents' knowledge about the world. Our goal is to present an action language (mA+) that has the necessary features to address the above aspects in representing and RAC in MADs. mA+ allows the representation of and reasoning about different types of actions that an agent can perform in a domain where many other agents might be present -- such as world-altering actions, sensing actions, and announcement/communication actions. It also allows the specification of agents' dynamic awareness of action occurrences which has future implications on what agents' know about the world and other agents' knowledge about the world. mA+ considers three different types of awareness: full-, partial- awareness, and complete oblivion of an action occurrence and its effects. This keeps the language simple, yet powerful enough to address a large variety of knowledge manipulation scenarios in MADs. The semantics of mA+ relies on the notion of state, which is described by a pointed Kripke model and is used to encode the agent's knowledge and the real state of the world. It is defined by a transition function that maps pairs of actions and states into sets of states. We illustrate properties of the action theories, including properties that guarantee finiteness of the set of initial states and their practical implementability. Finally, we relate mA+ to other related formalisms that contribute to RAC in MADs. △ Less

Submitted 26 December, 2020; v1 submitted 5 November, 2015; originally announced November 2015.

Comments: 49 pages, 12 figures

arXiv:1306.4411 [pdf, other]

Event-Object Reasoning with Curated Knowledge Bases: Deriving Missing Information

Authors: Chitta Baral, Nguyen H. Vo

Abstract: The broader goal of our research is to formulate answers to why and how questions with respect to knowledge bases, such as AURA. One issue we face when reasoning with many available knowledge bases is that at times needed information is missing. Examples of this include partially missing information about next sub-event, first sub-event, last sub-event, result of an event, input to an event, desti… ▽ More The broader goal of our research is to formulate answers to why and how questions with respect to knowledge bases, such as AURA. One issue we face when reasoning with many available knowledge bases is that at times needed information is missing. Examples of this include partially missing information about next sub-event, first sub-event, last sub-event, result of an event, input to an event, destination of an event, and raw material involved in an event. In many cases one can recover part of the missing knowledge through reasoning. In this paper we give a formal definition about how such missing information can be recovered and then give an ASP implementation of it. We then discuss the implication of this with respect to answering why and how questions. △ Less

Submitted 19 June, 2013; v1 submitted 18 June, 2013; originally announced June 2013.

Comments: 13 pages

arXiv:1306.3548 [pdf, other]

Encoding Higher Level Extensions of Petri Nets in Answer Set Programming

Authors: Saadat Anwar, Chitta Baral, Katsumi Inoue

Abstract: Answering realistic questions about biological systems and pathways similar to the ones used by text books to test understanding of students about biological systems is one of our long term research goals. Often these questions require simulation based reasoning. To answer such questions, we need formalisms to build pathway models, add extensions, simulate, and reason with them. We chose Petri Net… ▽ More Answering realistic questions about biological systems and pathways similar to the ones used by text books to test understanding of students about biological systems is one of our long term research goals. Often these questions require simulation based reasoning. To answer such questions, we need formalisms to build pathway models, add extensions, simulate, and reason with them. We chose Petri Nets and Answer Set Programming (ASP) as suitable formalisms, since Petri Net models are similar to biological pathway diagrams; and ASP provides easy extension and strong reasoning abilities. We found that certain aspects of biological pathways, such as locations and substance types, cannot be represented succinctly using regular Petri Nets. As a result, we need higher level constructs like colored tokens. In this paper, we show how Petri Nets with colored tokens can be encoded in ASP in an intuitive manner, how additional Petri Net extensions can be added by making small code changes, and how this work furthers our long term research goals. Our approach can be adapted to other domains with similar modeling needs. △ Less

Submitted 24 June, 2013; v1 submitted 15 June, 2013; originally announced June 2013.

arXiv:1306.3542 [pdf, other]

Encoding Petri Nets in Answer Set Programming for Simulation Based Reasoning

Authors: Saadat Anwar, Chitta Baral, Katsumi Inoue

Abstract: One of our long term research goals is to develop systems to answer realistic questions (e.g., some mentioned in textbooks) about biological pathways that a biologist may ask. To answer such questions we need formalisms that can model pathways, simulate their execution, model intervention to those pathways, and compare simulations under different circumstances. We found Petri Nets to be the starti… ▽ More One of our long term research goals is to develop systems to answer realistic questions (e.g., some mentioned in textbooks) about biological pathways that a biologist may ask. To answer such questions we need formalisms that can model pathways, simulate their execution, model intervention to those pathways, and compare simulations under different circumstances. We found Petri Nets to be the starting point of a suitable formalism for the modeling and simulation needs. However, we need to make extensions to the Petri Net model and also reason with multiple simulation runs and parallel state evolutions. Towards that end Answer Set Programming (ASP) implementation of Petri Nets would allow us to do both. In this paper we show how ASP can be used to encode basic Petri Nets in an intuitive manner. We then show how we can modify this encoding to model several Petri Net extensions by making small changes. We then highlight some of the reasoning capabilities that we will use to accomplish our ultimate research goal. △ Less

Submitted 24 June, 2013; v1 submitted 14 June, 2013; originally announced June 2013.

arXiv:1210.5670 [pdf, ps, other]

Typed Answer Set Programming and Inverse Lambda Algorithms

Authors: Chitta Baral, Juraj Dzifcak, Marcos A. Gonzalez, Aaron Gottesman

Abstract: Our broader goal is to automatically translate English sentences into formulas in appropriate knowledge representation languages as a step towards understanding and thus answering questions with respect to English text. Our focus in this paper is on the language of Answer Set Programming (ASP). Our approach to translate sentences to ASP rules is inspired by Montague's use of lambda calculus formul… ▽ More Our broader goal is to automatically translate English sentences into formulas in appropriate knowledge representation languages as a step towards understanding and thus answering questions with respect to English text. Our focus in this paper is on the language of Answer Set Programming (ASP). Our approach to translate sentences to ASP rules is inspired by Montague's use of lambda calculus formulas as meaning of words and phrases. With ASP as the target language the meaning of words and phrases are ASP-lambda formulas. In an earlier work we illustrated our approach by manually developing a dictionary of words and their ASP-lambda formulas. However such an approach is not scalable. In this paper our focus is on two algorithms that allow one to construct ASP-lambda formulas in an inverse manner. In particular the two algorithms take as input two lambda-calculus expressions G and H and compute a lambda-calculus expression F such that F with input as G, denoted by F@G, is equal to H; and similarly G@F = H. We present correctness and complexity results about these algorithms. To do that we develop the notion of typed ASP-lambda calculus theories and their orders and use it in developing the completeness results. (To appear in Theory and Practice of Logic Programming.) △ Less

Submitted 20 October, 2012; originally announced October 2012.

Comments: To appear in Theory and Practice of Logic Programming

arXiv:1203.3641 [pdf, ps, other]

doi 10.1016/j.physletb.2012.10.078

Inclusive J/psi production in pp collisions at sqrt(s) = 2.76 TeV

Authors: ALICE Collaboration, B. Abelev, J. Adam, D. Adamova, A. M. Adare, M. M. Aggarwal, G. Aglieri Rinella, A. G. Agocs, A. Agostinelli, S. Aguilar Salazar, Z. Ahammed, A. Ahmad Masoodi, N. Ahmad, S. U. Ahn, A. Akindinov, D. Aleksandrov, B. Alessandro, R. Alfaro Molina, A. Alici, A. Alkin, E. Almaraz Avina, J. Alme, T. Alt, V. Altini, S. Altinpinar , et al. (948 additional authors not shown)

Abstract: The ALICE Collaboration has measured inclusive J/psi production in pp collisions at a center of mass energy sqrt(s)=2.76 TeV at the LHC. The results presented in this Letter refer to the rapidity ranges |y|<0.9 and 2.5<y<4 and have been obtained by measuring the electron and muon pair decay channels, respectively. The integrated luminosities for the two channels are L^e_int=1.1 nb^-1 and L^mu_int=… ▽ More The ALICE Collaboration has measured inclusive J/psi production in pp collisions at a center of mass energy sqrt(s)=2.76 TeV at the LHC. The results presented in this Letter refer to the rapidity ranges |y|<0.9 and 2.5<y<4 and have been obtained by measuring the electron and muon pair decay channels, respectively. The integrated luminosities for the two channels are L^e_int=1.1 nb^-1 and L^mu_int=19.9 nb^-1, and the corresponding signal statistics are N_J/psi^e+e-=59 +/- 14 and N_J/psi^mu+mu-=1364 +/- 53. We present dsigma_J/psi/dy for the two rapidity regions under study and, for the forward-y range, d^2sigma_J/psi/dydp_t in the transverse momentum domain 0<p_t<8 GeV/c. The results are compared with previously published results at sqrt(s)=7 TeV and with theoretical calculations. △ Less

Submitted 6 November, 2012; v1 submitted 16 March, 2012; originally announced March 2012.

Comments: 7 figures, 3 tables, accepted for publication in Phys. Lett. B

Report number: CERN-PH-EP-2012-055

Journal ref: Phys.Lett.B 718 (2012) 295-306, Phys.Lett.B 748 (2015) 472-473 (erratum)

arXiv:1108.3850 [pdf, other]

Solving puzzles described in English by automated translation to answer set programming and learning how to do that translation

Authors: Chitta Baral, Juraj Dzifcak

Abstract: We present a system capable of automatically solving combinatorial logic puzzles given in (simplified) English. It involves translating the English descriptions of the puzzles into answer set programming(ASP) and using ASP solvers to provide solutions of the puzzles. To translate the descriptions, we use a lambda-calculus based approach using Probabilistic Combinatorial Categorial Grammars (PCCG)… ▽ More We present a system capable of automatically solving combinatorial logic puzzles given in (simplified) English. It involves translating the English descriptions of the puzzles into answer set programming(ASP) and using ASP solvers to provide solutions of the puzzles. To translate the descriptions, we use a lambda-calculus based approach using Probabilistic Combinatorial Categorial Grammars (PCCG) where the meanings of words are associated with parameters to be able to distinguish between multiple meanings of the same word. Meaning of many words and the parameters are learned. The puzzles are represented in ASP using an ontology which is applicable to a large set of logic puzzles. △ Less

Submitted 18 August, 2011; originally announced August 2011.

arXiv:1108.3848 [pdf, other]

Language understanding as a step towards human level intelligence - automatizing the construction of the initial dictionary from example sentences

Authors: Chitta Baral, Juraj Dzifcak

Abstract: For a system to understand natural language, it needs to be able to take natural language text and answer questions given in natural language with respect to that text; it also needs to be able to follow instructions given in natural language. To achieve this, a system must be able to process natural language and be able to capture the knowledge within that text. Thus it needs to be able to transl… ▽ More For a system to understand natural language, it needs to be able to take natural language text and answer questions given in natural language with respect to that text; it also needs to be able to follow instructions given in natural language. To achieve this, a system must be able to process natural language and be able to capture the knowledge within that text. Thus it needs to be able to translate natural language text into a formal language. We discuss our approach to do this, where the translation is achieved by composing the meaning of words in a sentence. Our initial approach uses an inverse lambda method that we developed (and other methods) to learn meaning of words from meaning of sentences and an initial lexicon. We then present an improved method where the initial lexicon is also learned by analyzing the training sentence and meaning pairs. We evaluate our methods and compare them with other existing methods on a corpora of database querying and robot command and control. △ Less

Submitted 18 August, 2011; originally announced August 2011.

arXiv:1108.3843 [pdf, ps, other]

Using Inverse lambda and Generalization to Translate English to Formal Languages

Authors: Chitta Baral, Juraj Dzifcak, Marcos Alvarez Gonzalez, Jiayu Zhou

Abstract: We present a system to translate natural language sentences to formulas in a formal or a knowledge representation language. Our system uses two inverse lambda-calculus operators and using them can take as input the semantic representation of some words, phrases and sentences and from that derive the semantic representation of other words and phrases. Our inverse lambda operator works on many forma… ▽ More We present a system to translate natural language sentences to formulas in a formal or a knowledge representation language. Our system uses two inverse lambda-calculus operators and using them can take as input the semantic representation of some words, phrases and sentences and from that derive the semantic representation of other words and phrases. Our inverse lambda operator works on many formal languages including first order logic, database query languages and answer set programming. Our system uses a syntactic combinatorial categorial parser to parse natural language sentences and also to construct the semantic meaning of the sentences as directed by their parsing. The same parser is used for both. In addition to the inverse lambda-calculus operators, our system uses a notion of generalization to learn semantic representation of words from the semantic representation of other words that are of the same category. Together with this, we use an existing statistical learning approach to assign weights to deal with multiple meanings of words. Our system produces improved results on standard corpora on natural language interfaces for robot command and control and database queries. △ Less

Submitted 18 August, 2011; originally announced August 2011.

Journal ref: Proceedings of International Conference on Computational Semantics (IWCS) 2011, Oxford, pp:35-44

arXiv:1007.3700 [pdf, ps, other]

doi 10.1017/S1471068410000359

Logic Programming for Finding Models in the Logics of Knowledge and its Applications: A Case Study

Authors: Chitta Baral, Gregory Gelfond, Enrico Pontelli, Tran Cao Son

Abstract: The logics of knowledge are modal logics that have been shown to be effective in representing and reasoning about knowledge in multi-agent domains. Relatively few computational frameworks for dealing with computation of models and useful transformations in logics of knowledge (e.g., to support multi-agent planning with knowledge actions and degrees of visibility) have been proposed. This paper exp… ▽ More The logics of knowledge are modal logics that have been shown to be effective in representing and reasoning about knowledge in multi-agent domains. Relatively few computational frameworks for dealing with computation of models and useful transformations in logics of knowledge (e.g., to support multi-agent planning with knowledge actions and degrees of visibility) have been proposed. This paper explores the use of logic programming (LP) to encode interesting forms of logics of knowledge and compute Kripke models. The LP modeling is expanded with useful operators on Kripke structures, to support multi-agent planning in the presence of both world-altering and knowledge actions. This results in the first ever implementation of a planner for this type of complex multi-agent domains. △ Less

Submitted 21 July, 2010; originally announced July 2010.

Comments: 16 pages, 1 figure, International Conference on Logic Programming 2010

Journal ref: Theory and Practice of Logic Programming, Volume 10, Special Issue 4-6, July 2010, pages 675-690

arXiv:1001.4277 [pdf]

Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text

Authors: Siddhartha Jonnalagadda, Luis Tari, Jorg Hakenberg, Chitta Baral, Graciela Gonzalez

Abstract: The complexity of sentences characteristic to biomedical articles poses a challenge to natural language parsers, which are typically trained on large-scale corpora of non-technical text. We propose a text simplification process, bioSimplify, that seeks to reduce the complexity of sentences in biomedical abstracts in order to improve the performance of syntactic parsers on the processed sentences… ▽ More The complexity of sentences characteristic to biomedical articles poses a challenge to natural language parsers, which are typically trained on large-scale corpora of non-technical text. We propose a text simplification process, bioSimplify, that seeks to reduce the complexity of sentences in biomedical abstracts in order to improve the performance of syntactic parsers on the processed sentences. Syntactic parsing is typically one of the first steps in a text mining pipeline. Thus, any improvement in performance would have a ripple effect over all processing steps. We evaluated our method using a corpus of biomedical sentences annotated with syntactic links. Our empirical results show an improvement of 2.90% for the Charniak-McClosky parser and of 4.23% for the Link Grammar parser when processing simplified sentences rather than the original sentences in the corpus. △ Less

Submitted 24 January, 2010; originally announced January 2010.

Comments: 4 pages, In Proc. of the NAACL-HLT 2009, Boulder, USA, June

Journal ref: Proc. of the NAACL-HLT 2009, Boulder, USA, June 2009

arXiv:0812.0659 [pdf, ps, other]

Probabilistic reasoning with answer sets

Authors: Chitta Baral, Michael Gelfond, Nelson Rushton

Abstract: This paper develops a declarative language, P-log, that combines logical and probabilistic arguments in its reasoning. Answer Set Prolog is used as the logical foundation, while causal Bayes nets serve as a probabilistic foundation. We give several non-trivial examples and illustrate the use of P-log for knowledge representation and updating of knowledge. We argue that our approach to updates is… ▽ More This paper develops a declarative language, P-log, that combines logical and probabilistic arguments in its reasoning. Answer Set Prolog is used as the logical foundation, while causal Bayes nets serve as a probabilistic foundation. We give several non-trivial examples and illustrate the use of P-log for knowledge representation and updating of knowledge. We argue that our approach to updates is more appealing than existing approaches. We give sufficiency conditions for the coherency of P-log programs and show that Bayes nets can be easily mapped to coherent P-log programs. △ Less

Submitted 3 December, 2008; originally announced December 2008.

Comments: 77 pages. To appear in Theory and Practice of Logic Programming (TPLP)

arXiv:cs/0609111 [pdf, ps, other]

doi 10.2168/LMCS-2(4:2)2006

A State-Based Regression Formulation for Domains with Sensing Actions<br> and Incomplete Information

Authors: Le-Chi Tuan, Chitta Baral, Tran Cao Son

Abstract: We present a state-based regression function for planning domains where an agent does not have complete information and may have sensing actions. We consider binary domains and employ a three-valued characterization of domains with sensing actions to define the regression function. We prove the soundness and completeness of our regression formulation with respect to the definition of progression… ▽ More We present a state-based regression function for planning domains where an agent does not have complete information and may have sensing actions. We consider binary domains and employ a three-valued characterization of domains with sensing actions to define the regression function. We prove the soundness and completeness of our regression formulation with respect to the definition of progression. More specifically, we show that (i) a plan obtained through regression for a planning problem is indeed a progression solution of that planning problem, and that (ii) for each plan found through progression, using regression one obtains that plan or an equivalent one. △ Less

Submitted 1 October, 2006; v1 submitted 19 September, 2006; originally announced September 2006.

Comments: 34 pages, 7 Figures

ACM Class: I.2.4; I.2.8

Journal ref: Logical Methods in Computer Science, Volume 2, Issue 4 (October 2, 2006) lmcs:2238

arXiv:cs/0605017 [pdf, ps, other]

Reasoning and Planning with Sensing Actions, Incomplete Information, and Static Causal Laws using Answer Set Programming

Authors: Phan Huy Tu, Tran Cao Son, Chitta Baral

Abstract: We extend the 0-approximation of sensing actions and incomplete information in [Son and Baral 2000] to action theories with static causal laws and prove its soundness with respect to the possible world semantics. We also show that the conditional planning problem with respect to this approximation is NP-complete. We then present an answer set programming based conditional planner, called ASCP, t… ▽ More We extend the 0-approximation of sensing actions and incomplete information in [Son and Baral 2000] to action theories with static causal laws and prove its soundness with respect to the possible world semantics. We also show that the conditional planning problem with respect to this approximation is NP-complete. We then present an answer set programming based conditional planner, called ASCP, that is capable of generating both conformant plans and conditional plans in the presence of sensing actions, incomplete information about the initial state, and static causal laws. We prove the correctness of our implementation and argue that our planner is sound and complete with respect to the proposed approximation. Finally, we present experimental results comparing ASCP to other planners. △ Less

Submitted 4 May, 2006; originally announced May 2006.

Comments: 72 pages, 3 figures, a preliminary version of this paper appeared in the proceedings of the 7th International Conference on Logic Programming and Non-Monotonic Reasoning, 2004. To appear in Theory and Practice of Logic Programming

ACM Class: I.2.3; I.2.4; I.2.8

arXiv:cs/0405071 [pdf, ps, other]

Regression with respect to sensing actions and partial states

Authors: Le-Chi Tuan, Chitta Baral, Tran Cao Son

Abstract: In this paper, we present a state-based regression function for planning domains where an agent does not have complete information and may have sensing actions. We consider binary domains and employ the 0-approximation [Son & Baral 2001] to define the regression function. In binary domains, the use of 0-approximation means using 3-valued states. Although planning using this approach is incomplet… ▽ More In this paper, we present a state-based regression function for planning domains where an agent does not have complete information and may have sensing actions. We consider binary domains and employ the 0-approximation [Son & Baral 2001] to define the regression function. In binary domains, the use of 0-approximation means using 3-valued states. Although planning using this approach is incomplete with respect to the full semantics, we adopt it to have a lower complexity. We prove the soundness and completeness of our regression formulation with respect to the definition of progression. More specifically, we show that (i) a plan obtained through regression for a planning problem is indeed a progression solution of that planning problem, and that (ii) for each plan found through progression, using regression one obtains that plan or an equivalent one. We then develop a conditional planner that utilizes our regression function. We prove the soundness and completeness of our planning algorithm and present experimental results with respect to several well known planning problems in the literature. △ Less

Submitted 21 May, 2004; originally announced May 2004.

Comments: 38 pages

ACM Class: I.2.4; I.2.8

arXiv:cs/0207023 [pdf, ps, other]

Domain-Dependent Knowledge in Answer Set Planning

Authors: Tran Cao Son, Chitta Baral, Nam Tran, Sheila McIlraith

Abstract: In this paper we consider three different kinds of domain-dependent control knowledge (temporal, procedural and HTN-based) that are useful in planning. Our approach is declarative and relies on the language of logic programming with answer set semantics (AnsProlog*). AnsProlog* is designed to plan without control knowledge. We show how temporal, procedural and HTN-based control knowledge can be… ▽ More In this paper we consider three different kinds of domain-dependent control knowledge (temporal, procedural and HTN-based) that are useful in planning. Our approach is declarative and relies on the language of logic programming with answer set semantics (AnsProlog*). AnsProlog* is designed to plan without control knowledge. We show how temporal, procedural and HTN-based control knowledge can be incorporated into AnsProlog* by the modular addition of a small number of domain-dependent rules, without the need to modify the planner. We formally prove the correctness of our planner, both in the absence and presence of the control knowledge. Finally, we perform some initial experimentation that demonstrates the potential reduction in planning time that can be achieved when procedural domain knowledge is used to solve planning problems with large plan length. △ Less

Submitted 29 August, 2005; v1 submitted 7 July, 2002; originally announced July 2002.

Comments: 70 pages, accepted for publication, TOCL Version with all proofs

ACM Class: I.2.4; I.2.3; I.2.8

arXiv:cs/0003073

Proceedings of the 8th International Workshop on Non-Monotonic Reasoning, NMR'2000

Authors: Chitta Baral, Miroslaw Truszczynski

Abstract: The papers gathered in this collection were presented at the 8th International Workshop on Nonmonotonic Reasoning, NMR2000. The series was started by John McCarthy in 1978. The first international NMR workshop was held at Mohonk Mountain House, New Paltz, New York in June, 1984, and was organized by Ray Reiter and Bonnie Webber. In the last 10 years the area of nonmonotonic reasoning has seen… ▽ More The papers gathered in this collection were presented at the 8th International Workshop on Nonmonotonic Reasoning, NMR2000. The series was started by John McCarthy in 1978. The first international NMR workshop was held at Mohonk Mountain House, New Paltz, New York in June, 1984, and was organized by Ray Reiter and Bonnie Webber. In the last 10 years the area of nonmonotonic reasoning has seen a number of important developments. Significant theoretical advances were made in the understanding of general abstract principles underlying nonmonotonicity. Key results on the expressibility and computational complexity of nonmonotonic logics were established. The role of nonmonotonic reasoning in belief revision, abduction, reasoning about action, planing and uncertainty was further clarified. Several successful NMR systems were built and used in applications such as planning, scheduling, logic programming and constraint satisfaction. The papers in the proceedings reflect these recent advances in the field. They are grouped into sections corresponding to special sessions as they were held at the workshop: 1. General NMR track 2. Abductive reasonig 3. Belief revision: theory and practice 4. Representing action and planning 5. Systems descriptions and demonstrations 6. Uncertainty frameworks in NMR △ Less

Submitted 22 March, 2000; originally announced March 2000.

Comments: Contributing editors: Marc Denecker, Antonis Kakas, Francesca Toni - Abductive Reasoning; Samir Chopra, Mary-Anne Williams - Belief change: theory and practice; Vladimir Lifschitz, Alessandro Provetti - Representing actions and planning; Juergen Dix - System demonstrations and presentations; Salem Benferhat, Henri Prade - Uncertainty frameworks in NMR

ACM Class: I2.2; I2.3; I2.4; I2.8; F4.1

Showing 101–143 of 143 results for author: Baral, C