Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Rossiello, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.13560  [pdf, ps, other

    cs.DB

    Open Government Data Corpus for Table Search

    Authors: Michael Glass, Sugato Bagchi, Oktie Hassanzadeh, Gaetano Rossiello, Alfio Gliozzo

    Abstract: Increasing amounts of structured data can provide value for research and business if the relevant data can be located. Often the data is in a data lake without a consistent schema, making locating useful data challenging. Table search is a growing research area, but existing benchmarks have been limited to displayed tables. Tables sized and formatted for display in a Wikipedia page or ArXiv paper… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  2. arXiv:2306.11843  [pdf, other

    cs.CL cs.AI cs.DB cs.IR

    Retrieval-Based Transformer for Table Augmentation

    Authors: Michael Glass, Xueqing Wu, Ankita Rajaram Naik, Gaetano Rossiello, Alfio Gliozzo

    Abstract: Data preparation, also called data wrangling, is considered one of the most expensive and time-consuming steps when performing analytics or building machine learning models. Preparing data typically involves collecting and merging data from complex heterogeneous, and often large-scale data sources, such as data lakes. In this paper, we introduce a novel approach toward automatic data wrangling in… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023

  3. arXiv:2210.13952  [pdf, other

    cs.CL cs.AI cs.IR

    KnowGL: Knowledge Generation and Linking from Text

    Authors: Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Nandana Mihindukulasooriya, Owen Cornec, Alfio Massimiliano Gliozzo

    Abstract: We propose KnowGL, a tool that allows converting text into structured relational data represented as a set of ABox assertions compliant with the TBox of a given Knowledge Graph (KG), such as Wikidata. We address this problem as a sequence generation task by leveraging pre-trained sequence-to-sequence language models, e.g. BART. Given a sentence, we fine-tune such models to detect pairs of entity m… ▽ More

    Submitted 22 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: AAAI-23 Demo Track

  4. arXiv:2207.06300  [pdf, other

    cs.CL cs.AI cs.IR

    Re2G: Retrieve, Rerank, Generate

    Authors: Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Rajaram Naik, Pengshan Cai, Alfio Gliozzo

    Abstract: As demonstrated by GPT-3 and T5, transformers grow in capability as parameter spaces become larger and larger. However, for tasks that require a large amount of knowledge, non-parametric memory allows models to grow dramatically with a sub-linear increase in computational cost and GPU memory requirements. Recent models such as RAG and REALM have introduced retrieval into conditional generation. Th… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at NAACL 2022

  5. arXiv:2207.05188  [pdf, other

    cs.AI cs.CL cs.IR

    Knowledge Graph Induction enabling Recommending and Trend Analysis: A Corporate Research Community Use Case

    Authors: Nandana Mihindukulasooriya, Mike Sava, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Irene Yachbes, Aditya Gidh, Jillian Duckwitz, Kovit Nisar, Michael Santos, Alfio Gliozzo

    Abstract: A research division plays an important role of driving innovation in an organization. Drawing insights, following trends, keeping abreast of new research, and formulating strategies are increasingly becoming more challenging for both researchers and executives as the amount of information grows in both velocity and volume. In this paper we present a use case of how a corporate research community,… ▽ More

    Submitted 15 September, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted at ISWC 2022

    MSC Class: 68T01; 68T30 ACM Class: I.2.7; I.2.4; H.5

  6. arXiv:2204.03985  [pdf, other

    cs.CL cs.AI cs.LG

    KGI: An Integrated Framework for Knowledge Intensive Language Tasks

    Authors: Md Faisal Mahbub Chowdhury, Michael Glass, Gaetano Rossiello, Alfio Gliozzo, Nandana Mihindukulasooriya

    Abstract: In this paper, we present a system to showcase the capabilities of the latest state-of-the-art retrieval augmented generation models trained on knowledge-intensive language tasks, such as slot filling, open domain question answering, dialogue, and fact-checking. Moreover, given a user query, we show how the output from these different models can be combined to cross-examine the outputs of each oth… ▽ More

    Submitted 21 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: EMNLP 2022 Demo Track

  7. arXiv:2202.13229  [pdf, other

    cs.CL cs.AI

    A Generative Model for Relation Extraction and Classification

    Authors: Jian Ni, Gaetano Rossiello, Alfio Gliozzo, Radu Florian

    Abstract: Relation extraction (RE) is an important information extraction task which provides essential information to many NLP applications such as knowledge base population and question answering. In this paper, we present a novel generative model for relation extraction and classification (which we call GREC), where RE is modeled as a sequence-to-sequence generation task. We explore various encoding repr… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

  8. arXiv:2201.05302  [pdf, other

    cs.CL cs.AI

    Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation

    Authors: Md Faisal Mahbub Chowdhury, Gaetano Rossiello, Michael Glass, Nandana Mihindukulasooriya, Alfio Gliozzo

    Abstract: In recent years, a number of keyphrase generation (KPG) approaches were proposed consisting of complex model architectures, dedicated training paradigms and decoding strategies. In this work, we opt for simplicity and show how a commonly used seq2seq language model, BART, can be easily adapted to generate keyphrases from the text in a single batch computation using a simple training procedure. Emp… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  9. arXiv:2112.07606  [pdf, ps, other

    cs.CL cs.AI

    Semantic Answer Type and Relation Prediction Task (SMART 2021)

    Authors: Nandana Mihindukulasooriya, Mohnish Dubey, Alfio Gliozzo, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Ricardo Usbeck, Gaetano Rossiello, Uttam Kumar

    Abstract: Each year the International Semantic Web Conference organizes a set of Semantic Web Challenges to establish competitions that will advance state-of-the-art solutions in some problem domains. The Semantic Answer Type and Relation Prediction Task (SMART) task is one of the ISWC 2021 Semantic Web challenges. This is the second year of the challenge after a successful SMART 2020 at ISWC 2020. This yea… ▽ More

    Submitted 10 January, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    ACM Class: F.4.1; I.2.4; I.2.7

  10. arXiv:2111.05825  [pdf, other

    cs.CL cs.AI

    A Two-Stage Approach towards Generalization in Knowledge Base Question Answering

    Authors: Srinivas Ravishankar, June Thai, Ibrahim Abdelaziz, Nandana Mihidukulasooriya, Tahira Naseem, Pavan Kapanipathi, Gaetano Rossiello, Achille Fokoue

    Abstract: Most existing approaches for Knowledge Base Question Answering (KBQA) focus on a specific underlying knowledge base either because of inherent assumptions in the approach, or because evaluating it on a different knowledge base requires non-trivial changes. However, many popular knowledge bases share similarities in their underlying schemas that can be leveraged to facilitate generalization across… ▽ More

    Submitted 17 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  11. arXiv:2108.13934  [pdf, other

    cs.CL cs.AI cs.IR

    Robust Retrieval Augmented Generation for Zero-shot Slot Filling

    Authors: Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Alfio Gliozzo

    Abstract: Automatically inducing high quality knowledge graphs from a given collection of documents still remains a challenging problem in AI. One way to make headway for this problem is through advancements in a related task known as slot filling. In this task, given an entity query in form of [Entity, Slot, ?], a system is asked to fill the slot by generating or extracting the missing value exploiting evi… ▽ More

    Submitted 13 September, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted at EMNLP 2021. arXiv admin note: substantial text overlap with arXiv:2104.08610

  12. arXiv:2108.07337  [pdf, other

    cs.CL cs.AI

    Generative Relation Linking for Question Answering over Knowledge Bases

    Authors: Gaetano Rossiello, Nandana Mihindukulasooriya, Ibrahim Abdelaziz, Mihaela Bornea, Alfio Gliozzo, Tahira Naseem, Pavan Kapanipathi

    Abstract: Relation linking is essential to enable question answering over knowledge bases. Although there are various efforts to improve relation linking performance, the current state-of-the-art methods do not achieve optimal results, therefore, negatively impacting the overall end-to-end question answering performance. In this work, we propose a novel approach for relation linking framing it as a generati… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted at the 20th International Semantic Web Conference (ISWC 2021)

  13. arXiv:2104.08610  [pdf, other

    cs.AI cs.CL

    Zero-shot Slot Filling with DPR and RAG

    Authors: Michael Glass, Gaetano Rossiello, Alfio Gliozzo

    Abstract: The ability to automatically extract Knowledge Graphs (KG) from a given collection of documents is a long-standing problem in Artificial Intelligence. One way to assess this capability is through the task of slot filling. Given an entity query in form of [Entity, Slot, ?], a system is asked to `fill' the slot by generating or extracting the missing value from a relevant passage or passages. This c… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  14. arXiv:2012.04780  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Open Knowledge Graphs Canonicalization using Variational Autoencoders

    Authors: Sarthak Dash, Gaetano Rossiello, Nandana Mihindukulasooriya, Sugato Bagchi, Alfio Gliozzo

    Abstract: Noun phrases and Relation phrases in open knowledge graphs are not canonicalized, leading to an explosion of redundant and ambiguous subject-relation-object triples. Existing approaches to solve this problem take a two-step approach. First, they generate embedding representations for both noun and relation phrases, then a clustering algorithm is used to group them using the embeddings as features.… ▽ More

    Submitted 27 September, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to EMNLP 2021

  15. arXiv:2012.01707  [pdf, other

    cs.CL cs.AI

    Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

    Authors: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy , et al. (5 additional authors not shown)

    Abstract: Knowledge base question answering (KBQA)is an important task in Natural Language Processing. Existing approaches face significant challenges including complex question understanding, necessity for reasoning, and lack of large end-to-end training datasets. In this work, we propose Neuro-Symbolic Question Answering (NSQA), a modular KBQA system, that leverages (1) Abstract Meaning Representation (AM… ▽ More

    Submitted 2 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to Findings of ACL

  16. Leveraging Semantic Parsing for Relation Linking over Knowledge Bases

    Authors: Nandana Mihindukulasooriya, Gaetano Rossiello, Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Mo Yu, Alfio Gliozzo, Salim Roukos, Alexander Gray

    Abstract: Knowledgebase question answering systems are heavily dependent on relation extraction and linking modules. However, the task of extracting and linking relations from text to knowledgebases faces two primary challenges; the ambiguity of natural language and lack of training data. To overcome these challenges, we present SLING, a relation linking framework which leverages semantic parsing using Abst… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: Accepted at the 19th International Semantic Web Conference (ISWC 2020)

    MSC Class: 68T35 ACM Class: I.2.7; I.2.4

  17. arXiv:2006.12641  [pdf, ps, other

    cs.CL cs.LG cs.PL

    Exploring Software Naturalness through Neural Language Models

    Authors: Luca Buratti, Saurabh Pujar, Mihaela Bornea, Scott McCarley, Yunhui Zheng, Gaetano Rossiello, Alessandro Morari, Jim Laredo, Veronika Thost, Yufan Zhuang, Giacomo Domeniconi

    Abstract: The Software Naturalness hypothesis argues that programming languages can be understood through the same techniques used in natural language processing. We explore this hypothesis through the use of a pre-trained transformer-based language model to perform code analysis tasks. Present approaches to code analysis depend heavily on features derived from the Abstract Syntax Tree (AST) while our trans… ▽ More

    Submitted 24 June, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  18. Knowledge Graph Embeddings and Explainable AI

    Authors: Federico Bianchi, Gaetano Rossiello, Luca Costabello, Matteo Palmonari, Pasquale Minervini

    Abstract: Knowledge graph embeddings are now a widely adopted approach to knowledge representation in which entities and relationships are embedded in vector spaces. In this chapter, we introduce the reader to the concept of knowledge graph embeddings by explaining what they are, how they can be generated and how they can be evaluated. We summarize the state-of-the-art in this field by describing the approa… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Federico Bianchi, Gaetano Rossiello, Luca Costabello, Matteo Plamonari, Pasquale Minervini, Knowledge Graph Embeddings and Explainable AI. In: Ilaria Tiddi, Freddy Lecue, Pascal Hitzler (eds.), Knowledge Graphs for eXplainable AI -- Foundations, Applications and Challenges. Studies on the Semantic Web, IOS Press, Amsterdam, 2020

  19. arXiv:1702.02367  [pdf, ps, other

    cs.CL

    Iterative Multi-document Neural Attention for Multiple Answer Prediction

    Authors: Claudio Greco, Alessandro Suglia, Pierpaolo Basile, Gaetano Rossiello, Giovanni Semeraro

    Abstract: People have information needs of varying complexity, which can be solved by an intelligent agent able to answer questions formulated in a proper way, eventually considering user context and preferences. In a scenario in which the user profile can be considered as a question, intelligent agents able to answer questions can be used to find the most relevant answers for a given user. In this work we… ▽ More

    Submitted 8 February, 2017; originally announced February 2017.

    Comments: Paper accepted and presented at the Deep Understanding and Reasoning: A challenge for Next-generation Intelligent Agents (URANIA) workshop, held in the context of the AI*IA 2016 conference