Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Hokamp, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12860  [pdf, other

    cs.CL cs.AI

    STAGE: Simplified Text-Attributed Graph Embeddings Using Pre-trained LLMs

    Authors: Aaron Zolnai-Lucas, Jack Boylan, Chris Hokamp, Parsa Ghaffari

    Abstract: We present Simplified Text-Attributed Graph Embeddings (STAGE), a straightforward yet effective method for enhancing node features in Graph Neural Network (GNN) models that encode Text-Attributed Graphs (TAGs). Our approach leverages Large-Language Models (LLMs) to generate embeddings for textual attributes. STAGE achieves competitive results on various node classification benchmarks while also ma… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2404.15923  [pdf, other

    cs.AI cs.CL

    KGValidator: A Framework for Automatic Validation of Knowledge Graph Construction

    Authors: Jack Boylan, Shashank Mangla, Dominic Thorn, Demian Gholipour Ghalandari, Parsa Ghaffari, Chris Hokamp

    Abstract: This study explores the use of Large Language Models (LLMs) for automatic evaluation of knowledge graph (KG) completion models. Historically, validating information in KGs has been a challenging task, requiring large-scale human annotation at prohibitive cost. With the emergence of general-purpose generative AI and LLMs, it is now plausible that human-in-the-loop validation could be replaced by a… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Text2KG 2024, ESWC 2024

  3. arXiv:2312.11399  [pdf, other

    cs.CL

    News Signals: An NLP Library for Text and Time Series

    Authors: Chris Hokamp, Demian Gholipour Ghalandari, Parsa Ghaffari

    Abstract: We present an open-source Python library for building and using datasets where inputs are clusters of textual data, and outputs are sequences of real values representing one or more time series signals. The news-signals library supports diverse data science and NLP problem settings related to the prediction of time series behaviour using textual data feeds. For example, in the news domain, inputs… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: EMNLP NLP-OSS Workshop, December 2023

  4. arXiv:2205.08221  [pdf, other

    cs.CL

    Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning

    Authors: Demian Gholipour Ghalandari, Chris Hokamp, Georgiana Ifrim

    Abstract: Sentence compression reduces the length of text by removing non-essential content while preserving important facts and grammaticality. Unsupervised objective driven methods for sentence compression can be used to create customized models without the need for ground-truth training data, while allowing flexibility in the objective function(s) that are used for learning and inference. Recent unsuperv… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  5. arXiv:2006.08748  [pdf, other

    cs.CL

    DynE: Dynamic Ensemble Decoding for Multi-Document Summarization

    Authors: Chris Hokamp, Demian Gholipour Ghalandari, Nghia The Pham, John Glover

    Abstract: Sequence-to-sequence (s2s) models are the basis for extensive work in natural language processing. However, some applications, such as multi-document summarization, multi-modal machine translation, and the automatic post-editing of machine translation, require mapping a set of multiple distinct inputs into a single output sequence. Recent work has introduced bespoke architectures for these multi-i… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  6. arXiv:2005.10070  [pdf, other

    cs.CL

    A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal

    Authors: Demian Gholipour Ghalandari, Chris Hokamp, Nghia The Pham, John Glover, Georgiana Ifrim

    Abstract: Multi-document summarization (MDS) aims to compress the content in large document collections into short summaries and has important applications in story clustering for newsfeeds, presentation of search results, and timeline generation. However, there is a lack of datasets that realistically address such use cases at a scale large enough for training supervised models for this task. This work pre… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: Camera-ready version for ACL 2020

  7. arXiv:1907.06214  [pdf, other

    cs.LG stat.ML

    Task Selection Policies for Multitask Learning

    Authors: John Glover, Chris Hokamp

    Abstract: One of the questions that arises when designing models that learn to solve multiple tasks simultaneously is how much of the available training budget should be devoted to each individual task. We refer to any formalized approach to addressing this problem (learned or otherwise) as a task selection policy. In this work we provide an empirical evaluation of the performance of some common task select… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

  8. arXiv:1906.09675  [pdf, other

    cs.CL

    Evaluating the Supervised and Zero-shot Performance of Multi-lingual Translation Models

    Authors: Chris Hokamp, John Glover, Demian Gholipour

    Abstract: We study several methods for full or partial sharing of the decoder parameters of multilingual NMT models. We evaluate both fully supervised and zero-shot translation performance in 110 unique translation directions using only the WMT 2019 shared task parallel datasets for training. We use additional test sets and re-purpose evaluation methods recently used for unsupervised MT in order to evaluate… ▽ More

    Submitted 23 June, 2019; originally announced June 2019.

  9. arXiv:1811.02278  [pdf, ps, other

    cs.CL

    Off-the-Shelf Unsupervised NMT

    Authors: Chris Hokamp, Sebastian Ruder, John Glover

    Abstract: We frame unsupervised machine translation (MT) in the context of multi-task learning (MTL), combining insights from both directions. We leverage off-the-shelf neural MT architectures to train unsupervised MT models with no parallel data and show that such models can achieve reasonably good performance, competitive with models purpose-built for unsupervised MT. Finally, we propose improvements that… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  10. arXiv:1805.07731  [pdf, ps, other

    cs.CL

    Generating High-Quality Surface Realizations Using Data Augmentation and Factored Sequence Models

    Authors: Henry Elder, Chris Hokamp

    Abstract: This work presents a new state of the art in reconstruction of surface realizations from obfuscated text. We identify the lack of sufficient training data as the major obstacle to training high-performing models, and solve this issue by generating large amounts of synthetic training data. We also propose preprocessing techniques which make the structure contained in the input features more accessi… ▽ More

    Submitted 20 May, 2018; originally announced May 2018.

  11. arXiv:1706.05083  [pdf, other

    cs.CL

    Ensembling Factored Neural Machine Translation Models for Automatic Post-Editing and Quality Estimation

    Authors: Chris Hokamp

    Abstract: This work presents a novel approach to Automatic Post-Editing (APE) and Word-Level Quality Estimation (QE) using ensembles of specialized Neural Machine Translation (NMT) systems. Word-level features that have proven effective for QE are included as input factors, expanding the representation of the original source and the machine translation hypothesis, which are used to generate an automatically… ▽ More

    Submitted 15 July, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

    Comments: APE/QE System Description Paper for WMT/CMT 2017

  12. arXiv:1704.07138  [pdf, other

    cs.CL

    Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search

    Authors: Chris Hokamp, Qun Liu

    Abstract: We present Grid Beam Search (GBS), an algorithm which extends beam search to allow the inclusion of pre-specified lexical constraints. The algorithm can be used with any model that generates a sequence $ \mathbf{\hat{y}} = \{y_{0}\ldots y_{T}\} $, by maximizing $ p(\mathbf{y} | \mathbf{x}) = \prod\limits_{t}p(y_{t} | \mathbf{x}; \{y_{0} \ldots y_{t-1}\}) $. Lexical constraints take the form of phr… ▽ More

    Submitted 2 May, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: Accepted as a long paper at ACL 2017