Showing 1–11 of 11 results for author: Aly, R

Searching in archive cs.
  1. arXiv:2406.13124

    cs.CL

    Learning to Generate Answers with Citations via Factual Consistency Models

    Authors: Rami Aly, Zhiqiang Tang, Samson Tan, George Karypis

    Abstract: Large Language Models (LLMs) frequently hallucinate, impeding their reliability in mission-critical situations. One approach to address this issue is to provide citations to relevant sources alongside generated content, enhancing the verifiability of generations. However, citing passages accurately in answers remains a substantial challenge. This paper proposes a weakly-supervised fine-tuning meth…

    Submitted 15 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024. Code is available at https://github.com/amazon-science/learning-to-generate-answers-with-citations

  2. arXiv:2404.03818

    cs.CL

    PRobELM: Plausibility Ranking Evaluation for Language Models

    Authors: Zhangdie Yuan, Eric Chamoun, Rami Aly, Chenxi Whitehouse, Andreas Vlachos

    Abstract: This paper introduces PRobELM (Plausibility Ranking Evaluation for Language Models), a benchmark designed to assess language models' ability to discern more plausible from less plausible scenarios through their parametric knowledge. While benchmarks such as TruthfulQA emphasise factual accuracy or truthfulness, and others such as COPA explore plausible scenarios without explicitly incorporating wo…

    Submitted 7 August, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2310.14198

    cs.CL

    QA-NatVer: Question Answering for Natural Logic-based Fact Verification

    Authors: Rami Aly, Marek Strong, Andreas Vlachos

    Abstract: Fact verification systems assess a claim's veracity based on evidence. An important consideration in designing them is faithfulness, i.e. generating explanations that accurately reflect the reasoning of the model. Recent works have focused on natural logic, which operates directly on natural language by capturing the semantic relation of spans between an aligned claim with its evidence via set-the…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  4. arXiv:2305.12576

    cs.CL

    Automated Few-shot Classification with Instruction-Finetuned Language Models

    Authors: Rami Aly, Xingjian Shi, Kaixiang Lin, Aston Zhang, Andrew Gordon Wilson

    Abstract: A particularly successful class of approaches for few-shot learning combines language models with prompts -- hand-crafted task descriptions that complement data samples. However, designing prompts by hand for each task commonly requires domain knowledge and substantial guesswork. We observe, in the context of classification tasks, that instruction finetuned language models exhibit remarkable promp…

    Submitted 21 October, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Findings

  5. arXiv:2212.05276

    cs.CL

    Natural Logic-guided Autoregressive Multi-hop Document Retrieval for Fact Verification

    Authors: Rami Aly, Andreas Vlachos

    Abstract: A key component of fact verification is the evidence retrieval, often from multiple documents. Recent approaches use dense representations and condition the retrieval of each document on the previously retrieved ones. The latter step is performed over all the documents in the collection, requiring storing their dense representations in an index, thus incurring a high memory footprint. An alternati…

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: EMNLP 2022

  6. arXiv:2206.04449

    cs.CV

    Segmentation Enhanced Lameness Detection in Dairy Cows from RGB and Depth Video

    Authors: Eric Arazo, Robin Aly, Kevin McGuinness

    Abstract: Cow lameness is a severe condition that affects the life cycle and life quality of dairy cows and results in considerable economic losses. Early lameness detection helps farmers address illnesses early and avoid negative effects caused by the degeneration of cows' condition. We collected a dataset of short clips of cows passing through a hallway exiting a milking station and annotated the degree o…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted at the CV4Animals workshop in CVPR 2022

  7. arXiv:2106.05707

    cs.CL

    FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

    Authors: Rami Aly, Zhijiang Guo, Michael Schlichtkrull, James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Oana Cocarascu, Arpit Mittal

    Abstract: Fact verification has attracted a lot of attention in the machine learning and natural language processing communities, as it is one of the key methods for detecting misinformation. Existing large-scale benchmarks for this task have focused mostly on textual sources, i.e. unstructured information, and thus ignored the wealth of information available in structured formats, such as tables. In this p…

    Submitted 12 October, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021 Datasets and Benchmarks Track

  8. arXiv:1906.02002

    cs.CL

    Every child should have parents: a taxonomy refinement algorithm based on hyperbolic term embeddings

    Authors: Rami Aly, Shantanu Acharya, Alexander Ossa, Arne Köhn, Chris Biemann, Alexander Panchenko

    Abstract: We introduce the use of Poincaré embeddings to improve existing state-of-the-art approaches to domain-specific taxonomy induction from text as a signal for both relocating wrong hyponym terms within a (pre-induced) taxonomy as well as for attaching disconnected terms in a taxonomy. This method substantially improves previous state-of-the-art results on the SemEval-2016 Task 13 on taxonomy extracti…

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: 7 pages (5 + 2 pages references), 2 Figures, 3 Tables, Accepted to the ACL 2019 conference. Will appear in its proceedings

  9. arXiv:1611.03660

    cs.CY

    Using text mining and machine learning for detection of child abuse

    Authors: Chintan Amrit, Tim Paauw, Robin Aly, Miha Lavric

    Abstract: Abuse in any form is a grave threat to a child's health. Public health institutions in the Netherlands try to identify and prevent different kinds of abuse, and building a decision support system can help such institutions achieve this goal. Such decision support relies on the analysis of relevant child health data. A significant part of the medical data that the institutions have on children is u…

    Submitted 16 November, 2016; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: 31 pages, 7 figures and 12 tables

    ACM Class: H.4.2; I.2.7

  10. Predicting Relevance based on Assessor Disagreement: Analysis and Practical Applications for Search Evaluation

    Authors: Thomas Demeester, Robin Aly, Djoerd Hiemstra, Dong Nguyen, Chris Develder

    Abstract: Evaluation of search engines relies on assessments of search results for selected test queries, from which we would ideally like to draw conclusions in terms of relevance of the results for general (e.g., future, unknown) users. In practice however, most evaluation scenarios only allow us to conclusively determine the relevance towards the particular assessor that provided the judgments. A factor…

    Submitted 23 November, 2015; originally announced November 2015.

    Comments: Accepted for publication in Springer Information Retrieval Journal, special issue on Information Retrieval Evaluation using Test Collections

  11. arXiv:1312.1913

    cs.IR

    Adapting Binary Information Retrieval Evaluation Metrics for Segment-based Retrieval Tasks

    Authors: Robin Aly, Maria Eskevich, Roeland Ordelman, Gareth J. F. Jones

    Abstract: This report describes metrics for the evaluation of the effectiveness of segment-based retrieval based on existing binary information retrieval metrics. These metrics are described in the context of a task for the hyperlinking of video segments. This evaluation approach re-uses existing evaluation measures from the standard Cranfield evaluation paradigm. Our adaptation approach can in principle be…

    Submitted 6 December, 2013; originally announced December 2013.

    Comments: Explanation of evaluation measures for the linking task of the MediaEval Workshop 2013