Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Papakostas, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.06910  [pdf, other

    cs.IR

    InRanker: Distilled Rankers for Zero-shot Information Retrieval

    Authors: Thiago Laitz, Konstantinos Papakostas, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Despite multi-billion parameter neural rankers being common components of state-of-the-art information retrieval pipelines, they are rarely used in production due to the enormous amount of compute required for inference. In this work, we propose a new method for distilling large rankers into their smaller versions focusing on out-of-domain effectiveness. We introduce InRanker, a version of monoT5… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  2. arXiv:2310.18696  [pdf, other

    cs.CL cs.AI cs.LG

    Probing LLMs for Joint Encoding of Linguistic Categories

    Authors: Giulio Starace, Konstantinos Papakostas, Rochelle Choenni, Apostolos Panagiotopoulos, Matteo Rosati, Alina Leidinger, Ekaterina Shutova

    Abstract: Large Language Models (LLMs) exhibit impressive performance on a range of NLP tasks, due to the general-purpose linguistic knowledge acquired during pretraining. Existing model interpretability research (Tenney et al., 2019) suggests that a linguistic hierarchy emerges in the LLM layers, with lower layers better suited to solving syntactic tasks and higher layers employed for semantic processing.… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP Findings 2023

  3. arXiv:2305.12483  [pdf, other

    cs.CL cs.AI cs.LG

    Model Analysis & Evaluation for Ambiguous Question Answering

    Authors: Konstantinos Papakostas, Irene Papadopoulou

    Abstract: Ambiguous questions are a challenge for Question Answering models, as they require answers that cover multiple interpretations of the original query. To this end, these models are required to generate long-form answers that often combine conflicting pieces of information. Although recent advances in the field have shown strong capabilities in generating fluent responses, certain research questions… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted in the Findings of ACL 2023

  4. arXiv:2304.06391  [pdf, other

    cs.CV cs.AI cs.LG

    VISION DIFFMASK: Faithful Interpretation of Vision Transformers with Differentiable Patch Masking

    Authors: Angelos Nalmpantis, Apostolos Panagiotopoulos, John Gkountouras, Konstantinos Papakostas, Wilker Aziz

    Abstract: The lack of interpretability of the Vision Transformer may hinder its use in critical real-world applications despite its effectiveness. To overcome this issue, we propose a post-hoc interpretability method called VISION DIFFMASK, which uses the activations of the model's hidden layers to predict the relevant parts of the input that contribute to its final predictions. Our approach uses a gating m… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted in the XAI4CV Workshop at CVPR 2023