Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Lesci, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04327  [pdf, other

    cs.LG

    Causal Estimation of Memorisation Profiles

    Authors: Pietro Lesci, Clara Meister, Thomas Hofmann, Andreas Vlachos, Tiago Pimentel

    Abstract: Understanding memorisation in language models has practical and societal implications, e.g., studying models' training dynamics or preventing copyright infringements. Prior work defines memorisation as the causal effect of training with an instance on the model's ability to predict that instance. This definition relies on a counterfactual: the ability to observe what would have happened had the mo… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Published at the ACL 2024 Conference (main)

  2. arXiv:2404.05623  [pdf, other

    cs.LG cs.CL

    AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets

    Authors: Pietro Lesci, Andreas Vlachos

    Abstract: Active learning for imbalanced classification tasks is challenging as the minority classes naturally occur rarely. Gathering a large pool of unlabelled data is thus essential to capture minority instances. Standard pool-based active learning is computationally expensive on large pools and often reaches low accuracy by overfitting the initial decision boundary, thus failing to explore the input spa… ▽ More

    Submitted 24 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Published at the NAACL 2024 Conference (main)

  3. arXiv:2305.17020  [pdf, other

    cs.CL cs.LG

    Diable: Efficient Dialogue State Tracking as Operations on Tables

    Authors: Pietro Lesci, Yoshinari Fujinuma, Momchil Hardalov, Chao Shang, Yassine Benajiba, Lluis Marquez

    Abstract: Sequence-to-sequence state-of-the-art systems for dialogue state tracking (DST) use the full dialogue history as input, represent the current state as a list with all the slots, and generate the entire state from scratch at each dialogue turn. This approach is inefficient, especially when the number of slots is large and the conversation is long. We propose Diable, a new task formalisation that si… ▽ More

    Submitted 1 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 (Findings)