Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Langedijk, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.11282  [pdf, other

    cs.CL

    ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation

    Authors: Jaap Jumelet, Michael Hanna, Marianne de Heer Kloots, Anna Langedijk, Charlotte Pouw, Oskar van der Wal

    Abstract: We present the submission of the ILLC at the University of Amsterdam to the BabyLM challenge (Warstadt et al., 2023), in the strict-small track. Our final model, ChapGTP, is a masked language model that was trained for 200 epochs, aided by a novel data augmentation technique called Automatic Task Formation. We discuss in detail the performance of this model on the three evaluation suites: BLiMP, (… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Part of the BabyLM challenge at CoNLL

  2. arXiv:2310.03686  [pdf, other

    cs.CL

    DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

    Authors: Anna Langedijk, Hosein Mohebbi, Gabriele Sarti, Willem Zuidema, Jaap Jumelet

    Abstract: In recent years, many interpretability methods have been proposed to help interpret the internal states of Transformer-models, at different levels of precision and complexity. Here, to analyze encoder-decoder Transformers, we propose a simple, new method: DecoderLens. Inspired by the LogitLens (for decoder-only Transformers), this method involves allowing the decoder to cross-attend representation… ▽ More

    Submitted 3 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of NAACL 2024

  3. arXiv:2104.04736  [pdf, other

    cs.CL cs.AI

    Meta-Learning for Fast Cross-Lingual Adaptation in Dependency Parsing

    Authors: Anna Langedijk, Verna Dankers, Phillip Lippe, Sander Bos, Bryan Cardenas Guevara, Helen Yannakoudakis, Ekaterina Shutova

    Abstract: Meta-learning, or learning to learn, is a technique that can help to overcome resource scarcity in cross-lingual NLP problems, by enabling fast adaptation to new tasks. We apply model-agnostic meta-learning (MAML) to the task of cross-lingual dependency parsing. We train our model on a diverse set of languages to learn a parameter initialization that can adapt quickly to new languages. We find tha… ▽ More

    Submitted 23 March, 2022; v1 submitted 10 April, 2021; originally announced April 2021.

    Comments: - Add additional results (Appendix D) - Cosmetic updates for camera-ready version ACL 2022

  4. arXiv:2103.14679  [pdf

    cs.CR cs.CY

    Secure Platform for Processing Sensitive Data on Shared HPC Systems

    Authors: Michel Scheerman, Narges Zarrabi, Martijn Kruiten, Maxime Mogé, Lykle Voort, Annette Langedijk, Ruurd Schoonhoven, Tom Emery

    Abstract: High performance computing clusters operating in shared and batch mode pose challenges for processing sensitive data. In the meantime, the need for secure processing of sensitive data on HPC system is growing. In this work we present a novel method for creating secure computing environments on traditional multi-tenant high-performance computing clusters. Our platform as a service provides a custom… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.