
Showing 1–12 of 12 results for author: Lindemann, M

  1. arXiv:2407.04543  [pdf, other]

    cs.CL

    Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced structural inductive biases for seq2seq tasks, especially those involving syntactic transformations, such as converting active to passive voice or seman…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2310.13561  [pdf, other]

    cs.CL cs.LG

    Cache & Distil: Optimising API Calls to Large Language Models

    Authors: Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov

    Abstract: Large-scale deployment of generative AI tools often depends on costly API calls to a Large Language Model (LLM) to fulfil user queries. To curtail the frequency of these calls, one can employ a smaller language model -- a student -- which is continuously trained on the responses of the LLM. This student gradually gains proficiency in independently handling an increasing number of user requests, a…

    Submitted 20 October, 2023; originally announced October 2023.

  3. arXiv:2310.00796  [pdf, other]

    cs.CL

    SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Strong inductive biases enable learning from little data and help generalization outside of the training distribution. Popular neural architectures such as Transformers lack strong structural inductive biases for seq2seq NLP tasks on their own. Consequently, they struggle with systematic generalization beyond the training distribution, e.g. with extrapolating to longer inputs, even when pre-traine…

    Submitted 10 July, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: ACL 2024 camera-ready

  4. arXiv:2305.16954  [pdf, other]

    cs.CL

    Compositional Generalization without Trees using Multiset Tagging and Latent Permutations

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation. We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  5. arXiv:2210.03183  [pdf, other]

    cs.CL

    Compositional Generalisation with Structured Reordering and Fertility Layers

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in…

    Submitted 15 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: EACL 2023 camera-ready

    ACM Class: I.2.7

  6. arXiv:2202.10195  [pdf, other]

    cs.DS

    Efficient computation of oriented vertex and arc colorings of special digraphs

    Authors: Frank Gurski, Dominique Komander, Marvin Lindemann

    Abstract: In this paper we study the oriented vertex and arc coloring problem on edge series-parallel digraphs (esp-digraphs) which are related to the well known series-parallel graphs. Series-parallel graphs are graphs with two distinguished vertices called terminals, formed recursively by parallel and series composition. These graphs have applications in modeling series and parallel electric circuits and…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: 21 pages, 8 figures. arXiv admin note: text overlap with arXiv:2012.13764

  7. arXiv:2103.09171  [pdf, other]

    cs.LG eess.SP

    Interpretable Deep Learning for the Remote Characterisation of Ambulation in Multiple Sclerosis using Smartphones

    Authors: Andrew P. Creagh, Florian Lipsmeier, Michael Lindemann, Maarten De Vos

    Abstract: The emergence of digital technologies such as smartphones in healthcare applications have demonstrated the possibility of developing rich, continuous, and objective measures of multiple sclerosis (MS) disability that can be administered remotely and out-of-clinic. In this work, deep convolutional neural networks (DCNN) applied to smartphone inertial sensor data were shown to better distinguish hea…

    Submitted 22 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  8. arXiv:2012.13764  [pdf, other]

    math.CO cs.DS

    Efficient computation of the oriented chromatic number of recursively defined digraphs

    Authors: Frank Gurski, Dominique Komander, Marvin Lindemann

    Abstract: In this paper we consider colorings of oriented graphs, i.e. digraphs without cycles of length 2. Given some oriented graph $G=(V,E)$, an oriented $r$-coloring for $G$ is a partition of the vertex set $V$ into $r$ independent sets, such that all the arcs between two of these sets have the same direction. The oriented chromatic number of $G$ is the smallest integer $r$ such that $G$ permits an orie…

    Submitted 12 March, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: 25 pages. arXiv admin note: text overlap with arXiv:2006.13911

  9. arXiv:2009.07365  [pdf, other]

    cs.CL

    Fast semantic parsing with well-typedness guarantees

    Authors: Matthias Lindemann, Jonas Groschwitz, Alexander Koller

    Abstract: AM dependency parsing is a linguistically principled method for neural semantic parsing with high accuracy across multiple graphbanks. It relies on a type system that models semantic valency but makes existing parsers slow. We describe an A* parser and a transition-based parser for AM dependency parsing which guarantee well-typedness and improve parsing speed by up to 3 orders of magnitude, while…

    Submitted 6 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: Accepted at EMNLP 2020, camera-ready version

  10. arXiv:2004.14236  [pdf, other]

    cs.CL

    Normalizing Compositional Structures Across Graphbanks

    Authors: Lucia Donatelli, Jonas Groschwitz, Alexander Koller, Matthias Lindemann, Pia Weißenhorn

    Abstract: The emergence of a variety of graph-based meaning representations (MRs) has sparked an important conversation about how to adequately represent semantic structure. These MRs exhibit structural differences that reflect different theoretical and design considerations, presenting challenges to uniform linguistic analysis and cross-framework semantic parsing. Here, we ask the question of which design…

    Submitted 30 April, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 16 pages, 6 figures

  11. arXiv:1906.11746  [pdf, other]

    cs.CL

    Compositional Semantic Parsing Across Graphbanks

    Authors: Matthias Lindemann, Jonas Groschwitz, Alexander Koller

    Abstract: Most semantic parsers that map sentences to graph-based meaning representations are hand-designed for specific graphbanks. We present a compositional neural semantic parser which achieves, for the first time, competitive accuracies across a diverse range of graphbanks. Incorporating BERT embeddings and multi-task learning improves the accuracy further, setting new states of the art on DM, PAS, PSD…

    Submitted 13 July, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: Accepted at ACL 2019

  12. AMR Dependency Parsing with a Typed Semantic Algebra

    Authors: Jonas Groschwitz, Matthias Lindemann, Meaghan Fowlie, Mark Johnson, Alexander Koller

    Abstract: We present a semantic parser for Abstract Meaning Representations which learns to parse strings into tree representations of the compositional structure of an AMR graph. This allows us to use standard neural techniques for supertagging and dependency tree parsing, constrained by a linguistically principled type system. We present two approximative decoding algorithms, which achieve state-of-the-ar…

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: This paper will be presented at ACL 2018 (see https://acl2018.org/programme/papers/)

    Journal ref: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018