Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Fromm, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05740  [pdf, other

    cs.CL

    Do Multilingual Large Language Models Mitigate Stereotype Bias?

    Authors: Shangrui Nie, Michael Fromm, Charles Welch, Rebekka Görge, Akbar Karimi, Joan Plepi, Nazia Afsan Mowmita, Nicolas Flores-Herr, Mehdi Ali, Lucie Flek

    Abstract: While preliminary findings indicate that multilingual LLMs exhibit reduced bias compared to monolingual ones, a comprehensive understanding of the effect of multilingual training on bias mitigation, is lacking. This study addresses this gap by systematically training six LLMs of identical size (2.6B parameters) and architecture: five monolingual models (English, German, French, Italian, and Spanis… ▽ More

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 19 pages, 8 figures, C3NLP 2024

  2. arXiv:2402.13703  [pdf, other

    cs.CL

    Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?

    Authors: Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali

    Abstract: The adaption of multilingual pre-trained Large Language Models (LLMs) into eloquent and helpful assistants is essential to facilitate their use across different language regions. In that spirit, we are the first to conduct an extensive study of the performance of multilingual models on parallel, multi-turn instruction-tuning benchmarks across a selection of the most-spoken Indo-European languages.… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 22 pages, 7 figures

  3. arXiv:2312.01005  [pdf, other

    astro-ph.GA cs.LG eess.IV

    Generating Images of the M87* Black Hole Using GANs

    Authors: Arya Mohan, Pavlos Protopapas, Keerthi Kunnumkai, Cecilia Garraffo, Lindy Blackburn, Koushik Chatterjee, Sheperd S. Doeleman, Razieh Emami, Christian M. Fromm, Yosuke Mizuno, Angelo Ricarte

    Abstract: In this paper, we introduce a novel data augmentation methodology based on Conditional Progressive Generative Adversarial Networks (CPGAN) to generate diverse black hole (BH) images, accounting for variations in spin and electron temperature prescriptions. These generated images are valuable resources for training deep learning algorithms to accurately estimate black hole parameters from observati… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 11 pages, 7 figures. Accepted by Monthly Notices of the Royal Astronomical Society Journal

  4. arXiv:2310.08754  [pdf, other

    cs.LG

    Tokenizer Choice For LLM Training: Negligible or Crucial?

    Authors: Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr

    Abstract: The recent success of Large Language Models (LLMs) has been predominantly driven by curating the training dataset composition, scaling of model architectures and dataset sizes and advancements in pretraining objectives, leaving tokenizer influence as a blind spot. Shedding light on this underexplored area, we conduct a comprehensive study on the influence of tokenizer choice on LLM downstream perf… ▽ More

    Submitted 17 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  5. arXiv:2205.09803  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Towards a Holistic View on Argument Quality Prediction

    Authors: Michael Fromm, Max Berrendorf, Johanna Reiml, Isabelle Mayerhofer, Siddharth Bhargava, Evgeniy Faerman, Thomas Seidl

    Abstract: Argumentation is one of society's foundational pillars, and, sparked by advances in NLP and the vast availability of text data, automated mining of arguments receives increasing attention. A decisive property of arguments is their strength or quality. While there are works on the automated estimation of argument strength, their scope is narrow: they focus on isolated datasets and neglect the inter… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  6. arXiv:2111.11388  [pdf, other

    cs.CV cs.AI cs.LG

    Conifer Seedling Detection in UAV-Imagery with RGB-Depth Information

    Authors: Jason Jooste, Michael Fromm, Matthias Schubert

    Abstract: Monitoring of reforestation is currently being considerably streamlined through the use of drones and image recognition algorithms, which have already proven to be effective on colour imagery. In addition to colour imagery, elevation data is often also available. The primary aim of this work was to improve the performance of the faster-RCNN object detection algorithm by integrating this height inf… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  7. arXiv:2109.11319  [pdf, other

    cs.LG cs.AI cs.CL

    Active Learning for Argument Strength Estimation

    Authors: Nataliia Kees, Michael Fromm, Evgeniy Faerman, Thomas Seidl

    Abstract: High-quality arguments are an essential part of decision-making. Automatically predicting the quality of an argument is a complex task that recently got much attention in argument mining. However, the annotation effort for this task is exceptionally high. Therefore, we test uncertainty-based active learning (AL) methods on two popular argument-strength data sets to estimate whether sample-efficien… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

  8. arXiv:2012.07743  [pdf, other

    cs.CY cs.AI cs.CL cs.LG

    Argument Mining Driven Analysis of Peer-Reviews

    Authors: Michael Fromm, Evgeniy Faerman, Max Berrendorf, Siddharth Bhargava, Ruoxia Qi, Yao Zhang, Lukas Dennert, Sophia Selle, Yang Mao, Thomas Seidl

    Abstract: Peer reviewing is a central process in modern research and essential for ensuring high quality and reliability of published work. At the same time, it is a time-consuming process and increasing interest in emerging fields often results in a high review workload, especially for senior researchers in this area. How to cope with this problem is an open question and it is vividly discussed across all… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

  9. arXiv:2011.02177  [pdf, ps, other

    cs.IR cs.CL

    Diversity Aware Relevance Learning for Argument Search

    Authors: Michael Fromm, Max Berrendorf, Sandra Obermeier, Thomas Seidl, Evgeniy Faerman

    Abstract: In this work, we focus on the problem of retrieving relevant arguments for a query claim covering diverse aspects. State-of-the-art methods rely on explicit mappings between claims and premises, and thus are unable to utilize large available collections of premises without laborious and costly manual annotation. Their diversity approach relies on removing duplicates via clustering which does not d… ▽ More

    Submitted 17 March, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

  10. arXiv:2001.10883  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Unsupervised Anomaly Detection for X-Ray Images

    Authors: Diana Davletshina, Valentyn Melnychuk, Viet Tran, Hitansh Singla, Max Berrendorf, Evgeniy Faerman, Michael Fromm, Matthias Schubert

    Abstract: Obtaining labels for medical (image) data requires scarce and expensive experts. Moreover, due to ambiguous symptoms, single images rarely suffice to correctly diagnose a medical condition. Instead, it often requires to take additional background information such as the patient's medical history or test results into account. Hence, instead of focusing on uninterpretable black-box systems deliverin… ▽ More

    Submitted 4 November, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

  11. arXiv:1910.09313  [pdf, other

    cs.IR cs.DL cs.LG stat.ML

    Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

    Authors: Tobias Weber, Dieter Kranzlmüller, Michael Fromm, Nelson Tavares de Sousa

    Abstract: Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records, which is published alongside… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

  12. arXiv:1906.00923  [pdf, other

    cs.CL cs.LG stat.ML

    TACAM: Topic And Context Aware Argument Mining

    Authors: Michael Fromm, Evgeniy Faerman, Thomas Seidl

    Abstract: In this work we address the problem of argument search. The purpose of argument search is the distillation of pro and contra arguments for requested topics from large text corpora. In previous works, the usual approach is to use a standard search engine to extract text parts which are relevant to the given topic and subsequently use an argument recognition algorithm to select arguments from them.… ▽ More

    Submitted 26 August, 2019; v1 submitted 26 May, 2019; originally announced June 2019.