Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Kruschwitz, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13511  [pdf, other

    cs.CL

    Can Open-Source LLMs Compete with Commercial Models? Exploring the Few-Shot Performance of Current GPT Models in Biomedical Tasks

    Authors: Samy Ateia, Udo Kruschwitz

    Abstract: Commercial large language models (LLMs), like OpenAI's GPT-4 powering ChatGPT and Anthropic's Claude 3 Opus, have dominated natural language processing (NLP) benchmarks across different domains. New competing Open-Source alternatives like Mixtral 8x7B or Llama 3 have emerged and seem to be closing the gap while often offering higher throughput and being less costly to use. Open-Source LLMs can als… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Version as accepted at the BioASQ Lab at CLEF 2024

  2. arXiv:2404.08259  [pdf, ps, other

    cs.CL

    Investigating Neural Machine Translation for Low-Resource Languages: Using Bavarian as a Case Study

    Authors: Wan-Hua Her, Udo Kruschwitz

    Abstract: Machine Translation has made impressive progress in recent years offering close to human-level performance on many languages, but studies have primarily focused on high-resource languages with broad online presence and resources. With the help of growing Large Language Models, more and more low-resource languages achieve better results through the presence of other languages. However, studies have… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Preprint accepted at the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages (SIGUL 2024)

  3. arXiv:2402.18179  [pdf, other

    cs.CL

    Challenges in Pre-Training Graph Neural Networks for Context-Based Fake News Detection: An Evaluation of Current Strategies and Resource Limitations

    Authors: Gregor Donabauer, Udo Kruschwitz

    Abstract: Pre-training of neural networks has recently revolutionized the field of Natural Language Processing (NLP) and has before demonstrated its effectiveness in computer vision. At the same time, advances around the detection of fake news were mainly driven by the context-based paradigm, where different types of signals (e.g. from social media) form graph-like structures that hold contextual informatio… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Preprint accepted at LREC-COLING 2024

  4. arXiv:2306.16108  [pdf, other

    cs.CL

    Is ChatGPT a Biomedical Expert? -- Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks

    Authors: Samy Ateia, Udo Kruschwitz

    Abstract: We assessed the performance of commercial Large Language Models (LLMs) GPT-3.5-Turbo and GPT-4 on tasks from the 2023 BioASQ challenge. In Task 11b Phase B, which is focused on answer generation, both models demonstrated competitive abilities with leading systems. Remarkably, they achieved this with simple zero-shot learning, grounded with relevant snippets. Even without relevant snippets, their p… ▽ More

    Submitted 24 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Preprint accepted at the 11th BioASQ Workshop at the 14th Conference and Labs of the Evaluation Forum (CLEF) 2023; Changes: 1. Added related work and experimental setup sections. 2. Reworked discussion and future work section. 3. Fixed multiple typos and improved style. Changed license

  5. arXiv:2212.06560  [pdf, ps, other

    cs.CL cs.IR

    Exploring Fake News Detection with Heterogeneous Social Media Context Graphs

    Authors: Gregor Donabauer, Udo Kruschwitz

    Abstract: Fake news detection has become a research area that goes way beyond a purely academic interest as it has direct implications on our society as a whole. Recent advances have primarily focused on textbased approaches. However, it has become clear that to be effective one needs to incorporate additional, contextual information such as spreading behaviour of news articles and user interaction patterns… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Preprint accepted at the 45th European Conference on Information Retrieval (ECIR 2023)

  6. arXiv:2210.05581  [pdf, other

    cs.CL

    Aggregating Crowdsourced and Automatic Judgments to Scale Up a Corpus of Anaphoric Reference for Fiction and Wikipedia Texts

    Authors: Juntao Yu, Silviu Paun, Maris Camilleri, Paloma Carretero Garcia, Jon Chamberlain, Udo Kruschwitz, Massimo Poesio

    Abstract: Although several datasets annotated for anaphoric reference/coreference exist, even the largest such datasets have limitations in terms of size, range of domains, coverage of anaphoric phenomena, and size of documents included. Yet, the approaches proposed to scale up anaphoric annotation haven't so far resulted in datasets overcoming these limitations. In this paper, we introduce a new release of… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  7. arXiv:2204.02712  [pdf, other

    cs.CL cs.IR

    A New Dataset for Topic-Based Paragraph Classification in Genocide-Related Court Transcripts

    Authors: Miriam Schirmer, Udo Kruschwitz, Gregor Donabauer

    Abstract: Recent progress in natural language processing has been impressive in many different areas with transformer-based approaches setting new benchmarks for a wide range of applications. This development has also lowered the barriers for people outside the NLP community to tap into the tools and resources applied to a variety of domain-specific applications. The bottleneck however still remains the lac… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Preprint. Accepted to appear in Proceedings of LREC 2022

  8. arXiv:2204.01841  [pdf, other

    cs.CL cs.AI

    Applying Automatic Text Summarization for Fake News Detection

    Authors: Philipp Hartl, Udo Kruschwitz

    Abstract: The distribution of fake news is not a new but a rapidly growing problem. The shift to news consumption via social media has been one of the drivers for the spread of misleading and deliberately wrong information, as in addition to it of easy use there is rarely any veracity monitoring. Due to the harmful effects of such fake news on society, the detection of these has become increasingly importan… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Preprint. Accepted to appear in Proceedings of LREC 2022

  9. ur-iw-hnt at GermEval 2021: An Ensembling Strategy with Multiple BERT Models

    Authors: Hoai Nam Tran, Udo Kruschwitz

    Abstract: This paper describes our approach (ur-iw-hnt) for the Shared Task of GermEval2021 to identify toxic, engaging, and fact-claiming comments. We submitted three runs using an ensembling strategy by majority (hard) voting with multiple different BERT models of three different types: German-based, Twitter-based, and multilingual models. All ensemble models outperform single models, while BERTweet is th… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Comments: 5 pages, 1 figure

    Journal ref: In Proceedings of the GermEval 2021 Workshop on the Identification of Toxic, Engaging, and Fact-Claiming Comments: 17th Conference on Natural Language Processing KONVENS 2021, pages 83-87, Online (2021)

  10. arXiv:2106.13528  [pdf

    cs.IR cs.HC

    Interactive query expansion for professional search applications

    Authors: Tony Russell-Rose, Philip Gooch, Udo Kruschwitz

    Abstract: Knowledge workers (such as healthcare information professionals, patent agents and recruitment professionals) undertake work tasks where search forms a core part of their duties. In these instances, the search task is often complex and time-consuming and requires specialist expert knowledge to formulate accurate search strategies. Interactive features such as query expansion can play a key role in… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 34 pages, 5 figures

  11. arXiv:2102.04211  [pdf, other

    cs.CY cs.SI

    Challenging Social Media Threats using Collective Well-being Aware Recommendation Algorithms and an Educational Virtual Companion

    Authors: Dimitri Ognibene, Davide Taibi, Udo Kruschwitz, Rodrigo Souza Wilkens, Davinia Hernandez-Leo, Emily Theophilou, Lidia Scifo, Rene Alejandro Lobo, Francesco Lomonaco, Sabrina Eimler, H. Ulrich Hoppe, Nils Malzahn

    Abstract: Social media have become an integral part of our lives, expanding our interlinking capabilities to new levels. There is plenty to be said about their positive effects. On the other hand, however, some serious negative implications of social media have been repeatedly highlighted in recent years, pointing at various threats to society and its more vulnerable members, such as teenagers. We thus prop… ▽ More

    Submitted 17 October, 2022; v1 submitted 25 January, 2021; originally announced February 2021.

  12. arXiv:1905.04577  [pdf, other

    cs.IR cs.HC

    Information search in a professional context - exploring a collection of professional search tasks

    Authors: Suzan Verberne, Jiyin He, Gineke Wiggers, Tony Russell-Rose, Udo Kruschwitz, Arjen P. de Vries

    Abstract: Search conducted in a work context is an everyday activity that has been around since long before the Web was invented, yet we still seem to understand little about its general characteristics. With this paper we aim to contribute to a better understanding of this large but rather multi-faceted area of `professional search'. Unlike task-based studies that aim at measuring the effectiveness of sear… ▽ More

    Submitted 11 May, 2019; originally announced May 2019.

    Comments: 5 pages, 2 figures

  13. Personalised Query Suggestion for Intranet Search with Temporal User Profiling

    Authors: Thanh Vu, Alistair Willis, Udo Kruschwitz, Dawei Song

    Abstract: Recent research has shown the usefulness of using collective user interaction data (e.g., query logs) to recommend query modification suggestions for Intranet search. However, most of the query suggestion approaches for Intranet search follow an "one size fits all" strategy, whereby different users who submit an identical query would get the same query suggestion list. This is problematic, as even… ▽ More

    Submitted 8 January, 2017; originally announced January 2017.

    Comments: 4 pages, 2 figures, the 2017 ACM SIGIR Conference on Human Information Interaction & Retrieval (CHIIR)

  14. arXiv:1204.4071  [pdf, other

    cs.SI physics.soc-ph

    Motivations for Participation in Socially Networked Collective Intelligence Systems

    Authors: Jon Chamberlain, Udo Kruschwitz, Massimo Poesio

    Abstract: One of the most significant challenges facing systems of collective intelligence is how to encourage participation on the scale required to produce high quality data. This paper details ongoing work with Phrase Detectives, an online game-with-a-purpose deployed on Facebook, and investigates user motivations for participation in social network gaming where the wisdom of crowds produces useful data.

    Submitted 18 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991)

    Report number: CollectiveIntelligence/2012/50