Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Ateia, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13511  [pdf, other

    cs.CL

    Can Open-Source LLMs Compete with Commercial Models? Exploring the Few-Shot Performance of Current GPT Models in Biomedical Tasks

    Authors: Samy Ateia, Udo Kruschwitz

    Abstract: Commercial large language models (LLMs), like OpenAI's GPT-4 powering ChatGPT and Anthropic's Claude 3 Opus, have dominated natural language processing (NLP) benchmarks across different domains. New competing Open-Source alternatives like Mixtral 8x7B or Llama 3 have emerged and seem to be closing the gap while often offering higher throughput and being less costly to use. Open-Source LLMs can als… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Version as accepted at the BioASQ Lab at CLEF 2024

  2. arXiv:2306.16108  [pdf, other

    cs.CL

    Is ChatGPT a Biomedical Expert? -- Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks

    Authors: Samy Ateia, Udo Kruschwitz

    Abstract: We assessed the performance of commercial Large Language Models (LLMs) GPT-3.5-Turbo and GPT-4 on tasks from the 2023 BioASQ challenge. In Task 11b Phase B, which is focused on answer generation, both models demonstrated competitive abilities with leading systems. Remarkably, they achieved this with simple zero-shot learning, grounded with relevant snippets. Even without relevant snippets, their p… ▽ More

    Submitted 24 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Preprint accepted at the 11th BioASQ Workshop at the 14th Conference and Labs of the Evaluation Forum (CLEF) 2023; Changes: 1. Added related work and experimental setup sections. 2. Reworked discussion and future work section. 3. Fixed multiple typos and improved style. Changed license