Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Hellinger, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.10920  [pdf, other

    cs.CL cs.AI

    NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain

    Authors: Anurag Acharya, Sai Munikoti, Aaron Hellinger, Sara Smith, Sridevi Wagle, Sameera Horawalavithana

    Abstract: As LLMs have become increasingly popular, they have been used in almost every field. But as the application for LLMs expands from generic fields to narrow, focused science domains, there exists an ever-increasing gap in ways to evaluate their efficacy in those fields. For the benchmarks that do exist, a lot of them focus on questions that don't require proper understanding of the subject in questi… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages

    ACM Class: I.2.7