Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Diau, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00936  [pdf, other

    cs.CL

    A Survey of Useful LLM Evaluation

    Authors: Ji-Lun Peng, Sijia Cheng, Egil Diau, Yung-Yu Shih, Po-Heng Chen, Yen-Ting Lin, Yun-Nung Chen

    Abstract: LLMs have gotten attention across various research domains due to their exceptional performance on a wide range of complex tasks. Therefore, refined methods to evaluate the capabilities of LLMs are needed to determine the tasks and responsibility they should undertake. Our study mainly discussed how LLMs, as useful tools, should be effectively assessed. We proposed the two-stage framework: from ``… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.