Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Bhuiyan, M A H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.18486  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets

    Authors: Md Tahmid Rahman Laskar, M Saiful Bari, Mizanur Rahman, Md Amran Hossen Bhuiyan, Shafiq Joty, Jimmy Xiangji Huang

    Abstract: The development of large language models (LLMs) such as ChatGPT has brought a lot of attention recently. However, their evaluation in the benchmark academic datasets remains under-explored due to the difficulty of evaluating the generative outputs produced by this model against the ground truth. In this paper, we aim to present a thorough evaluation of ChatGPT's performance on diverse academic dat… ▽ More

    Submitted 5 July, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023 Findings. The first three authors contributed equally