A framework for human evaluation of large language models in healthcare derived from literature review.
Tam TYC, Sivarajkumar S, Kapoor S, Stolyar AV, Polanska K, McCarthy KR, Osterhoudt H, Wu X, Visweswaran S, Fu S, Mathur P, Cacciamani GE, Sun C, Peng Y, Wang Y.
NPJ Digit Med. 2024 Sep 28;7(1):258. doi: 10.1038/s41746-024-01258-7.
PMID: 39333376
Free PMC article.