Simulate Scientific Reasoning with Multiple Large Language Models: An Application to Alzheimer's Disease Combinatorial Therapy

Qidi Xu; Xiaozhong Liu; Xiaoqian Jiang; Yejin Kim

doi:10.1101/2024.12.10.24318800

Simulate Scientific Reasoning with Multiple Large Language Models: An Application to Alzheimer's Disease Combinatorial Therapy

medRxiv [Preprint]. 2024 Dec 12:2024.12.10.24318800. doi: 10.1101/2024.12.10.24318800.

Authors

Qidi Xu¹, Xiaozhong Liu², Xiaoqian Jiang¹, Yejin Kim¹

Affiliations

¹ McWilliams School of Biomedical Informatics, UTHealth Houston, Houston, TX, 77030.
² Computer Science and Data Science, Worcester Polytechnic Institute, Worcester, MA, 01609.

Abstract

Motivation: This study aims to develop an AI-driven framework that leverages large language models (LLMs) to simulate scientific reasoning and peer review to predict efficacious combinatorial therapy when data-driven prediction is infeasible.

Results: Our proposed framework achieved a significantly higher accuracy (0.74) than traditional knowledge-based prediction (0.52). An ablation study highlighted the importance of high quality few-shot examples, external knowledge integration, self-consistency, and review within the framework. The external validation with private experimental data yielded an accuracy of 0.82, further confirming the framework's ability to generate high-quality hypotheses in biological inference tasks. Our framework offers an automated knowledge-driven hypothesis generation approach when data-driven prediction is not a viable option.

Availability and implementation: Our source code and data are available at https://github.com/QidiXu96/Coated-LLM.

Publication types

Preprint

Abstract

Publication types

Grants and funding