Validation requirements for AI-based intervention-evaluation in aging and longevity research and practice

Georg Fuellen; Anton Kulaga; Sebastian Lobentanzer; Maximilian Unfried; Roberto A Avelar; Daniel Palmer; Brian K Kennedy

doi:10.1016/j.arr.2024.102617

Validation requirements for AI-based intervention-evaluation in aging and longevity research and practice

Ageing Res Rev. 2024 Dec 4:104:102617. doi: 10.1016/j.arr.2024.102617. Online ahead of print.

Authors

Georg Fuellen¹, Anton Kulaga², Sebastian Lobentanzer³, Maximilian Unfried⁴, Roberto A Avelar², Daniel Palmer², Brian K Kennedy⁵

Affiliations

¹ Institute for Biostatistics and Informatics in Medicine and Ageing Research, Rostock University Medical Center, Rostock, Germany; UCD Conway Institute of Biomolecular and Biomedical Research, School of Medicine, University College Dublin, Dublin, Ireland. Electronic address: [email protected].
² Institute for Biostatistics and Informatics in Medicine and Ageing Research, Rostock University Medical Center, Rostock, Germany.
³ Institute for Computational Biomedicine, Heidelberg University, Faculty of Medicine and Heidelberg University Hospital, Heidelberg, Germany; European Bioinformatics Institute, Hinxton, Cambridgeshire, UK.
⁴ Healthy Longevity Translational Research Program, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
⁵ Healthy Longevity Translational Research Program, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore; Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore. Electronic address: [email protected].

PMID: 39643211
DOI: 10.1016/j.arr.2024.102617

Abstract

The field of aging and longevity research is overwhelmed by vast amounts of data, calling for the use of Artificial Intelligence (AI), including Large Language Models (LLMs), for the evaluation of geroprotective interventions. Such evaluations should be correct, useful, comprehensive, explainable, and they should consider causality, interdisciplinarity, adherence to standards, longitudinal data and known aging biology. In particular, comprehensive analyses should go beyond comparing data based on canonical biomedical databases, suggesting the use of AI to interpret changes in biomarkers and outcomes. Our requirements motivate the use of LLMs with Knowledge Graphs and dedicated workflows employing, e.g., Retrieval-Augmented Generation. While naive trust in the responses of AI tools can cause harm, adding our requirements to LLM queries can improve response quality, calling for benchmarking efforts and justifying the informed use of LLMs for advice on longevity interventions.

Keywords: Large Language Models; Longevity; Preventive Medicine.

Publication types

Review