Integration and validation of host transcript signatures, including a novel 3-transcript tuberculosis signature, to enable one-step multiclass diagnosis of childhood febrile disease

Samuel Channon-Wells; Dominic Habgood-Coote; Ortensia Vito; Rachel Galassini; Victoria J Wright; Andrew J Brent; Robert S Heyderman; Suzanne T Anderson; Brian Eley; Federico Martinón-Torres; Michael Levin; Myrsini Kaforou; UK Kawasaki Disease Genetics, ILULU, GENDRES and EUCLIDS consortia; Jethro A Herberg

doi:10.1186/s12967-024-05241-4

Integration and validation of host transcript signatures, including a novel 3-transcript tuberculosis signature, to enable one-step multiclass diagnosis of childhood febrile disease

J Transl Med. 2024 Aug 29;22(1):802. doi: 10.1186/s12967-024-05241-4.

Authors

Samuel Channon-Wells^{1

2}, Dominic Habgood-Coote^{1

2}, Ortensia Vito^{1

2}, Rachel Galassini^{1

2}, Victoria J Wright^{1

2}, Andrew J Brent^{3

4}, Robert S Heyderman⁵, Suzanne T Anderson⁶, Brian Eley⁷, Federico Martinón-Torres^{8

9

10}, Michael Levin^{1

2}, Myrsini Kaforou^#^{1

2}; UK Kawasaki Disease Genetics, ILULU, GENDRES and EUCLIDS consortia; Jethro A Herberg^#^{11

12}

Affiliations

¹ Section of Paediatric Infectious Disease, Department of Infectious Disease, Imperial College London, London, UK.
² Centre for Paediatrics and Child Health, Imperial College London, London, UK.
³ Oxford University Hospitals NHS Foundation Trust, Headley Way, Headington, Oxford, UK.
⁴ Nuffield Department of Medicine, University of Oxford, Oxford, UK.
⁵ Research Department of Infection, Division of Infection and Immunity, University College London, London, UK.
⁶ MRC Clinical Trials Unit, University College London, London, UK.
⁷ Department of Paediatrics and Child Health, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.
⁸ Translational Pediatrics and Infectious Diseases, Department of Pediatrics, Hospital Clínico Universitario de Santiago de Compostela, Santiago de Compostela, Galicia, Spain.
⁹ Genetics, Vaccines, Infections and Pediatrics Research Group (GENVIP), Instituto de Investigación Santiaria de Santiago, Universidade de Santiago de Compostela, Santiago de Compostela, Spain.
¹⁰ Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBER-ES), Instituto de Salud Carlos III, Madrid, Spain.
¹¹ Section of Paediatric Infectious Disease, Department of Infectious Disease, Imperial College London, London, UK. [email protected].
¹² Centre for Paediatrics and Child Health, Imperial College London, London, UK. [email protected].

^# Contributed equally.

Abstract

Background: Whole blood host transcript signatures show great potential for diagnosis of infectious and inflammatory illness, with most published signatures performing binary classification tasks. Barriers to clinical implementation include validation studies, and development of strategies that enable simultaneous, multiclass diagnosis of febrile illness based on gene expression.

Methods: We validated five distinct diagnostic signatures for paediatric infectious diseases in parallel using a single NanoString nCounter® experiment. We included a novel 3-transcript signature for childhood tuberculosis, and four published signatures which differentiate bacterial infection, viral infection, or Kawasaki disease from other febrile illnesses. Signature performance was assessed using receiver operating characteristic curve statistics. We also explored conceptual frameworks for multiclass diagnostic signatures, including additional transcripts found to be significantly differentially expressed in previous studies. Relaxed, regularised logistic regression models were used to derive two novel multiclass signatures: a mixed One-vs-All model (MOVA), running multiple binomial models in parallel, and a full-multiclass model. In-sample performance of these models was compared using radar-plots and confusion matrix statistics.

Results: Samples from 91 children were included in the study: 23 bacterial infections (DB), 20 viral infections (DV), 14 Kawasaki disease (KD), 18 tuberculosis disease (TB), and 16 healthy controls. The five signatures tested demonstrated cross-platform performance similar to their primary discovery-validation cohorts. The signatures could differentiate: KD from other diseases with area under ROC curve (AUC) of 0.897 [95% confidence interval: 0.822-0.972]; DB from DV with AUC of 0.825 [0.691-0.959] (signature-1) and 0.867 [0.753-0.982] (signature-2); TB from other diseases with AUC of 0.882 [0.787-0.977] (novel signature); TB from healthy children with AUC of 0.910 [0.808-1.000]. Application of signatures outside of their designed context reduced performance. In-sample error rates for the multiclass models were 13.3% for the MOVA model and 0.0% for the full-multiclass model. The MOVA model misclassified DB cases most frequently (18.7%) and TB cases least (2.7%).

Conclusions: Our study demonstrates the feasibility of NanoString technology for cross-platform validation of multiple transcriptomic signatures in parallel. This external cohort validated performance of all five signatures, including a novel sparse TB signature. Two exploratory multi-class models showed high potential accuracy across four distinct diagnostic groups.

Keywords: Bacterial infection; Diagnostics; Gene expression; Kawasaki disease; Multiclass diagnostics; Tuberculosis; Viral infection.

Publication types

Validation Study

MeSH terms

Child
Child, Preschool
Female
Fever* / diagnosis
Fever* / microbiology
Gene Expression Profiling
Humans
Infant
Male
RNA, Messenger / blood
RNA, Messenger / genetics
RNA, Messenger / metabolism
ROC Curve
Reproducibility of Results
Transcriptome / genetics
Tuberculosis* / diagnosis
Tuberculosis* / genetics

Substances

RNA, Messenger

Abstract

Publication types

MeSH terms

Substances

Grants and funding