Estimating the time-varying effective reproduction number via Cycle Threshold-based Transformer

PLoS Comput Biol. 2024 Dec 23;20(12):e1012694. doi: 10.1371/journal.pcbi.1012694. eCollection 2024 Dec.

Abstract

Monitoring the spread of infectious disease is essential to design and adjust the interventions timely for the prevention of the epidemic outbreak and safeguarding the public health. The governments have generally adopted the incidence-based statistical method to estimate the time-varying effective reproduction number Rt and evaluate the transmission ability of epidemics. However, this method exhibits biases arising from the reported incidence data and assumes the generation interval distribution which is not available at the early stage of epidemic. Recent studies showed that the viral loads characterized by cycle threshold (Ct) of the infected populations evolving throughout the course of epidemic and providing a possibility to infer the epidemic trajectory. In this work, we propose the Cycle Threshold-based Transformer (Ct-Transformer) to estimate Rt. We find the supervised learning of Ct-Transformer outperforms the traditional incidence-based statistic and Ct-based Rt estimating methods, and more importantly Ct-Transformer is robust to the detection resources. Further, we apply the proposed model to self-supervised pre-training tasks and obtain excellent fine-tuned performance, which attains comparable performance with the supervised Ct-Transformer, verified by both the synthetic and real-world datasets. We demonstrate that the Ct-based deep learning method can improve the real-time estimates of Rt, enabling more easily adapted to the track of the newly emerged epidemic.

MeSH terms

  • Algorithms
  • Basic Reproduction Number* / statistics & numerical data
  • COVID-19 / epidemiology
  • COVID-19 / prevention & control
  • COVID-19 / transmission
  • Communicable Diseases / epidemiology
  • Communicable Diseases / transmission
  • Computational Biology* / methods
  • Deep Learning
  • Disease Outbreaks / prevention & control
  • Disease Outbreaks / statistics & numerical data
  • Epidemics / prevention & control
  • Epidemics / statistics & numerical data
  • Humans
  • Incidence
  • Supervised Machine Learning
  • Viral Load / statistics & numerical data

Grants and funding

Q. H. L recevies the funding from the National Natural Science Foundation of China (url: https://www.nsfc.gov.cn/) with grant number 62373264, and the Major Program of National Fund of Philosophy and Social Science of China (url: http://www.nopss.gov.cn/GB/index.html) with grant number 20&ZD112. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.