covid19census: U.S. and Italy COVID-19 metrics and other epidemiological data

Database (Oxford). 2021 May 15:2021:baab027. doi: 10.1093/database/baab027.

Abstract

Since the beginning of the coronavirus disease-2019 (COVID-19) pandemic in 2020, there has been a tremendous accumulation of data capturing different statistics including the number of tests, confirmed cases and deaths. This data wealth offers a great opportunity for researchers to model the effect of certain variables on COVID-19 morbidity and mortality and to get a better understanding of the disease at the epidemiological level. However, in order to draw any reliable and unbiased estimate, models also need to take into account other variables and metrics available from a plurality of official and unofficial heterogenous resources. In this study, we introduce covid19census, an R package that extracts from many different repositories and combines together COVID-19 metrics and other demographic, environment- and health-related variables of the USA and Italy at the county and regional levels, respectively. The package is equipped with a number of user-friendly functions that dynamically extract the data over different timepoints and contains a detailed description of the included variables. To demonstrate the utility of this tool, we used it to extract and combine different county-level data from the USA, which we subsequently used to model the effect of diabetes on COVID-19 mortality at the county level, taking into account other variables that may influence such effects. In conclusion, it was observed that the 'covid19census' package allows to easily extract area-level data from both the USA and Italy using few functions. These comprehensive data can be used to provide reliable estimates of the effect of certain variables on COVID-19 outcomes. Database URL: https://github.com/c1au6i0/covid19census.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • COVID-19 / epidemiology*
  • Comorbidity
  • Databases, Factual*
  • Datasets as Topic*
  • Demography
  • Diabetes Mellitus / mortality
  • Health Surveys
  • Humans
  • Italy / epidemiology
  • Models, Theoretical
  • Pandemics*
  • SARS-CoV-2*
  • Software
  • United States / epidemiology