Development of a High-Resolution Indoor Radon Map Using a New Machine Learning-Based Probabilistic Model and German Radon Survey Data

Environ Health Perspect. 2024 Sep;132(9):97009. doi: 10.1289/EHP14171. Epub 2024 Sep 18.

Abstract

Background: Radon is a carcinogenic, radioactive gas that can accumulate indoors and is undetected by human senses. Therefore, accurate knowledge of indoor radon concentration is crucial for assessing radon-related health effects or identifying radon-prone areas.

Objectives: Indoor radon concentration at the national scale is usually estimated on the basis of extensive measurement campaigns. However, characteristics of the sampled households often differ from the characteristics of the target population owing to the large number of relevant factors that control the indoor radon concentration, such as the availability of geogenic radon or floor level. Furthermore, the sample size usually does not allow estimation with high spatial resolution. We propose a model-based approach that allows a more realistic estimation of indoor radon distribution with a higher spatial resolution than a purely data-based approach.

Methods: A multistage modeling approach was used by applying a quantile regression forest that uses environmental and building data as predictors to estimate the probability distribution function of indoor radon for each floor level of each residential building in Germany. Based on the estimated probability distribution function, a probabilistic Monte Carlo sampling technique was applied, enabling the combination and population weighting of floor-level predictions. In this way, the uncertainty of the individual predictions is effectively propagated into the estimate of variability at the aggregated level.

Results: The results show an approximate lognormal distribution of indoor radon in dwellings in Germany with an arithmetic mean of 63 Bq/m3, a geometric mean of 41 Bq/m3, and a 95th percentile of 180 Bq/m3. The exceedance probabilities for 100 and 300 Bq/m3 are 12.5% (10.5 million people affected) and 2.2% (1.9 million people affected), respectively. In large cities, individual indoor radon concentration is generally estimated to be lower than in rural areas, which is due to the different distribution of the population on floor levels.

Discussion: The advantages of our approach are that is yields a) an accurate estimation of indoor radon concentration even if the survey is not fully representative with respect to floor level and radon concentration in soil, and b) an estimate of the indoor radon distribution with a much higher spatial resolution than basic descriptive statistics. https://doi.org/10.1289/EHP14171.

MeSH terms

  • Air Pollutants, Radioactive* / analysis
  • Air Pollution, Indoor* / analysis
  • Air Pollution, Indoor* / statistics & numerical data
  • Deutschland
  • Housing*
  • Humans
  • Machine Learning*
  • Models, Statistical
  • Radiation Monitoring / methods
  • Radon* / analysis

Substances

  • Radon
  • Air Pollutants, Radioactive