Optimal Operation of Cryogenic Calorimeters Through Deep Reinforcement Learning

Comput Softw Big Sci. 2024;8(1):10. doi: 10.1007/s41781-024-00119-y. Epub 2024 May 22.

Abstract

Cryogenic phonon detectors with transition-edge sensors achieve the best sensitivity to sub-GeV/c 2 dark matter interactions with nuclei in current direct detection experiments. In such devices, the temperature of the thermometer and the bias current in its readout circuit need careful optimization to achieve optimal detector performance. This task is not trivial and is typically done manually by an expert. In our work, we automated the procedure with reinforcement learning in two settings. First, we trained on a simulation of the response of three Cryogenic Rare Event Search with Superconducting Thermometers (CRESST) detectors used as a virtual reinforcement learning environment. Second, we trained live on the same detectors operated in the CRESST underground setup. In both cases, we were able to optimize a standard detector as fast and with comparable results as human experts. Our method enables the tuning of large-scale cryogenic detector setups with minimal manual interventions.

Keywords: Cryogenic calorimeter; Dark matter; Reinforcement learning; Transition-edge sensor.