Crowd Counting Using Meta-Test-Time Adaptation

Chaoqun Ma; Ferrante Neri; Li Gu; Ziqiang Wang; Jian Wang; Anyong Qing; Yang Wang

doi:10.1142/S0129065724500618

Int J Neural Syst. 2024 Nov;34(11):2450061. doi: 10.1142/S0129065724500618.

Authors

Chaoqun Ma¹, Ferrante Neri², Li Gu³, Ziqiang Wang³, Jian Wang⁴, Anyong Qing¹, Yang Wang³

Affiliations

¹ School of Electrical Engineering, Southwest Jiaotong University, Chengdu 611756, P. R. China.
² NICE Group, School of Computer Science and Electronic Engineering, University of Surrey, Guildford, Surrey GU2 7XH, UK.
³ Department of Computer Science and Software Engineering, Concordia University, Montreal, QC H3H 2L9, Canada.
⁴ Faculty of Electric Power Engineering, Kunming University of Science and Technology, Kunming 650500, P. R. China.

PMID: 39252679
DOI: 10.1142/S0129065724500618

Abstract

Machine learning algorithms are commonly used to count people in a crowd quickly and efficiently. Test-time adaptation methods for crowd counting adjust model parameters and employ additional data augmentation to better adapt the model to the specific conditions encountered during testing. Most current studies concentrate on unsupervised domain adaptation. These approaches commonly require hundreds of training epochs and a sizable amount of unannotated data from every new target domain, in addition to annotated data from the source domain. Unlike these methods, we propose a meta-test-time adaptive crowd counting approach called CrowdTTA, which integrates test-time adaptation into the meta-learning framework and makes it easier for the counting model to adapt to unknown test distributions. To obtain a reliable supervision signal at the pixel level, we estimate uncertainty by inserting a dropout layer into the counting model. The uncertainty is then used to generate valuable pseudo labels, which serve as effective supervisory signals for adapting the model. In the context of meta-learning, one image can be regarded as one task for crowd counting. Each iteration of our approach is a dual-level optimization process. In the inner update, we optimize the model with a self-supervised consistency loss function, simulating the parameter update that occurs during the test phase. In the outer update, we actually update the parameters based on the image with ground truth, which improves the model's performance and makes the pseudo labels more accurate in the next iteration. At test time, the input image is first used to adapt the model, and the adapted model is then tested on that image. In comparison with various supervised learning and domain adaptation methods, extensive experiments on diverse datasets show that our approach adapts well across datasets with varying crowd densities and scales.
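The dropout-based pseudo-labelling and the single adaptation step described in the abstract can be illustrated with a toy sketch. This is not the CrowdTTA implementation: the abstract does not specify the network, the exact consistency loss, or how uncertainty is thresholded, so the linear per-pixel regressor, the masked squared-error loss, the median-variance cutoff, and every function name below are assumptions chosen only to make the idea concrete. The meta-learning outer update on ground-truth images is not shown.

```python
import numpy as np


def mc_dropout_predict(features, weights, rng, passes=20, drop_prob=0.3):
    """Run several stochastic forward passes with dropout left on.

    features: (num_pixels, dim) per-pixel features (hypothetical toy input)
    weights:  (dim,) linear density regressor, a stand-in for a counting network
    Returns the per-pixel mean and variance of the predicted density.
    """
    keep = 1.0 - drop_prob
    preds = []
    for _ in range(passes):
        mask = rng.random(features.shape) < keep
        # Inverted dropout: rescale kept features so the expectation is unchanged.
        preds.append((features * mask / keep) @ weights)
    preds = np.stack(preds)
    return preds.mean(axis=0), preds.var(axis=0)


def pseudo_labels(mean, var, quantile=0.5):
    """Use the mean prediction as a pseudo label, trusted only at low-variance pixels.

    The quantile cutoff is an assumed rule, not taken from the paper.
    """
    confident = var <= np.quantile(var, quantile)
    return mean, confident


def masked_loss(features, weights, target, confident):
    """Mean squared error against the pseudo labels on confident pixels only."""
    residual = features[confident] @ weights - target[confident]
    return float(np.mean(residual ** 2))


def adapt_step(features, weights, target, confident, lr=0.01):
    """One gradient-descent step on the masked loss (the assumed inner update)."""
    x = features[confident]
    residual = x @ weights - target[confident]
    grad = 2.0 * x.T @ residual / max(len(residual), 1)
    return weights - lr * grad


rng = np.random.default_rng(0)
features = rng.random((64, 8))        # toy 8x8 "image" with 8 features per pixel
weights = rng.normal(size=8) * 0.1    # toy pre-trained parameters

mean, var = mc_dropout_predict(features, weights, rng)
target, confident = pseudo_labels(mean, var)

loss_before = masked_loss(features, weights, target, confident)
adapted = adapt_step(features, weights, target, confident)
loss_after = masked_loss(features, adapted, target, confident)

# The crowd count is the sum of the predicted density map after adaptation.
count = float((features @ adapted).sum())
```

In this sketch the adapted parameters are used only for the image that produced the pseudo labels, mirroring the abstract's statement that the input image adapts the model before it is tested.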

Keywords: Crowd counting; dropout; meta-learning; pseudo labels; test-time adaptation.

MeSH terms

Algorithms
Crowding
Humans
Machine Learning*