Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition | IEEE Conference Publication | IEEE Xplore