Sungnyun Kim is qualified to endorse.

Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition

Sungnyun Kim: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CV, cs.LG.

Kangwook Jang, Sangmin Bae, Hoirin Kim and Se-Young Yun are not registered as owners of this paper.