Natural visual scenes are dominated by sequences of transforming images. Spatial visual information is thought to be processed through the detection of elemental stimulus features, which are then recomposed into scenes. How image information is integrated over time, however, remains unclear. We explored visual information encoding in the optic tectum. Unbiased stimulus presentation shows that the majority of tectal neurons recognize image sequences. This recognition is achieved through temporally dynamic response properties, which encode complex image transitions over several hundred milliseconds. Calcium imaging reveals that neurons encoding spatiotemporal image sequences fire in spike sequences that predict a logical diagram of spatiotemporal information processing. Furthermore, the temporal scale of visual information is tuned by experience. This study indicates how neurons recognize dynamic visual scenes that transform over time.