Showing 1–3 of 3 results for author: Chatfield, K

  1. arXiv:2212.14504

    cs.CV

    Improving Visual Representation Learning through Perceptual Understanding

    Authors: Samyakh Tukra, Frederick Hoffman, Ken Chatfield

    Abstract: We present an extension to masked autoencoders (MAE) which improves on the representations learnt by the model by explicitly encouraging the learning of higher scene-level features. We do this by: (i) the introduction of a perceptual similarity term between generated and real images (ii) incorporating several techniques from the adversarial training literature including multi-scale training and ad…

    Submitted 28 March, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: v2: add additional details on MSG-MAE. In Proc CVPR 2023

  2. arXiv:1407.4764

    cs.CV cs.LG cs.NE

    Efficient On-the-fly Category Retrieval using ConvNets and GPUs

    Authors: Ken Chatfield, Karen Simonyan, Andrew Zisserman

    Abstract: We investigate the gains in precision and speed, that can be obtained by using Convolutional Networks (ConvNets) for on-the-fly retrieval - where classifiers are learnt at run time for a textual query from downloaded images, and used to rank large image or video datasets. We make three contributions: (i) we present an evaluation of state-of-the-art image representations for object category retri…

    Submitted 17 November, 2014; v1 submitted 17 July, 2014; originally announced July 2014.

    Comments: Published in proceedings of ACCV 2014

  3. arXiv:1405.3531

    cs.CV

    Return of the Devil in the Details: Delving Deep into Convolutional Nets

    Authors: Ken Chatfield, Karen Simonyan, Andrea Vedaldi, Andrew Zisserman

    Abstract: The latest generation of Convolutional Neural Networks (CNN) have achieved impressive results in challenging benchmarks on image recognition and object detection, significantly raising the interest of the community in these methods. Nevertheless, it is still unclear how different CNN methods compare with each other and with previous state-of-the-art shallow representations such as the Bag-of-Visua…

    Submitted 5 November, 2014; v1 submitted 14 May, 2014; originally announced May 2014.

    Comments: Published in proceedings of BMVC 2014