Zum Hauptinhalt springen

Showing 151–200 of 331 results for author: Torr, P

.
  1. arXiv:2111.03555  [pdf, other

    cs.LG

    AUTOKD: Automatic Knowledge Distillation Into A Student Architecture Family

    Authors: Roy Henha Eyono, Fabio Maria Carlucci, Pedro M Esperança, Binxin Ru, Phillip Torr

    Abstract: State-of-the-art results in deep learning have been improving steadily, in good part due to the use of larger models. However, widespread use is constrained by device hardware limitations, resulting in a substantial performance gap between state-of-the-art models and those that can be effectively deployed on small devices. While Knowledge Distillation (KD) theoretically enables small student model… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 12 pages, 8 figures

  2. arXiv:2111.00079  [pdf, other

    cs.CV cs.LG

    Deep Deterministic Uncertainty for Semantic Segmentation

    Authors: Jishnu Mukhoti, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

    Abstract: We extend Deep Deterministic Uncertainty (DDU), a method for uncertainty estimation using feature space densities, to semantic segmentation. DDU enables quantifying and disentangling epistemic and aleatoric uncertainty in a single forward pass through the model. We study the similarity of feature representations of pixels at different locations for the same class and conclude that it is feasible t… ▽ More

    Submitted 29 October, 2021; originally announced November 2021.

  3. arXiv:2110.02483  [pdf, other

    stat.ML cs.CR cs.LG stat.AP

    Detecting and Quantifying Malicious Activity with Simulation-based Inference

    Authors: Andrew Gambardella, Bogdan State, Naeemullah Khan, Leo Tsourides, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: We propose the use of probabilistic programming techniques to tackle the malicious user identification problem in a recommendation algorithm. Probabilistic programming provides numerous advantages over other techniques, including but not limited to providing a disentangled representation of how malicious users acted under a structured model, as well as allowing for the quantification of damage cau… ▽ More

    Submitted 7 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Short version, appeared at ICML workshop on Socially Responsible Machine Learning 2021

  4. arXiv:2109.05211  [pdf, other

    cs.CV

    RobustART: Benchmarking Robustness on Architecture Design and Training Techniques

    Authors: Shiyu Tang, Ruihao Gong, Yan Wang, Aishan Liu, Jiakai Wang, Xinyun Chen, Fengwei Yu, Xianglong Liu, Dawn Song, Alan Yuille, Philip H. S. Torr, Dacheng Tao

    Abstract: Deep neural networks (DNNs) are vulnerable to adversarial noises, which motivates the benchmark of model robustness. Existing benchmarks mainly focus on evaluating defenses, but there are no comprehensive studies of how architecture design and training techniques affect robustness. Comprehensively benchmarking their relationships is beneficial for better understanding and developing robust DNNs. T… ▽ More

    Submitted 13 January, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

  5. arXiv:2108.04226  [pdf, other

    cs.CV cs.LG

    Class-Agnostic Segmentation Loss and Its Application to Salient Object Detection and Segmentation

    Authors: Angira Sharma, Naeemullah Khan, Muhammad Mubashar, Ganesh Sundaramoorthi, Philip Torr

    Abstract: In this paper we present a novel loss function, called class-agnostic segmentation (CAS) loss. With CAS loss the class descriptors are learned during training of the network. We don't require to define the label of a class a-priori, rather the CAS loss clusters regions with similar appearance together in a weakly-supervised manner. Furthermore, we show that the CAS loss function is sparse, bounded… ▽ More

    Submitted 16 July, 2021; originally announced August 2021.

    Comments: IJCAI 2021 WSRL (Weakly Supervised Representation Learning) Workshop. arXiv admin note: substantial text overlap with arXiv:2010.14793

  6. arXiv:2108.01684  [pdf, other

    cs.CV

    Vision Transformer with Progressive Sampling

    Authors: Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, Dahua Lin

    Abstract: Transformers with powerful global relation modeling abilities have been introduced to fundamental computer vision tasks recently. As a typical example, the Vision Transformer (ViT) directly applies a pure transformer architecture on image classification, by simply splitting images into tokens with a fixed length, and employing transformers to learn relations between these tokens. However, such nai… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  7. arXiv:2108.00977  [pdf, other

    cs.CV cs.LG

    Multilevel Knowledge Transfer for Cross-Domain Object Detection

    Authors: Botos Csaba, Xiaojuan Qi, Arslan Chaudhry, Puneet Dokania, Philip Torr

    Abstract: Domain shift is a well known problem where a model trained on a particular domain (source) does not perform well when exposed to samples from a different domain (target). Unsupervised methods that can adapt to domain shift are highly desirable as they allow effective utilization of the source data without requiring additional annotated training data from the target. Practically, obtaining sufficie… ▽ More

    Submitted 3 August, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

  8. arXiv:2107.14228  [pdf, other

    cs.CV cs.LG

    Open-World Entity Segmentation

    Authors: Lu Qi, Jason Kuen, Yi Wang, Jiuxiang Gu, Hengshuang Zhao, Zhe Lin, Philip Torr, Jiaya Jia

    Abstract: We introduce a new image segmentation task, called Entity Segmentation (ES), which aims to segment all visual entities (objects and stuffs) in an image without predicting their semantic labels. By removing the need of class label prediction, the models trained for such task can focus more on improving segmentation quality. It has many practical applications such as image manipulation and editing w… ▽ More

    Submitted 19 December, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Project page: http://luqi.info/Entity_Web

  9. arXiv:2107.08295  [pdf, other

    cs.AI cs.MA

    Communicating via Markov Decision Processes

    Authors: Samuel Sokota, Christian Schroeder de Witt, Maximilian Igl, Luisa Zintgraf, Philip Torr, Martin Strohmeier, J. Zico Kolter, Shimon Whiteson, Jakob Foerster

    Abstract: We consider the problem of communicating exogenous information by means of Markov decision process trajectories. This setting, which we call a Markov coding game (MCG), generalizes both source coding and a large class of referential games. MCGs also isolate a problem that is important in decentralized control settings in which cheap-talk is not available -- namely, they require balancing communica… ▽ More

    Submitted 12 June, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

    Comments: ICML 2022

  10. arXiv:2107.05790  [pdf, other

    cs.CV

    Visual Parser: Representing Part-whole Hierarchies with Transformers

    Authors: Shuyang Sun, Xiaoyu Yue, Song Bai, Philip Torr

    Abstract: Human vision is able to capture the part-whole hierarchical information from the entire scene. This paper presents the Visual Parser (ViP) that explicitly constructs such a hierarchy with transformers. ViP divides visual representations into two levels, the part level and the whole level. Information of each part represents a combination of several independent vectors within the whole. To model th… ▽ More

    Submitted 8 January, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

  11. arXiv:2107.04570  [pdf, other

    cs.LG cs.CV

    ANCER: Anisotropic Certification via Sample-wise Volume Maximization

    Authors: Francisco Eiras, Motasem Alfarra, M. Pawan Kumar, Philip H. S. Torr, Puneet K. Dokania, Bernard Ghanem, Adel Bibi

    Abstract: Randomized smoothing has recently emerged as an effective tool that enables certification of deep neural network classifiers at scale. All prior art on randomized smoothing has focused on isotropic $\ell_p$ certification, which has the advantage of yielding certificates that can be easily compared among isotropic methods via $\ell_p$-norm radius. However, isotropic certification limits the region… ▽ More

    Submitted 31 August, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: First two authors and the last one contributed equally to this work

  12. arXiv:2107.02156  [pdf, other

    cs.CV cs.AI

    Do Different Tracking Tasks Require Different Appearance Models?

    Authors: Zhongdao Wang, Hengshuang Zhao, Ya-Li Li, Shengjin Wang, Philip H. S. Torr, Luca Bertinetto

    Abstract: Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision. However, with the years, a Cambrian explosion of use cases and benchmarks has fragmented the problem in a multitude of different experimental setups. As a consequence, the literature has fragmented too, and now novel approaches proposed by the community are usually specialised to f… ▽ More

    Submitted 1 December, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: To appear at NeurIPS 2021

  13. arXiv:2107.00996  [pdf, other

    cs.LG stat.ML

    DeformRS: Certifying Input Deformations with Randomized Smoothing

    Authors: Motasem Alfarra, Adel Bibi, Naeemullah Khan, Philip H. S. Torr, Bernard Ghanem

    Abstract: Deep neural networks are vulnerable to input deformations in the form of vector fields of pixel displacements and to other parameterized geometric deformations e.g. translations, rotations, etc. Current input deformation certification methods either 1. do not scale to deep networks on large input datasets, or 2. can only certify a specific class of deformations, e.g. only rotations. We reformulate… ▽ More

    Submitted 19 December, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: Accepted to AAAI Conference on Artificial Intelligence (AAAI'22)

  14. arXiv:2106.12570  [pdf, other

    cs.LG cs.CV

    Learning Multimodal VAEs through Mutual Supervision

    Authors: Tom Joy, Yuge Shi, Philip H. S. Torr, Tom Rainforth, Sebastian M. Schmon, N. Siddharth

    Abstract: Multimodal VAEs seek to model the joint distribution over heterogeneous data (e.g.\ vision, language), whilst also capturing a shared representation across such modalities. Prior work has typically combined information from the modalities by reconciling idiosyncratic representations directly in the recognition model through explicit products, mixtures, or other such factorisations. Here we introdu… ▽ More

    Submitted 16 December, 2022; v1 submitted 23 June, 2021; originally announced June 2021.

  15. arXiv:2106.07780  [pdf, other

    cs.LG

    KL Guided Domain Adaptation

    Authors: A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: Domain adaptation is an important problem and often needed for real-world applications. In this problem, instead of i.i.d. training and testing datapoints, we assume that the source (training) data and the target (testing) data have different distributions. With that setting, the empirical risk minimization training procedure often does not perform well, since it does not account for the change in… ▽ More

    Submitted 14 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR2022

  16. Large-scale Unsupervised Semantic Segmentation

    Authors: Shanghua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip Torr

    Abstract: Empowered by large datasets, e.g., ImageNet, unsupervised learning on large-scale data has enabled significant advances for classification tasks. However, whether the large-scale unsupervised semantic segmentation can be achieved remains unknown. There are two major challenges: i) we need a large-scale benchmark for assessing algorithms; ii) we need to develop methods to simultaneously learn categ… ▽ More

    Submitted 3 November, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Benchmark and Source Code: https://github.com/LUSSeg

    Journal ref: IEEE TPAMI 2022

  17. arXiv:2106.01908  [pdf, other

    cs.CV cs.LG

    You Never Cluster Alone

    Authors: Yuming Shen, Ziyi Shen, Menghan Wang, Jie Qin, Philip H. S. Torr, Ling Shao

    Abstract: Recent advances in self-supervised learning with instance-level contrastive objectives facilitate unsupervised clustering. However, a standalone datum is not perceiving the context of the holistic cluster, and may undergo sub-optimal assignment. In this paper, we extend the mainstream contrastive learning paradigm to a cluster-level scheme, where all the data subjected to the same cluster contribu… ▽ More

    Submitted 21 January, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  18. arXiv:2105.05838  [pdf, other

    cs.CV

    Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning

    Authors: Yansong Tang, Zhenyu Jiang, Zhenda Xie, Yue Cao, Zheng Zhang, Philip H. S. Torr, Han Hu

    Abstract: Previous cycle-consistency correspondence learning methods usually leverage image patches for training. In this paper, we present a fully convolutional method, which is simpler and more coherent to the inference process. While directly applying fully convolutional training results in model collapse, we study the underline reason behind this collapse phenomenon, indicating that the absolute positio… ▽ More

    Submitted 28 December, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: ICCV 2021 SRVU workshop

  19. arXiv:2104.09937  [pdf, other

    cs.LG stat.ML

    Gradient Matching for Domain Generalization

    Authors: Yuge Shi, Jeffrey Seely, Philip H. S. Torr, N. Siddharth, Awni Hannun, Nicolas Usunier, Gabriel Synnaeve

    Abstract: Machine learning systems typically assume that the distributions of training and test sets match closely. However, a critical requirement of such systems in the real world is their ability to generalize to unseen domains. Here, we propose an inter-domain gradient matching objective that targets domain generalization by maximizing the inner product between gradients from different domains. Since di… ▽ More

    Submitted 13 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  20. arXiv:2104.08760  [pdf, other

    cs.CV cs.AI cs.LG

    Solving Inefficiency of Self-supervised Representation Learning

    Authors: Guangrun Wang, Keze Wang, Guangcong Wang, Philip H. S. Torr, Liang Lin

    Abstract: Self-supervised learning (especially contrastive learning) has attracted great interest due to its huge potential in learning discriminative representations in an unsupervised manner. Despite the acknowledged successes, existing contrastive learning methods suffer from very low learning efficiency, e.g., taking about ten times more training epochs than supervised learning for comparable recognitio… ▽ More

    Submitted 21 October, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: ICCV 2021 paper, oral presentation

  21. arXiv:2104.06718  [pdf, other

    cs.LG cs.LO stat.ML

    Improved Branch and Bound for Neural Network Verification via Lagrangian Decomposition

    Authors: Alessandro De Palma, Rudy Bunel, Alban Desmaison, Krishnamurthy Dvijotham, Pushmeet Kohli, Philip H. S. Torr, M. Pawan Kumar

    Abstract: We improve the scalability of Branch and Bound (BaB) algorithms for formally proving input-output properties of neural networks. First, we propose novel bounding algorithms based on Lagrangian Decomposition. Previous works have used off-the-shelf solvers to solve relaxations at each node of the BaB tree, or constructed weaker relaxations that can be solved efficiently, but lead to unnecessarily we… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: Submitted for review to JMLR. This is an extended version of our paper in the UAI-20 conference (arXiv:2002.10410)

  22. arXiv:2104.05519  [pdf, other

    cs.CV

    Cloth Interactive Transformer for Virtual Try-On

    Authors: Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Philip H. S. Torr, Nicu Sebe

    Abstract: The 2D image-based virtual try-on has aroused increased interest from the multimedia and computer vision fields due to its enormous commercial value. Nevertheless, most existing image-based virtual try-on approaches directly combine the person-identity representation and the in-shop clothing items without taking their mutual correlations into consideration. Moreover, these methods are commonly est… ▽ More

    Submitted 20 August, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: 16 pages, 7 figures, Accepted by ACM ToMM in 2023

  23. arXiv:2103.14347  [pdf, other

    cs.LG cs.CV

    Combating Adversaries with Anti-Adversaries

    Authors: Motasem Alfarra, Juan C. Pérez, Ali Thabet, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: Deep neural networks are vulnerable to small input perturbations known as adversarial attacks. Inspired by the fact that these adversaries are constructed by iteratively minimizing the confidence of a network for the true class label, we propose the anti-adversary layer, aimed at countering this effect. In particular, our layer generates an input perturbation in the opposite direction of the adver… ▽ More

    Submitted 16 December, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted to AAAI Conference on Artificial Intelligence (AAAI'22)

  24. arXiv:2102.11582  [pdf, other

    cs.LG stat.ML

    Deep Deterministic Uncertainty: A Simple Baseline

    Authors: Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H. S. Torr, Yarin Gal

    Abstract: Reliable uncertainty from deterministic single-forward pass models is sought after because conventional methods of uncertainty quantification are computationally expensive. We take two complex single-forward-pass uncertainty approaches, DUQ and SNGP, and examine whether they mainly rely on a well-regularized feature space. Crucially, without using their more complex methods for estimating uncertai… ▽ More

    Submitted 28 January, 2022; v1 submitted 23 February, 2021; originally announced February 2021.

  25. arXiv:2102.08497  [pdf, other

    cs.CV

    Shape-Tailored Deep Neural Networks

    Authors: Naeemullah Khan, Angira Sharma, Ganesh Sundaramoorthi, Philip H. S. Torr

    Abstract: We present Shape-Tailored Deep Neural Networks (ST-DNN). ST-DNN extend convolutional networks (CNN), which aggregate data from fixed shape (square) neighborhoods, to compute descriptors defined on arbitrarily shaped regions. This is natural for segmentation, where descriptors should describe regions (e.g., of objects) that have diverse shape. We formulate these descriptors through the Poisson part… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  26. arXiv:2102.01558  [pdf, other

    cs.CV

    Occluded Video Instance Segmentation: A Benchmark

    Authors: Jiyang Qi, Yan Gao, Yao Hu, Xinggang Wang, Xiaoyu Liu, Xiang Bai, Serge Belongie, Alan Yuille, Philip H. S. Torr, Song Bai

    Abstract: Can our video understanding systems perceive objects when a heavy occlusion exists in a scene? To answer this question, we collect a large-scale dataset called OVIS for occluded video instance segmentation, that is, to simultaneously detect, segment, and track instances in occluded scenes. OVIS consists of 296k high-quality instance masks from 25 semantic categories, where object occlusions usua… ▽ More

    Submitted 17 May, 2022; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: IJCV 2022. Project page at https://songbai.site/ovis

    MSC Class: 68T07; 68T45

  27. arXiv:2101.05844  [pdf, other

    cs.LG

    Scaling the Convex Barrier with Sparse Dual Algorithms

    Authors: Alessandro De Palma, Harkirat Singh Behl, Rudy Bunel, Philip H. S. Torr, M. Pawan Kumar

    Abstract: Tight and efficient neural network bounding is crucial to the scaling of neural network verification systems. Many efficient bounding algorithms have been presented recently, but they are often too loose to verify more challenging properties. This is due to the weakness of the employed relaxation, which is usually a linear program of size linear in the number of neurons. While a tighter linear rel… ▽ More

    Submitted 26 February, 2024; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: Journal of Machine Learning Research, 2024 (extension of ICLR 2021 paper in [v1])

  28. arXiv:2012.15840  [pdf, other

    cs.CV

    Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

    Authors: Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H. S. Torr, Li Zhang

    Abstract: Most recent semantic segmentation methods adopt a fully-convolutional network (FCN) with an encoder-decoder architecture. The encoder progressively reduces the spatial resolution and learns more abstract/semantic visual concepts with larger receptive fields. Since context modeling is critical for segmentation, the latest efforts have been focused on increasing the receptive field, through either d… ▽ More

    Submitted 25 July, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: CVPR 2021. Project page at https://fudan-zvg.github.io/SETR/

  29. arXiv:2012.13220  [pdf, other

    cs.LG stat.ML

    On Batch Normalisation for Approximate Bayesian Inference

    Authors: Jishnu Mukhoti, Puneet K. Dokania, Philip H. S. Torr, Yarin Gal

    Abstract: We study batch normalisation in the context of variational inference methods in Bayesian neural networks, such as mean-field or MC Dropout. We show that batch-normalisation does not affect the optimum of the evidence lower bound (ELBO). Furthermore, we study the Monte Carlo Batch Normalisation (MCBN) algorithm, proposed as an approximate inference technique parallel to MC Dropout, and show that fo… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

  30. arXiv:2012.09434  [pdf, other

    cs.CV

    Multi-shot Temporal Event Localization: a Benchmark

    Authors: Xiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr

    Abstract: Current developments in temporal event or action localization usually target actions captured by a single camera. However, extensive events or actions in the wild may be captured as a sequence of shots by multiple cameras at different positions. In this paper, we propose a new and challenging task called multi-shot temporal event localization, and accordingly, collect a large scale dataset called… ▽ More

    Submitted 15 April, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: CVPR 2021. Project page at https://songbai.site/muses/

  31. arXiv:2012.09164  [pdf, other

    cs.CV

    Point Transformer

    Authors: Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip Torr, Vladlen Koltun

    Abstract: Self-attention networks have revolutionized natural language processing and are making impressive strides in image analysis tasks such as image classification and object detection. Inspired by this success, we investigate the application of self-attention networks to 3D point cloud processing. We design self-attention layers for point clouds and use these to construct self-attention networks for t… ▽ More

    Submitted 26 September, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

  32. GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation

    Authors: Xiaojuan Qi, Zhengzhe Liu, Renjie Liao, Philip H. S. Torr, Raquel Urtasun, Jiaya Jia

    Abstract: In this paper, we propose a geometric neural network with edge-aware refinement (GeoNet++) to jointly predict both depth and surface normal maps from a single image. Building on top of two-stream CNNs, GeoNet++ captures the geometric relationships between depth and surface normals with the proposed depth-to-normal and normal-to-depth modules. In particular, the "depth-to-normal" module exploits th… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

    Comments: TPAMI 2020. Code available: https://github.com/xjqi/GeoNet

  33. arXiv:2012.04351  [pdf, other

    cs.LG

    Data-Dependent Randomized Smoothing

    Authors: Motasem Alfarra, Adel Bibi, Philip H. S. Torr, Bernard Ghanem

    Abstract: Randomized smoothing is a recent technique that achieves state-of-art performance in training certifiably robust deep neural networks. While the smoothing family of distributions is often connected to the choice of the norm used for certification, the parameters of these distributions are always set as global hyper parameters independent from the input data on which a network is certified. In this… ▽ More

    Submitted 5 July, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted in Uncertainty in Artificial Intelligence Conference (UAI 2022). First two authors contributed equally to this work

  34. arXiv:2011.12097  [pdf, other

    cs.CV cs.MM

    Learning to Sample the Most Useful Training Patches from Images

    Authors: Shuyang Sun, Liang Chen, Gregory Slabaugh, Philip Torr

    Abstract: Some image restoration tasks like demosaicing require difficult training samples to learn effective models. Existing methods attempt to address this data training problem by manually collecting a new training dataset that contains adequate hard samples, however, there are still hard and simple areas even within one single image. In this paper, we present a data-driven approach called PatchNet that… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  35. arXiv:2011.09533  [pdf, other

    cs.AI

    Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

    Authors: Christian Schroeder de Witt, Tarun Gupta, Denys Makoviichuk, Viktor Makoviychuk, Philip H. S. Torr, Mingfei Sun, Shimon Whiteson

    Abstract: Most recently developed approaches to cooperative multi-agent reinforcement learning in the \emph{centralized training with decentralized execution} setting involve estimating a centralized, joint value function. In this paper, we demonstrate that, despite its various theoretical shortcomings, Independent PPO (IPPO), a form of independent learning in which each agent simply estimates its local val… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

  36. arXiv:2010.14793  [pdf, other

    cs.CV cs.LG

    Class-Agnostic Segmentation Loss and Its Application to Salient Object Detection and Segmentation

    Authors: Angira Sharma, Naeemullah Khan, Ganesh Sundaramoorthi, Philip Torr

    Abstract: In this paper we present a novel loss function, called class-agnostic segmentation (CAS) loss. With CAS loss the class descriptors are learned during training of the network. We don't require to define the label of a class a-priori, rather the CAS loss clusters regions with similar appearance together in a weakly-supervised manner. Furthermore, we show that the CAS loss function is sparse, bounded… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  37. arXiv:2010.12136  [pdf, other

    cs.CV cs.CL cs.LG

    Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation

    Authors: Bowen Li, Xiaojuan Qi, Philip H. S. Torr, Thomas Lukasiewicz

    Abstract: We propose a novel lightweight generative adversarial network for efficient image manipulation using natural language descriptions. To achieve this, a new word-level discriminator is proposed, which provides the generator with fine-grained training feedback at word-level, to facilitate training a lightweight generator that has a small number of parameters, but can still correctly focus on specific… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020

  38. arXiv:2010.11635  [pdf, other

    cs.LG

    Continual Learning in Low-rank Orthogonal Subspaces

    Authors: Arslan Chaudhry, Naeemullah Khan, Puneet K. Dokania, Philip H. S. Torr

    Abstract: In continual learning (CL), a learner is faced with a sequence of tasks, arriving one after the other, and the goal is to remember all the tasks once the continual learning experience is finished. The prior art in CL uses episodic memory, parameter regularization or extensible network structures to reduce interference among tasks, but in the end, all the approaches learn different tasks in a joint… ▽ More

    Submitted 8 December, 2020; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: The paper is accepted at NeurIPS'20

    Journal ref: NeurIPS, 2020

  39. Diagnosing and Preventing Instabilities in Recurrent Video Processing

    Authors: Thomas Tanay, Aivar Sootla, Matteo Maggioni, Puneet K. Dokania, Philip Torr, Ales Leonardis, Gregory Slabaugh

    Abstract: Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-resolution. In this work, we focus on their stability as dynamical systems and show that they tend to fail catastrophically at inference time on long video sequences. To address this issue, we (1) introduce a diagnostic tool which produces input sequences optimized to trigger instabilities and that c… ▽ More

    Submitted 11 March, 2023; v1 submitted 10 October, 2020; originally announced October 2020.

    Journal ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 2, pp. 1594-1605, 1 Feb. 2023

  40. HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking

    Authors: Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe

    Abstract: Multi-Object Tracking (MOT) has been notoriously difficult to evaluate. Previous metrics overemphasize the importance of either detection or association. To address this, we present a novel MOT evaluation metric, HOTA (Higher Order Tracking Accuracy), which explicitly balances the effect of performing accurate detection, association and localization into a single unified metric for comparing track… ▽ More

    Submitted 29 September, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: Pre-print. Accepted for Publication in the International Journal of Computer Vision, 19 August 2020. Code is available at https://github.com/JonathonLuiten/HOTA-metrics

    Journal ref: International Journal of Computer Vision (2020)

  41. arXiv:2008.09694  [pdf, other

    cs.CV eess.IV

    Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

    Authors: Carlo Biffi, Steven McDonagh, Philip Torr, Ales Leonardis, Sarah Parisot

    Abstract: Object detection has witnessed significant progress by relying on large, manually annotated datasets. Annotating such datasets is highly time consuming and expensive, which motivates the development of weakly supervised and few-shot object detection methods. However, these methods largely underperform with respect to their strongly supervised counterpart, as weak training signals \emph{often} resu… ▽ More

    Submitted 26 August, 2020; v1 submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted at ECCV 2020. Camera-ready version and Appendices

  42. arXiv:2008.08424  [pdf, other

    cs.CV cs.GR cs.LG stat.ML

    AutoSimulate: (Quickly) Learning Synthetic Data Generation

    Authors: Harkirat Singh Behl, Atılım Güneş Baydin, Ran Gal, Philip H. S. Torr, Vibhav Vineet

    Abstract: Simulation is increasingly being used for generating large labelled datasets in many machine learning problems. Recent methods have focused on adjusting simulator parameters with the goal of maximising accuracy on a validation task, usually relying on REINFORCE-like gradient estimators. However these approaches are very expensive as they treat the entire data generation, model training, and valida… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

    Journal ref: European Conference on Computer Vision (ECCV) 2020

  43. arXiv:2008.04381  [pdf, other

    cs.CV cs.LG eess.IV

    Bipartite Graph Reasoning GANs for Person Image Generation

    Authors: Hao Tang, Song Bai, Philip H. S. Torr, Nicu Sebe

    Abstract: We present a novel Bipartite Graph Reasoning GAN (BiGraphGAN) for the challenging person image generation task. The proposed graph generator mainly consists of two novel blocks that aim to model the pose-to-pose and pose-to-image relations, respectively. Specifically, the proposed Bipartite Graph Reasoning (BGR) block aims to reason the crossing long-range relations between the source pose and the… ▽ More

    Submitted 20 August, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: 13 pages, 6 figures, accepted to BMVC 2020 as an oral paper, fix typos

  44. arXiv:2007.09278  [pdf, other

    cs.CV cs.LG eess.IV

    XingGAN for Person Image Generation

    Authors: Hao Tang, Song Bai, Li Zhang, Philip H. S. Torr, Nicu Sebe

    Abstract: We propose a novel Generative Adversarial Network (XingGAN or CrossingGAN) for person image generation tasks, i.e., translating the pose of a given person to a desired one. The proposed Xing generator consists of two generation branches that model the person's appearance and shape information, respectively. Moreover, we propose two novel blocks to effectively transfer and update the person's shape… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted to ECCV 2020, camera ready (16 pages) + supplementary (6 pages)

  45. arXiv:2007.09185  [pdf, other

    cs.AI cs.CL cs.LG

    WordCraft: An Environment for Benchmarking Commonsense Agents

    Authors: Minqi Jiang, Jelena Luketina, Nantas Nardelli, Pasquale Minervini, Philip H. S. Torr, Shimon Whiteson, Tim Rocktäschel

    Abstract: The ability to quickly solve a wide range of real-world tasks requires a commonsense understanding of the world. Yet, how to best extract such knowledge from natural language corpora and integrate it with reinforcement learning (RL) agents remains an open challenge. This is partly due to the lack of lightweight simulation environments that sufficiently reflect the semantics of the real world and p… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

  46. arXiv:2007.04028  [pdf, other

    cs.LG stat.ML

    How benign is benign overfitting?

    Authors: Amartya Sanyal, Puneet K Dokania, Varun Kanade, Philip H. S. Torr

    Abstract: We investigate two causes for adversarial vulnerability in deep neural networks: bad data and (poorly) trained models. When trained with SGD, deep neural networks essentially achieve zero training error, even in the presence of label noise, while also exhibiting good generalization on natural test data, something referred to as benign overfitting [2, 10]. However, these models are vulnerable to ad… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  47. arXiv:2007.01179  [pdf, other

    cs.LG stat.ML

    Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

    Authors: Yuge Shi, Brooks Paige, Philip H. S. Torr, N. Siddharth

    Abstract: Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language. While it has proven effective for learning generalisable representations, the training of such models often requires a large amount of "related" multimodal data that shares commonality, which can be expensive to come by… ▽ More

    Submitted 21 April, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  48. arXiv:2006.10711  [pdf, other

    cs.LG stat.ML

    STEER: Simple Temporal Regularization For Neural ODEs

    Authors: Arnab Ghosh, Harkirat Singh Behl, Emilien Dupont, Philip H. S. Torr, Vinay Namboodiri

    Abstract: Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive. Indeed, computing the forward pass of such models involves solving an ODE which can become arbitrarily complex during training. Recent works have shown that regularizing the dynamics of the ODE can partially alleviate this. In this paper we propose a new regularization technique: randomly sampling the end ti… ▽ More

    Submitted 2 November, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Neurips 2020

  49. arXiv:2006.10102  [pdf, other

    cs.LG stat.ML

    Capturing Label Characteristics in VAEs

    Authors: Tom Joy, Sebastian M. Schmon, Philip H. S. Torr, N. Siddharth, Tom Rainforth

    Abstract: We present a principled approach to incorporating labels in VAEs that captures the rich characteristic information associated with those labels. While prior work has typically conflated these by learning latent variables that directly correspond to label values, we argue this is contrary to the intended effect of supervision in VAEs-capturing rich label characteristics with the latents. For exampl… ▽ More

    Submitted 16 December, 2022; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to ICLR 2021

  50. arXiv:2006.09081  [pdf, other

    cs.CV cs.LG

    Progressive Skeletonization: Trimming more fat from a network at initialization

    Authors: Pau de Jorge, Amartya Sanyal, Harkirat S. Behl, Philip H. S. Torr, Gregory Rogez, Puneet K. Dokania

    Abstract: Recent studies have shown that skeletonization (pruning parameters) of networks \textit{at initialization} provides all the practical benefits of sparsity both at inference and training time, while only marginally degrading their performance. However, we observe that beyond a certain level of sparsity (approx $95\%$), these approaches fail to preserve the network performance, and to our surprise,… ▽ More

    Submitted 19 March, 2021; v1 submitted 16 June, 2020; originally announced June 2020.