Zum Hauptinhalt springen

Showing 151–200 of 227 results for author: Sebe, N

.
  1. TriGAN: Image-to-Image Translation for Multi-Source Domain Adaptation

    Authors: Subhankar Roy, Aliaksandr Siarohin, Enver Sangineto, Nicu Sebe, Elisa Ricci

    Abstract: Most domain adaptation methods consider the problem of transferring knowledge to the target domain from a single source dataset. However, in practical applications, we typically have access to multiple sources. In this paper we propose the first approach for Multi-Source Domain Adaptation (MSDA) based on Generative Adversarial Networks. Our method is inspired by the observation that the appearance… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

    Journal ref: Machine Vision and Applications 2021

  2. arXiv:2004.05973  [pdf, other

    cs.CV

    Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset

    Authors: Shreya Ghosh, Abhinav Dhall, Garima Sharma, Sarthak Gupta, Nicu Sebe

    Abstract: Labelling of human behavior analysis data is a complex and time consuming task. In this paper, a fully automatic technique for labelling an image based gaze behavior dataset for driver gaze zone estimation is proposed. Domain knowledge is added to the data recording paradigm and later labels are generated in an automatic manner using Speech To Text conversion (STT). In order to remove the noise in… ▽ More

    Submitted 18 October, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

  3. arXiv:2004.05551  [pdf, other

    cs.CV

    OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in An Open World

    Authors: Zhun Zhong, Linchao Zhu, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe

    Abstract: In this paper, we tackle the problem of discovering new classes in unlabeled visual data given labeled data from disjoint classes. Existing methods typically first pre-train a model with labeled data, and then identify new classes in unlabeled data via unsupervised clustering. However, the labeled data that provide essential knowledge are often underexplored in the second step. The challenge is th… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

  4. Binary Neural Networks: A Survey

    Authors: Haotong Qin, Ruihao Gong, Xianglong Liu, Xiao Bai, Jingkuan Song, Nicu Sebe

    Abstract: The binary neural network, largely saving the storage and computation, serves as a promising technique for deploying deep models on resource-limited devices. However, the binarization inevitably causes severe information loss, and even worse, its discontinuity brings difficulty to the optimization of the deep network. To address these issues, a variety of algorithms have been proposed, and achieve… ▽ More

    Submitted 31 March, 2020; originally announced April 2020.

    Journal ref: Pattern Recognition (2020) 107281

  5. arXiv:2004.03234  [pdf, other

    cs.CV

    Motion-supervised Co-Part Segmentation

    Authors: Aliaksandr Siarohin, Subhankar Roy, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe

    Abstract: Recent co-part segmentation methods mostly operate in a supervised learning setting, which requires a large amount of annotated data for training. To overcome this limitation, we propose a self-supervised deep learning method for co-part segmentation. Differently from previous works, our approach develops the idea that motion information inferred from videos can be leveraged to discover meaningful… ▽ More

    Submitted 15 April, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Journal ref: ICPR 2021

  6. arXiv:2004.03064  [pdf, other

    cs.CV cs.LG eess.IV

    Coarse-to-Fine Gaze Redirection with Numerical and Pictorial Guidance

    Authors: Jingjing Chen, Jichao Zhang, Enver Sangineto, Jiayuan Fan, Tao Chen, Nicu Sebe

    Abstract: Gaze redirection aims at manipulating the gaze of a given face image with respect to a desired direction (i.e., a reference angle) and it can be applied to many real life scenarios, such as video-conferencing or taking group photos. However, previous work on this topic mainly suffers of two limitations: (1) Low-quality image generation and (2) Low redirection precision. In this paper, we propose t… ▽ More

    Submitted 26 November, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 12 pages, accepted by WACV 2021

  7. arXiv:2003.13898  [pdf, other

    cs.CV cs.LG eess.IV

    Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis

    Authors: Hao Tang, Xiaojuan Qi, Guolei Sun, Dan Xu, Nicu Sebe, Radu Timofte, Luc Van Gool

    Abstract: We propose a novel ECGAN for the challenging semantic image synthesis task. Although considerable improvement has been achieved, the quality of synthesized images is far from satisfactory due to three largely unresolved challenges. 1) The semantic labels do not provide detailed structural information, making it difficult to synthesize local details and structures. 2) The widely adopted CNN operati… ▽ More

    Submitted 27 March, 2023; v1 submitted 30 March, 2020; originally announced March 2020.

  8. arXiv:2003.06788  [pdf, other

    cs.CV

    GMM-UNIT: Unsupervised Multi-Domain and Multi-Modal Image-to-Image Translation via Attribute Gaussian Mixture Modeling

    Authors: Yahui Liu, Marco De Nadai, Jian Yao, Nicu Sebe, Bruno Lepri, Xavier Alameda-Pineda

    Abstract: Unsupervised image-to-image translation (UNIT) aims at learning a mapping between several visual domains by using unpaired training images. Recent studies have shown remarkable success for multiple domains but they suffer from two main limitations: they are either built from several two-domain mappings that are required to be learned independently, or they generate low-diversity results, a problem… ▽ More

    Submitted 21 March, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: 27 pages, 17 figures

  9. arXiv:2003.03229  [pdf, other

    cs.NE cs.CV cs.LG stat.ML

    Non-linear Neurons with Human-like Apical Dendrite Activations

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Nicolae-Catalin Ristea, Nicu Sebe

    Abstract: In order to classify linearly non-separable data, neurons are typically organized into multi-layer neural networks that are equipped with at least one hidden layer. Inspired by some recent discoveries in neuroscience, we propose a new model of artificial neuron along with a novel activation function enabling the learning of nonlinear decision boundaries using a single neuron. We show that a standa… ▽ More

    Submitted 10 August, 2023; v1 submitted 2 February, 2020; originally announced March 2020.

    Comments: Accepted for publication in Applied Intelligence

  10. arXiv:2003.00196  [pdf, other

    cs.CV cs.AI

    First Order Motion Model for Image Animation

    Authors: Aliaksandr Siarohin, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe

    Abstract: Image animation consists of generating a video sequence so that an object in a source image is animated according to the motion of a driving video. Our framework addresses this problem without using any annotation or prior information about the specific object to animate. Once trained on a set of videos depicting objects of the same category (e.g. faces, human bodies), our method can be applied to… ▽ More

    Submitted 1 October, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: NeurIPS 2019

  11. arXiv:2002.01048  [pdf, other

    cs.CV cs.LG eess.IV

    Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation

    Authors: Hao Tang, Philip H. S. Torr, Nicu Sebe

    Abstract: We propose a novel model named Multi-Channel Attention Selection Generative Adversarial Network (SelectionGAN) for guided image-to-image translation, where we translate an input image into another while respecting an external semantic guidance. The proposed SelectionGAN explicitly utilizes the semantic guidance information and consists of two stages. In the first stage, the input image and the con… ▽ More

    Submitted 6 October, 2022; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: Accepted to TPAMI, an extended version of a paper published in CVPR2019. arXiv admin note: substantial text overlap with arXiv:1904.06807

  12. arXiv:2001.00238  [pdf, other

    cs.CV

    Low-Budget Label Query through Domain Alignment Enforcement

    Authors: Jurandy Almeida, Cristiano Saltori, Paolo Rota, Nicu Sebe

    Abstract: Deep learning revolution happened thanks to the availability of a massive amount of labelled data which have contributed to the development of models with extraordinary inference capabilities. Despite the public availability of a large quantity of datasets, to address specific requirements it is often necessary to generate a new set of labelled data. Quite often, the production of labels is costly… ▽ More

    Submitted 29 March, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

  13. arXiv:1912.12215  [pdf, other

    cs.CV cs.LG eess.IV

    Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

    Authors: Hao Tang, Dan Xu, Yan Yan, Philip H. S. Torr, Nicu Sebe

    Abstract: In this paper, we address the task of semantic-guided scene generation. One open challenge in scene generation is the difficulty of the generation of small objects and detailed local texture, which has been widely observed in global image-level generation methods. To tackle this issue, in this work we consider learning the scene generation in a local context, and correspondingly design a local cla… ▽ More

    Submitted 30 March, 2020; v1 submitted 27 December, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020, camera ready (10 pages) + supplementary (18 pages)

  14. arXiv:1912.06931  [pdf, other

    cs.CV cs.LG eess.IV

    Asymmetric GANs for Image-to-Image Translation

    Authors: Hao Tang, Nicu Sebe

    Abstract: Existing models for unsupervised image translation with Generative Adversarial Networks (GANs) can learn the mapping from the source domain to the target domain using a cycle-consistency loss. However, these methods always adopt a symmetric network architecture to learn both forward and backward cycles. Because of the task complexity and cycle input difference between the source and target domains… ▽ More

    Submitted 12 July, 2024; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: Added more information

  15. arXiv:1912.06112  [pdf, other

    cs.CV cs.LG eess.IV

    Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

    Authors: Hao Tang, Hong Liu, Nicu Sebe

    Abstract: We propose a unified Generative Adversarial Network (GAN) for controllable image-to-image translation, i.e., transferring an image from a source to a target domain guided by controllable structures. In addition to conditioning on a reference image, we show how the model can generate images conditioned on controllable structures, e.g., class labels, object keypoints, human skeletons, and scene sema… ▽ More

    Submitted 2 September, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Accepted to TIP, an extended version of a paper published in ACM MM 2018. arXiv admin note: substantial text overlap with arXiv:1808.04859

  16. arXiv:1911.11897  [pdf, other

    cs.CV cs.LG eess.IV

    AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks

    Authors: Hao Tang, Hong Liu, Dan Xu, Philip H. S. Torr, Nicu Sebe

    Abstract: State-of-the-art methods in image-to-image translation are capable of learning a mapping from a source domain to a target domain with unpaired image data. Though the existing methods have achieved promising results, they still produce visual artifacts, being able to translate low-level information but not high-level semantics of input images. One possible reason is that generators do not have the… ▽ More

    Submitted 16 August, 2021; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: Accepted to TNNLS, an extended version of a paper published in IJCNN2019. arXiv admin note: substantial text overlap with arXiv:1903.12296

  17. Reduction of SISO H-infinity Output Feedback Control Problem

    Authors: Hayato Waki, Yoshio Ebihara, Noboru Sebe

    Abstract: We consider the linear matrix inequality (LMI) problem of $H_\infty$ output feedback control problem for a generalized plant whose control input, measured output, disturbance input, and controlled output are scalar. We provide an explicit form of the optimal value. This form is the unification of some results in the literature of $H_\infty$ performance limitation analysis. To obtain the form of th… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: Submitted to a journal

    MSC Class: 49K30; 90C22; 93C05; 34K35

  18. arXiv:1911.06849  [pdf, other

    cs.CV cs.LG

    Curriculum Self-Paced Learning for Cross-Domain Object Detection

    Authors: Petru Soviany, Radu Tudor Ionescu, Paolo Rota, Nicu Sebe

    Abstract: Training (source) domain bias affects state-of-the-art object detectors, such as Faster R-CNN, when applied to new (target) domains. To alleviate this problem, researchers proposed various domain adaptation methods to improve object detection results in the cross-domain setting, e.g. by translating images with ground-truth labels from the source domain to the target domain using Cycle-GAN. On top… ▽ More

    Submitted 20 January, 2021; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Accepted for publication in Computer Vision and Image Understanding

  19. arXiv:1911.03884  [pdf, other

    eess.SY math.OC

    Learning Koopman Operator under Dissipativity Constraints

    Authors: Keita Hara, Masaki Inoue, Noboru Sebe

    Abstract: This paper addresses a learning problem for nonlinear dynamical systems with incorporating any specified dissipativity property. The nonlinear systems are described by the Koopman operator, which is a linear operator defined on the infinite-dimensional lifted state space. The problem of learning the Koopman operator under specified quadratic dissipativity constraints is formulated and addressed. T… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

  20. arXiv:1909.07667  [pdf, other

    cs.CV

    Progressive Fusion for Unsupervised Binocular Depth Estimation using Cycled Networks

    Authors: Andrea Pilzer, Stéphane Lathuilière, Dan Xu, Mihai Marian Puscas, Elisa Ricci, Nicu Sebe

    Abstract: Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to TPAMI (SI RGB-D Vision), code https://github.com/andrea-pilzer/PFN-depth

  21. arXiv:1908.05794  [pdf, other

    cs.CV eess.IV

    Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation

    Authors: Mihai Marian Puscas, Dan Xu, Andrea Pilzer, Nicu Sebe

    Abstract: Inspired by the success of adversarial learning, we propose a new end-to-end unsupervised deep learning framework for monocular depth estimation consisting of two Generative Adversarial Networks (GAN), deeply coupled with a structured Conditional Random Field (CRF) model. The two GANs aim at generating distinct and complementary disparity maps and at improving the generation quality via exploiting… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: Accepted at 3DV 2019 as ORAL

  22. arXiv:1908.00999  [pdf, other

    cs.CV cs.LG eess.IV

    Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation

    Authors: Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, Nicu Sebe, Yan Yan

    Abstract: In this work, we propose a novel Cycle In Cycle Generative Adversarial Network (C$^2$GAN) for the task of keypoint-guided image generation. The proposed C$^2$GAN is a cross-modal framework exploring a joint exploitation of the keypoint and the image data in an interactive manner. C$^2$GAN contains two different types of generators, i.e., keypoint-oriented generator and image-oriented generator. Bo… ▽ More

    Submitted 15 April, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: 9 pages, 8 figures, accepted to ACM MM 2019

    Journal ref: ACM MM 2019

  23. Effortless Deep Training for Traffic Sign Detection Using Templates and Arbitrary Natural Images

    Authors: Lucas Tabelini Torres, Thiago M. Paixão, Rodrigo F. Berriel, Alberto F. De Souza, Claudine Badue, Nicu Sebe, Thiago Oliveira-Santos

    Abstract: Deep learning has been successfully applied to several problems related to autonomous driving. Often, these solutions rely on large networks that require databases of real image samples of the problem (i.e., real world) for proper training. The acquisition of such real-world data sets is not always possible in the autonomous driving context, and sometimes their annotation is not feasible (e.g., ta… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

  24. Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

    Authors: Vinicius F. Arruda, Thiago M. Paixão, Rodrigo F. Berriel, Alberto F. De Souza, Claudine Badue, Nicu Sebe, Thiago Oliveira-Santos

    Abstract: Deep learning techniques have enabled the emergence of state-of-the-art models to address object detection tasks. However, these techniques are data-driven, delegating the accuracy to the training dataset which must resemble the images in the target task. The acquisition of a dataset involves annotating images, an arduous and expensive process, generally requiring time and manual effort. Thus, a c… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 8 pages, 8 figures, https://github.com/viniciusarruda/cross-domain-car-detection and accepted at IJCNN 2019

  25. Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps

    Authors: Yahui Liu, Marco De Nadai, Gloria Zen, Nicu Sebe, Bruno Lepri

    Abstract: Recent works have shown Generative Adversarial Networks (GANs) to be particularly effective in image-to-image translations. However, in tasks such as body pose and hand gesture translation, existing methods usually require precise annotations, e.g. key-points or skeletons, which are time-consuming to draw. In this work, we propose a novel GAN architecture that decouples the required annotations in… ▽ More

    Submitted 31 July, 2019; v1 submitted 12 July, 2019; originally announced July 2019.

    Comments: 15 pages, 12 figures

    Journal ref: 27th ACM International Conference on Multimedia, 2019

  26. arXiv:1906.03525  [pdf, other

    cs.CV

    Pattern-Affinitive Propagation across Depth, Surface Normal and Semantic Segmentation

    Authors: Zhenyu Zhang, Zhen Cui, Chunyan Xu, Yan Yan, Nicu Sebe, Jian Yang

    Abstract: In this paper, we propose a novel Pattern-Affinitive Propagation (PAP) framework to jointly predict depth, surface normal and semantic segmentation. The motivation behind it comes from the statistic observation that pattern-affinitive pairs recur much frequently across different tasks as well as within a task. Thus, we can conduct two types of propagations, cross-task propagation and task-specific… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

    Comments: 10 pages, 9 figures, CVPR 2019

  27. arXiv:1906.00805  [pdf, other

    cs.CV cs.AI

    GazeCorrection:Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks

    Authors: Jichao Zhang, Meng Sun, Jingjing Chen, Hao Tang, Yan Yan, Xueying Qin, Nicu Sebe

    Abstract: Gaze correction aims to redirect the person's gaze into the camera by manipulating the eye region, and it can be considered as a specific image resynthesis problem. Gaze correction has a wide range of applications in real life, such as taking a picture with staring at the camera. In this paper, we propose a novel method that is based on the inpainting model to learn from the face image to fill in… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

  28. Impact of facial landmark localization on facial expression recognition

    Authors: Romain Belmonte, Benjamin Allaert, Pierre Tirilly, Ioan Marius Bilasco, Chaabane Djeraba, Nicu Sebe

    Abstract: Although facial landmark localization (FLL) approaches are becoming increasingly accurate for characterizing facial regions, one question remains unanswered: what is the impact of these approaches on subsequent related tasks? In this paper, the focus is put on facial expression recognition (FER), where facial landmarks are used for face registration, which is a common usage. Since the most used da… ▽ More

    Submitted 19 July, 2021; v1 submitted 26 May, 2019; originally announced May 2019.

  29. Regularized Evolutionary Algorithm for Dynamic Neural Topology Search

    Authors: Cristiano Saltori, Subhankar Roy, Nicu Sebe, Giovanni Iacca

    Abstract: Designing neural networks for object recognition requires considerable architecture engineering. As a remedy, neuro-evolutionary network architecture search, which automatically searches for optimal network architectures using evolutionary algorithms, has recently become very popular. Although very effective, evolutionary algorithms rely heavily on having a large population of individuals (i.e., n… ▽ More

    Submitted 19 August, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

  30. Budget-Aware Adapters for Multi-Domain Learning

    Authors: Rodrigo Berriel, Stéphane Lathuilière, Moin Nabi, Tassilo Klein, Thiago Oliveira-Santos, Nicu Sebe, Elisa Ricci

    Abstract: Multi-Domain Learning (MDL) refers to the problem of learning a set of models derived from a common deep architecture, each one specialized to perform a task in a certain domain (e.g., photos, sketches, paintings). This paper tackles MDL with a particular interest in obtaining domain-specific models with an adjustable budget in terms of the number of network parameters and computational complexity… ▽ More

    Submitted 8 December, 2020; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: ICCV 2019

  31. arXiv:1905.05416  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Expression Conditional GAN for Facial Expression-to-Expression Translation

    Authors: Hao Tang, Wei Wang, Songsong Wu, Xinya Chen, Dan Xu, Nicu Sebe, Yan Yan

    Abstract: In this paper, we focus on the facial expression translation task and propose a novel Expression Conditional GAN (ECGAN) which can learn the mapping from one image domain to another one based on an additional expression attribute. The proposed ECGAN is a generic framework and is applicable to different expression generation tasks where specific facial expression can be easily controlled by the con… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 5 pages, 5 figures, accepted to ICIP 2019

  32. arXiv:1905.02655  [pdf, other

    cs.CV

    Attention-based Fusion for Multi-source Human Image Generation

    Authors: Stéphane Lathuilière, Enver Sangineto, Aliaksandr Siarohin, Nicu Sebe

    Abstract: We present a generalization of the person-image generation task, in which a human image is generated conditioned on a target pose and a set X of source appearance images. In this way, we can exploit multiple, possibly complementary images of the same person which are usually available at training and at testing time. The solution we propose is mainly based on a local attention mechanism which sele… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: 10 pages

  33. arXiv:1905.00007  [pdf, other

    cs.CV

    Appearance and Pose-Conditioned Human Image Generation using Deformable GANs

    Authors: Aliaksandr Siarohin, Stéphane Lathuilière, Enver Sangineto, Nicu Sebe

    Abstract: In this paper, we address the problem of generating person images conditioned on both pose and appearance information. Specifically, given an image xa of a person and a target pose P(xb), extracted from a different image xb, we synthesize a new image of that person in pose P(xb), while preserving the visual details in xa. In order to deal with pixel-to-pixel misalignments caused by the pose differ… ▽ More

    Submitted 14 October, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

    Comments: To appear on IEEE TPAMI. arXiv admin note: substantial text overlap with arXiv:1801.00055

  34. arXiv:1904.08462  [pdf, other

    cs.CV

    Online Adaptation through Meta-Learning for Stereo Depth Estimation

    Authors: Zhenyu Zhang, Stéphane Lathuilière, Andrea Pilzer, Nicu Sebe, Elisa Ricci, Jian Yang

    Abstract: In this work, we tackle the problem of online adaptation for stereo depth estimation, that consists in continuously adapting a deep network to a target video recordedin an environment different from that of the source training set. To address this problem, we propose a novel Online Meta-Learning model with Adaption (OMLA). Our proposal is based on two main contributions. First, to reducethe domain… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: 12 pages

  35. arXiv:1904.06807  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

    Authors: Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan

    Abstract: Cross-view image translation is challenging because it involves images with drastically different views and severe deformation. In this paper, we propose a novel approach named Multi-Channel Attention SelectionGAN (SelectionGAN) that makes it possible to generate images of natural scenes in arbitrary viewpoints, based on an image of the scene and a novel semantic map. The proposed SelectionGAN exp… ▽ More

    Submitted 16 April, 2019; v1 submitted 14 April, 2019; originally announced April 2019.

    Comments: 20 pages, 16 figures, accepted to CVPR 2019 as an oral paper

    Journal ref: CVPR 2019

  36. Metric-Learning based Deep Hashing Network for Content Based Retrieval of Remote Sensing Images

    Authors: Subhankar Roy, Enver Sangineto, Begüm Demir, Nicu Sebe

    Abstract: Hashing methods have been recently found very effective in retrieval of remote sensing (RS) images due to their computational efficiency and fast search speed. The traditional hashing methods in RS usually exploit hand-crafted features to learn hash functions to obtain binary codes, which can be insufficient to optimally represent the information content of RS images. To overcome this problem, in… ▽ More

    Submitted 6 January, 2021; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: Accepted to IEEE Geoscience and Remote Sensing Letters. For code visit: https://github.com/MLEnthusiast/MHCLN

  37. arXiv:1903.12296  [pdf, other

    cs.CV

    Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation

    Authors: Hao Tang, Dan Xu, Nicu Sebe, Yan Yan

    Abstract: The state-of-the-art approaches in Generative Adversarial Networks (GANs) are able to learn a mapping function from one image domain to another with unpaired image data. However, these methods often produce artifacts and can only be able to convert low-level information, but fail to transfer high-level semantic part of images. The reason is mainly that generators do not have the ability to detect… ▽ More

    Submitted 27 August, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: 8 pages, 7 figures, Accepted to IJCNN 2019

    Journal ref: IJCNN 2019

  38. arXiv:1903.04202  [pdf, other

    cs.CV

    Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation

    Authors: Andrea Pilzer, Stéphane Lathuilière, Nicu Sebe, Elisa Ricci

    Abstract: Nowadays, the majority of state of the art monocular depth estimation techniques are based on supervised deep learning models. However, collecting RGB images with associated depth maps is a very time consuming procedure. Therefore, recent works have proposed deep architectures for addressing the monocular depth prediction task as a reconstruction problem, thus avoiding the need of collecting groun… ▽ More

    Submitted 20 April, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: Accepted at CVPR2019

  39. arXiv:1903.03215  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation using Feature-Whitening and Consensus Loss

    Authors: Subhankar Roy, Aliaksandr Siarohin, Enver Sangineto, Samuel Rota Bulo, Nicu Sebe, Elisa Ricci

    Abstract: A classifier trained on a dataset seldom works on other datasets obtained under different conditions due to domain shift. This problem is commonly addressed by domain adaptation methods. In this work we introduce a novel deep learning framework which unifies different paradigms in unsupervised domain adaptation. Specifically, we propose domain alignment layers which implement feature whitening for… ▽ More

    Submitted 16 February, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: CVPR 2019

  40. arXiv:1901.09774  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Attribute-Guided Sketch Generation

    Authors: Hao Tang, Xinya Chen, Wei Wang, Dan Xu, Jason J. Corso, Nicu Sebe, Yan Yan

    Abstract: Facial attributes are important since they provide a detailed description and determine the visual appearance of human faces. In this paper, we aim at converting a face image to a sketch while simultaneously generating facial attributes. To this end, we propose a novel Attribute-Guided Sketch Generative Adversarial Network (ASGAN) which is an end-to-end framework and contains two pairs of generato… ▽ More

    Submitted 14 April, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: 7 pages, 6 figures, accepted to FG 2019

  41. arXiv:1901.04622  [pdf, other

    cs.CV

    Fast and Robust Dynamic Hand Gesture Recognition via Key Frames Extraction and Feature Fusion

    Authors: Hao Tang, Hong Liu, Wei Xiao, Nicu Sebe

    Abstract: Gesture recognition is a hot topic in computer vision and pattern recognition, which plays a vitally important role in natural human-computer interface. Although great progress has been made recently, fast and robust hand gesture recognition remains an open problem, since the existing methods have not well balanced the performance and the efficiency simultaneously. To bridge it, this work combines… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: 11 pages, 3 figures, accepted to NeuroComputing

  42. arXiv:1901.04604  [pdf, other

    cs.CV

    Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

    Authors: Hao Tang, Dan Xu, Wei Wang, Yan Yan, Nicu Sebe

    Abstract: State-of-the-art methods for image-to-image translation with Generative Adversarial Networks (GANs) can learn a mapping from one domain to another domain using unpaired image data. However, these methods require the training of one specific model for every pair of image domains, which limits the scalability in dealing with more than two image domains. In addition, the training stage of these metho… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: 16 pages, 7 figures, accepted to ACCV 2018

  43. arXiv:1901.01868  [pdf, other

    cs.CV cs.LG

    Low-Shot Learning from Imaginary 3D Model

    Authors: Frederik Pahde, Mihai Puscas, Jannik Wolff, Tassilo Klein, Nicu Sebe, Moin Nabi

    Abstract: Since the advent of deep learning, neural networks have demonstrated remarkable results in many visual recognition tasks, constantly pushing the limits. However, the state-of-the-art approaches are largely unsuitable in scarce data regimes. To address this shortcoming, this paper proposes employing a 3D model, which is derived from training images. Such a model can then be used to hallucinate nove… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

    Comments: To appear at WACV 2019. arXiv admin note: text overlap with arXiv:1811.09192

  44. arXiv:1812.11771  [pdf, other

    cs.CV

    Predicting Group Cohesiveness in Images

    Authors: Shreya Ghosh, Abhinav Dhall, Nicu Sebe, Tom Gedeon

    Abstract: The cohesiveness of a group is an essential indicator of the emotional state, structure and success of a group of people. We study the factors that influence the perception of group-level cohesion and propose methods for estimating the human-perceived cohesion on the group cohesiveness scale. In order to identify the visual cues (attributes) for cohesion, we conducted a user survey. Image analysis… ▽ More

    Submitted 7 April, 2019; v1 submitted 31 December, 2018; originally announced December 2018.

  45. arXiv:1812.08861  [pdf, other

    cs.GR cs.CV cs.LG stat.ML

    Animating Arbitrary Objects via Deep Motion Transfer

    Authors: Aliaksandr Siarohin, Stéphane Lathuilière, Sergey Tulyakov, Elisa Ricci, Nicu Sebe

    Abstract: This paper introduces a novel deep learning framework for image animation. Given an input image with a target object and a driving video sequence depicting a moving object, our framework generates a video in which the target object is animated according to the driving sequence. This is achieved through a deep architecture that decouples appearance and motion information. Our framework consists of… ▽ More

    Submitted 30 August, 2019; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: CVPR-2019 (oral)

  46. arXiv:1812.00717  [pdf, other

    stat.ML cs.LG

    Enhancing Perceptual Attributes with Bayesian Style Generation

    Authors: Aliaksandr Siarohin, Gloria Zen, Nicu Sebe, Elisa Ricci

    Abstract: Deep learning has brought an unprecedented progress in computer vision and significant advances have been made in predicting subjective properties inherent to visual data (e.g., memorability, aesthetic quality, evoked emotions, etc.). Recently, some research works have even proposed deep learning approaches to modify images such as to appropriately alter these properties. Following this research l… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: ACCV-2018

  47. arXiv:1809.04185  [pdf, other

    cs.CV

    Deep Micro-Dictionary Learning and Coding Network

    Authors: Hao Tang, Heng Wei, Wei Xiao, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe

    Abstract: In this paper, we propose a novel Deep Micro-Dictionary Learning and Coding Network (DDLCN). DDLCN has most of the standard deep learning layers (pooling, fully, connected, input/output, etc.) but the main difference is that the fundamental convolutional layers are replaced by novel compound dictionary learning and coding layers. The dictionary learning layer learns an over-complete dictionary for… ▽ More

    Submitted 25 December, 2018; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: 10 page, 8 figures, accepted to WACV 2019

  48. arXiv:1808.04859  [pdf, other

    cs.CV

    GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

    Authors: Hao Tang, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe

    Abstract: Hand gesture-to-gesture translation in the wild is a challenging task since hand gestures can have arbitrary poses, sizes, locations and self-occlusions. Therefore, this task requires a high-level understanding of the mapping between the input source gesture and the output target gesture. To tackle this problem, we propose a novel hand Gesture Generative Adversarial Network (GestureGAN). GestureGA… ▽ More

    Submitted 19 July, 2019; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: 9 pages, 7 figures, accepted to ACM MM 2018 as an oral paper, fix typos

  49. arXiv:1807.10915  [pdf, other

    cs.CV

    Unsupervised Adversarial Depth Estimation using Cycled Generative Networks

    Authors: Andrea Pilzer, Dan Xu, Mihai Marian Puscas, Elisa Ricci, Nicu Sebe

    Abstract: While recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance, costly ground truth annotations are required during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps and show that the depth estimation task can be effectively tackled within an adversarial lear… ▽ More

    Submitted 28 July, 2018; originally announced July 2018.

    Comments: To appear in 3DV 2018. Code is available on GitHub

  50. arXiv:1806.00420  [pdf, other

    stat.ML cs.LG

    Whitening and Coloring batch transform for GANs

    Authors: Aliaksandr Siarohin, Enver Sangineto, Nicu Sebe

    Abstract: Batch Normalization (BN) is a common technique used to speed-up and stabilize training. On the other hand, the learnable parameters of BN are commonly used in conditional Generative Adversarial Networks (cGANs) for representing class-specific information using conditional Batch Normalization (cBN). In this paper we propose to generalize both BN and cBN using a Whitening and Coloring based batch no… ▽ More

    Submitted 25 February, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: ICLR 2019