Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Tzelepis, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09153  [pdf, other

    cs.CV

    Are CLIP features all you need for Universal Synthetic Image Origin Attribution?

    Authors: Dario Cioni, Christos Tzelepis, Lorenzo Seidenari, Ioannis Patras

    Abstract: The steady improvement of Diffusion Models for visual synthesis has given rise to many new and interesting use cases of synthetic images but also has raised concerns about their potential abuse, which poses significant societal threats. To address this, fake images need to be detected and attributed to their source model, and given the frequent release of new generators, realistic applications nee… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: Accepted at ECCV 2024 TWYN workshop

  2. arXiv:2407.06635  [pdf, other

    cs.CV stat.ML

    Ensembled Cold-Diffusion Restorations for Unsupervised Anomaly Detection

    Authors: Sergio Naval Marimont, Vasilis Siomos, Matthew Baugh, Christos Tzelepis, Bernhard Kainz, Giacomo Tarroni

    Abstract: Unsupervised Anomaly Detection (UAD) methods aim to identify anomalies in test samples comparing them with a normative distribution learned from a dataset known to be anomaly-free. Approaches based on generative models offer interpretability by generating anomaly-free versions of test images, but are typically unable to identify subtle anomalies. Alternatively, approaches using feature modelling o… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 8 pages, 3 figures. MICCAI 2024

  3. arXiv:2403.17217  [pdf, other

    cs.CV cs.AI

    DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment

    Authors: Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos

    Abstract: Video-driven neural face reenactment aims to synthesize realistic facial images that successfully preserve the identity and appearance of a source face, while transferring the target head pose and facial expressions. Existing GAN-based methods suffer from either distortions and visual artifacts or poor reconstruction quality, i.e., the background and several important appearance details, such as h… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://stelabou.github.io/diffusionact/

  4. arXiv:2402.12550  [pdf, other

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE/

  5. arXiv:2402.03553  [pdf, other

    cs.CV

    One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space

    Authors: Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos

    Abstract: In this paper, we present our framework for neural face/head reenactment whose goal is to transfer the 3D head orientation and expression of a target face to a source face. Previous methods focus on learning embedding networks for identity and head pose/expression disentanglement which proves to be a rather hard task, degrading the quality of the generated images. We take a different approach, byp… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Preprint version, accepted for publication in International Journal of Computer Vision (IJCV)

  6. arXiv:2311.15453  [pdf, other

    cs.CV eess.IV

    DISYRE: Diffusion-Inspired SYnthetic REstoration for Unsupervised Anomaly Detection

    Authors: Sergio Naval Marimont, Matthew Baugh, Vasilis Siomos, Christos Tzelepis, Bernhard Kainz, Giacomo Tarroni

    Abstract: Unsupervised Anomaly Detection (UAD) techniques aim to identify and localize anomalies without relying on annotations, only leveraging a model trained on a dataset known to be free of anomalies. Diffusion models learn to modify inputs $x$ to increase the probability of it belonging to a desired distribution, i.e., they model the score function $\nabla_x \log p(x)$. Such a score function is potenti… ▽ More

    Submitted 5 March, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 5 pages, 3 figures. Accepted for publication in ISBI 2024

  7. arXiv:2311.01573  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Improving Fairness using Vision-Language Driven Image Augmentation

    Authors: Moreno D'Incà, Christos Tzelepis, Ioannis Patras, Nicu Sebe

    Abstract: Fairness is crucial when training a deep-learning discriminative model, especially in the facial domain. Models tend to correlate specific characteristics (such as age and skin color) with unrelated attributes (downstream tasks), resulting in biases which do not correspond to reality. It is common knowledge that these correlations are present in the data and are then transferred to the models duri… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in WACV 2024

  8. arXiv:2307.10797  [pdf, other

    cs.CV

    HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces

    Authors: Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos

    Abstract: In this paper, we present our method for neural face reenactment, called HyperReenact, that aims to generate realistic talking head images of a source identity, driven by a target facial pose. Existing state-of-the-art face reenactment methods train controllable generative models that learn to synthesize realistic facial images, yet producing reenacted faces that are prone to significant visual ar… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in ICCV 2023. Project page: https://stelabou.github.io/hyperreenact.github.io/ Code: https://github.com/StelaBou/HyperReenact

  9. arXiv:2305.14053  [pdf, other

    cs.CV cs.LG

    Parts of Speech-Grounded Subspaces in Vision-Language Models

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Latent image representations arising from vision-language models have proved immensely useful for a variety of downstream tasks. However, their utility is limited by their entanglement with respect to different visual attributes. For instance, recent work has shown that CLIP image representations are often biased toward specific visual properties (such as objects or actions) in an unpredictable ma… ▽ More

    Submitted 12 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  10. arXiv:2304.03378  [pdf, other

    cs.CV cs.LG

    Self-Supervised Video Similarity Learning

    Authors: Giorgos Kordopatis-Zilos, Giorgos Tolias, Christos Tzelepis, Ioannis Kompatsiaris, Ioannis Patras, Symeon Papadopoulos

    Abstract: We introduce S$^2$VS, a video similarity learning approach with self-supervision. Self-Supervised Learning (SSL) is typically used to train deep models on a proxy task so as to have strong transferability on target tasks after fine-tuning. Here, in contrast to prior work, SSL is used to perform video similarity learning and address multiple retrieval and detection tasks at once with no use of labe… ▽ More

    Submitted 16 June, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  11. arXiv:2303.11296  [pdf, other

    cs.CV

    Attribute-preserving Face Dataset Anonymization via Latent Code Optimization

    Authors: Simone Barattin, Christos Tzelepis, Ioannis Patras, Nicu Sebe

    Abstract: This work addresses the problem of anonymizing the identity of faces in a dataset of images, such that the privacy of those depicted is not violated, while at the same time the dataset is useful for downstream task such as for training machine learning models. To the best of our knowledge, we are the first to explicitly address this issue and deal with two major drawbacks of the existing state-of-… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in CVPR 2023

  12. arXiv:2209.13375  [pdf, other

    cs.CV

    StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment

    Authors: Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos

    Abstract: In this paper we address the problem of neural face reenactment, where, given a pair of a source and a target facial image, we need to transfer the target's pose (defined as the head pose and its facial expressions) to the source image, by preserving at the same time the source's identity characteristics (e.g., facial shape, hair style, etc), even in the challenging case where the source and the t… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted for publication in IEEE FG 2023. Code: https://github.com/StelaBou/StyleMask

  13. arXiv:2206.02104  [pdf, other

    cs.CV

    ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences

    Authors: Christos Tzelepis, James Oldfield, Georgios Tzimiropoulos, Ioannis Patras

    Abstract: This work addresses the problem of discovering non-linear interpretable paths in the latent space of pre-trained GANs in a model-agnostic manner. In the proposed method, the discovery is driven by a set of pairs of natural language sentences with contrasting semantics, named semantic dipoles, that serve as the limits of the interpretation that we require by the trainable latent paths to encode. By… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  14. arXiv:2206.00048  [pdf, other

    cs.CV cs.LG

    PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Recent advances in the understanding of Generative Adversarial Networks (GANs) have led to remarkable progress in visual editing and synthesis tasks, capitalizing on the rich semantics that are embedded in the latent spaces of pre-trained GANs. However, existing methods are often tailored to specific GAN architectures and are limited to either discovering global semantic directions that do not fac… ▽ More

    Submitted 6 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at ICLR 2023. Code available at: https://github.com/james-oldfield/PandA

  15. arXiv:2109.13357  [pdf, other

    cs.CV

    WarpedGANSpace: Finding non-linear RBF paths in GAN latent space

    Authors: Christos Tzelepis, Georgios Tzimiropoulos, Ioannis Patras

    Abstract: This work addresses the problem of discovering, in an unsupervised manner, interpretable paths in the latent space of pretrained GANs, so as to provide an intuitive and easy way of controlling the underlying generative factors. In doing so, it addresses some of the limitations of the state-of-the-art works, namely, a) that they discover directions that are independent of the latent code, i.e., pat… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in ICCV 2021

  16. DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval

    Authors: Giorgos Kordopatis-Zilos, Christos Tzelepis, Symeon Papadopoulos, Ioannis Kompatsiaris, Ioannis Patras

    Abstract: In this paper, we address the problem of high performance and computationally efficient content-based video retrieval in large-scale datasets. Current methods typically propose either: (i) fine-grained approaches employing spatio-temporal representations and similarity calculations, achieving high performance at a high computational cost or (ii) coarse-grained approaches representing/indexing vide… ▽ More

    Submitted 5 August, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    Journal ref: International Journal of Computer Vision (2022)

  17. Few-Shot Action Localization without Knowing Boundaries

    Authors: Ting-Ting Xie, Christos Tzelepis, Fan Fu, Ioannis Patras

    Abstract: Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the literature has typically been addressed assuming the availability of large amounts of annotated training samples for each class -- either in a fully-supervised setting, where action boundaries are known, or in a weakly-supervised setting, where only class labels are known for each video. In this paper… ▽ More

    Submitted 23 September, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: ICMR21 Camera ready; link to code: https://github.com/June01/WFSAL-icmr21

  18. arXiv:2102.06064  [pdf, other

    cs.LG

    Uncertainty Propagation in Convolutional Neural Networks: Technical Report

    Authors: Christos Tzelepis, Ioannis Patras

    Abstract: In this technical report we study the problem of propagation of uncertainty (in terms of variances of given uni-variate normal random variables) through typical building blocks of a Convolutional Neural Network (CNN). These include layers that perform linear operations, such as 2D convolutions, fully-connected, and average pooling layers, as well as layers that act non-linearly on their input, suc… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: A PyTorch implementation is available under the MIT license here: https://github.com/chi0tzp/uacnn

  19. arXiv:2008.11254  [pdf, ps, other

    cs.CV

    Temporal Action Localization with Variance-Aware Networks

    Authors: Ting-Ting Xie, Christos Tzelepis, Ioannis Patras

    Abstract: This work addresses the problem of temporal action localization with Variance-Aware Networks (VAN), i.e., DNNs that use second-order statistics in the input and/or the output of regression tasks. We first propose a network (VANp) that when presented with the second-order statistics of the input, i.e., each sample has a mean and a variance, it propagates the mean and the variance throughout the net… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Journal paper; Under review

  20. arXiv:2008.11170  [pdf, other

    cs.CV

    Boundary Uncertainty in a Single-Stage Temporal Action Localization Network

    Authors: Ting-Ting Xie, Christos Tzelepis, Ioannis Patras

    Abstract: In this paper, we address the problem of temporal action localization with a single stage neural network. In the proposed architecture we model the boundary predictions as uni-variate Gaussian distributions in order to model their uncertainties, which is the first in this area to the best of our knowledge. We use two uncertainty-aware boundary regression losses: first, the Kullback-Leibler diverge… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Tech report

  21. Learning to detect video events from zero or very few video examples

    Authors: Christos Tzelepis, Damianos Galanopoulos, Vasileios Mezaris, Ioannis Patras

    Abstract: In this work we deal with the problem of high-level event detection in video. Specifically, we study the challenging problems of i) learning to detect video events from solely a textual description of the event, without using any positive video examples, and ii) additionally exploiting very few positive training samples together with a small number of ``related'' videos. For learning only from an… ▽ More

    Submitted 25 November, 2015; originally announced November 2015.

    Comments: Image and Vision Computing Journal, Elsevier, 2015, accepted for publication

    Journal ref: Image and Vision Computing Journal, Elsevier, 2015

  22. Linear Maximum Margin Classifier for Learning from Uncertain Data

    Authors: Christos Tzelepis, Vasileios Mezaris, Ioannis Patras

    Abstract: In this paper, we propose a maximum margin classifier that deals with uncertainty in data input. More specifically, we reformulate the SVM framework such that each training example can be modeled by a multi-dimensional Gaussian distribution described by its mean vector and its covariance matrix -- the latter modeling the uncertainty. We address the classification problem and define a cost function… ▽ More

    Submitted 19 November, 2017; v1 submitted 15 April, 2015; originally announced April 2015.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence. (c) 2017 IEEE. DOI: 10.1109/TPAMI.2017.2772235 Author's accepted version. The final publication is available at http://ieeexplore.ieee.org/document/8103808/