Showing 1–17 of 17 results for author: Aarabi, P

  1. arXiv:2405.18322  [pdf, other]

    cs.CV cs.AI

    SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

    Authors: Kejia Yin, Varshanth R. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell

    Abstract: Self-supervised landmark estimation is a challenging task that demands the formation of locally distinct feature representations to identify sparse facial landmarks in the absence of annotated data. To tackle this task, existing state-of-the-art (SOTA) methods (1) extract coarse features from backbones that are trained with instance-level self-supervised learning (SSL) paradigms, which neglect the…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  2. arXiv:2310.06667  [pdf, other]

    cs.CV cs.LG

    SC2GAN: Rethinking Entanglement by Self-correcting Correlated GAN Space

    Authors: Zikun Chen, Han Zhao, Parham Aarabi, Ruowei Jiang

    Abstract: Generative Adversarial Networks (GANs) can synthesize realistic images, with the learned latent space shown to encode rich semantic information with various interpretable directions. However, due to the unstructured nature of the learned latent space, it inherits the bias from the training data where specific groups of visual attributes that are not causally related tend to appear together, a phen…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to the Out Of Distribution Generalization in Computer Vision workshop at ICCV2023

  3. arXiv:2309.03974  [pdf, other]

    cs.LG

    DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation

    Authors: Pau Mulet Arabi, Alec Flowers, Lukas Mauch, Fabien Cardinaux

    Abstract: Computing gradients of an expectation with respect to the distributional parameters of a discrete distribution is a problem arising in many fields of science and engineering. Typically, this problem is tackled using Reinforce, which frames the problem of gradient estimation as a Monte Carlo simulation. Unfortunately, the Reinforce estimator is especially sensitive to discrepancies between the true…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 22 pages, 7 figures

    ACM Class: I.2.0

  4. arXiv:2303.13755  [pdf, other]

    cs.CV cs.AI cs.LG

    Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers

    Authors: Cong Wei, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor, Florian Shkurti

    Abstract: Vision Transformers (ViT) have shown their competitive advantages performance-wise compared to convolutional neural networks (CNNs) though they often come with high computational costs. To this end, previous methods explore different attention patterns by limiting a fixed number of spatially nearby tokens to accelerate the ViT's multi-head self-attention (MHSA) operations. However, such structured…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  5. arXiv:2209.00698  [pdf, other]

    cs.CV

    Exploring Gradient-based Multi-directional Controls in GANs

    Authors: Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi

    Abstract: Generative Adversarial Networks (GANs) have been widely applied in modeling diverse image distributions. However, despite their impressive applications, the structure of the latent space in GANs largely remains a black box, leaving controllable generation an open problem, especially when spurious correlations between different semantic attributes exist in the image distributions. To address t…

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted to ECCV 2022 (oral)

  6. arXiv:2105.06407  [pdf, other]

    cs.CV cs.AI cs.GR

    Deep Graphics Encoder for Real-Time Video Makeup Synthesis from Example

    Authors: Robin Kips, Ruowei Jiang, Sileye Ba, Edmund Phung, Parham Aarabi, Pietro Gori, Matthieu Perrot, Isabelle Bloch

    Abstract: While makeup virtual-try-on is now widespread, parametrizing a computer graphics rendering engine for synthesizing images of a given cosmetics product remains a challenging task. In this paper, we introduce an inverse computer graphics method for automatic makeup synthesis from a reference image, by learning a model that maps an example portrait image with makeup to the space of rendering paramete…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: CVPR 2021 Workshop AI for Content Creation

  7. arXiv:2105.00020  [pdf, other]

    cs.CV

    Continuous Face Aging via Self-estimated Residual Age Embedding

    Authors: Zeqi Li, Ruowei Jiang, Parham Aarabi

    Abstract: Face synthesis, and face aging in particular, has been one of the major topics that witnessed a substantial improvement in image fidelity by using generative adversarial networks (GANs). Most existing face aging approaches divide the dataset into several age groups and leverage group-based training strategies, which lack the ability to provide fine-controlled continuous aging synthesis in…

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: Accepted to CVPR 2021

  8. arXiv:2104.15082  [pdf, other]

    cs.CV

    Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation

    Authors: Zeqi Li, Ruowei Jiang, Parham Aarabi

    Abstract: Generative adversarial networks (GANs) have shown significant potential in modeling high dimensional distributions of image data, especially on image-to-image translation tasks. However, due to the complexity of these tasks, state-of-the-art models often contain a tremendous number of parameters, which results in large model size and long inference time. In this work, we propose a novel method to…

    Submitted 18 May, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted to ECCV 2020

  9. arXiv:2103.17105  [pdf, other]

    cs.CV

    The GIST and RIST of Iterative Self-Training for Semi-Supervised Segmentation

    Authors: Eu Wern Teh, Terrance DeVries, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor

    Abstract: We consider the task of semi-supervised semantic segmentation, where we aim to produce pixel-wise semantic object masks given only a small number of human-labeled training examples. We focus on iterative self-training methods in which we explore the behavior of self-training over multiple refinement stages. We show that iterative self-training leads to performance degradation if done naïvely with…

    Submitted 28 April, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: To appear in the Conference on Computer and Robot Vision (CRV), 2022

  10. arXiv:2103.03891  [pdf, other]

    cs.CV cs.LG

    LOHO: Latent Optimization of Hairstyles via Orthogonalization

    Authors: Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi

    Abstract: Hairstyle transfer is challenging due to hair structure differences in the source and target hair. Therefore, we propose Latent Optimization of Hairstyles via Orthogonalization (LOHO), an optimization-based approach using GAN inversion to infill missing hair structure details in latent space during hairstyle transfer. Our approach decomposes hair into three attributes: perceptual structure, appear…

    Submitted 10 March, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  11. arXiv:2101.08833  [pdf, other]

    cs.CV

    SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation

    Authors: Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor

    Abstract: In this paper we introduce a Transformer-based approach to video object segmentation (VOS). To address compounding error and scalability issues of prior work, we propose a scalable, end-to-end method for VOS called Sparse Spatiotemporal Transformers (SST). SST extracts per-pixel representations for each object in a video using sparse attention over spatiotemporal features. Our attention-based form…

    Submitted 28 March, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

    Comments: CVPR 2021 (Oral)

  12. arXiv:1906.02260  [pdf, other]

    cs.CV

    Lightweight Real-time Makeup Try-on in Mobile Browsers with Tiny CNN Models for Facial Tracking

    Authors: TianXing Li, Zhi Yu, Edmund Phung, Brendan Duke, Irina Kezele, Parham Aarabi

    Abstract: Recent works on convolutional neural networks (CNNs) for facial alignment have demonstrated unprecedented accuracy on a variety of large, publicly available datasets. However, the developed models are often both cumbersome and computationally expensive, and are not adapted to applications on resource restricted devices. In this work, we look into developing and training compact facial alignment mo…

    Submitted 11 June, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: 4 pages, Third Workshop on Computer Vision for AR/VR

  13. arXiv:1906.02222  [pdf, other]

    cs.CV cs.LG

    Nail Polish Try-On: Realtime Semantic Segmentation of Small Objects for Native and Browser Smartphone AR Applications

    Authors: Brendan Duke, Abdalla Ahmed, Edmund Phung, Irina Kezele, Parham Aarabi

    Abstract: We provide a system for semantic segmentation of small objects that enables nail polish try-on AR applications to run client-side in realtime in native and web mobile applications. By adjusting input resolution and neural network depth, our model design enables a smooth trade-off of performance and runtime, with the highest performance setting achieving 94.5 mIoU at 29.8ms runtime in native…

    Submitted 10 June, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: 4 pages, 3 figures. CVPRW 2019: Third Workshop on Computer Vision for AR/VR

  14. arXiv:1805.12302  [pdf, other]

    cs.CV cs.LG

    Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization

    Authors: Avishek Joey Bose, Parham Aarabi

    Abstract: Adversarial attacks involve adding small, often imperceptible, perturbations to inputs with the goal of getting a machine learning model to misclassify them. While many different adversarial attack strategies have been proposed on image classification models, object detection pipelines have been much harder to break. In this paper, we propose a novel strategy to craft adversarial examples by s…

    Submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to IEEE MMSP

  15. arXiv:1712.07168  [pdf, other]

    cs.CV

    Real-time deep hair matting on mobile devices

    Authors: Alex Levinshtein, Cheng Chang, Edmund Phung, Irina Kezele, Wenzhangzhi Guo, Parham Aarabi

    Abstract: Augmented reality is an emerging technology in many application domains. Among them is the beauty industry, where live virtual try-on of beauty products is of great importance. In this paper, we address the problem of live hair color augmentation. To achieve this goal, hair needs to be segmented quickly and accurately. We show how a modified MobileNet CNN architecture can be used to segment the ha…

    Submitted 10 January, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

    Comments: 7 pages, 7 figures, submitted to CRV 2018

  16. arXiv:1712.02822  [pdf, other]

    cs.CV

    Hybrid eye center localization using cascaded regression and hand-crafted model fitting

    Authors: Alex Levinshtein, Edmund Phung, Parham Aarabi

    Abstract: We propose a new cascaded regressor for eye center detection. Previous methods start from a face or an eye detector and use either advanced features or powerful regressors for eye center localization, but not both. Instead, we detect the eyes more accurately using an existing facial feature alignment method. We improve the robustness of localization by using both advanced features and powerful reg…

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: 12 pages, 5 figures, submitted to Journal of Image and Vision Computing

  17. arXiv:1708.02238  [pdf, other]

    cs.IR cs.NE

    A Convolutional Neural Network for Search Term Detection

    Authors: Hojjat Salehinejad, Joseph Barfett, Parham Aarabi, Shahrokh Valaee, Errol Colak, Bruce Gray, Tim Dowdell

    Abstract: Pathfinding in hospitals is challenging for patients, visitors, and even employees. Many people have experienced getting lost due to a lack of clear guidance, the large footprint of hospitals, and a confusing array of hospital wings. In this paper, we propose Halo, an indoor navigation application based on voice-user interaction to help provide directions for users without assistance of a localization sys…

    Submitted 7 November, 2017; v1 submitted 6 August, 2017; originally announced August 2017.

    Comments: This paper is accepted for presentation at 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications