Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Qureshi, F Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13715  [pdf, other

    cs.CV cs.LG

    Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning

    Authors: Ans Munir, Faisal Z. Qureshi, Muhammad Haris Khan, Mohsen Ali

    Abstract: Compositional Zero-Shot Learning (CZSL) aims to predict unknown compositions made up of attribute and object pairs. Predicting compositions unseen during training is a challenging task. We are exploring Open World Compositional Zero-Shot Learning (OW-CZSL) in this study, where our test space encompasses all potential combinations of attributes and objects. Our approach involves utilizing the self-… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 10 pages, 6 figures

  2. arXiv:2312.01558  [pdf, other

    cs.CV eess.IV

    Hyperspectral Image Compression Using Sampling and Implicit Neural Representations

    Authors: Shima Rezasoltani, Faisal Z. Qureshi

    Abstract: Hyperspectral images, which record the electromagnetic spectrum for a pixel in the image of a scene, often store hundreds of channels per pixel and contain an order of magnitude more information than a similarly-sized RBG color image. Consequently, concomitant with the decreasing cost of capturing these images, there is a need to develop efficient techniques for storing, transmitting, and analyzin… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  3. arXiv:2311.10701  [pdf, other

    cs.CV cs.LG eess.IV

    SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing

    Authors: Soham Chitnis, Kiran Mantripragada, Faisal Z. Qureshi

    Abstract: The hyperspectral pixel unmixing aims to find the underlying materials (endmembers) and their proportions (abundances) in pixels of a hyperspectral image. This work extends the Latent Dirichlet Variational Autoencoder (LDVAE) pixel unmixing scheme by taking into account local spatial context while performing pixel unmixing. The proposed method uses an isotropic convolutional neural network with sp… ▽ More

    Submitted 24 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Accepted at IGARSS 2024

  4. arXiv:2305.17245  [pdf, other

    cs.CV

    Error Estimation for Single-Image Human Body Mesh Reconstruction

    Authors: Hamoon Jafarian, Faisal Z. Qureshi

    Abstract: Human pose and shape estimation methods continue to suffer in situations where one or more parts of the body are occluded. More importantly, these methods cannot express when their predicted pose is incorrect. This has serious consequences when these methods are used in human-robot interaction scenarios, where we need methods that can evaluate their predictions and flag situations where they might… ▽ More

    Submitted 30 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  5. arXiv:2302.04129  [pdf, other

    cs.CV eess.IV

    Hyperspectral Image Compression Using Implicit Neural Representation

    Authors: Shima Rezasoltani, Faisal Z. Qureshi

    Abstract: Hyperspectral images, which record the electromagnetic spectrum for a pixel in the image of a scene, often store hundreds of channels per pixel and contain an order of magnitude more information than a typical similarly-sized color image. Consequently, concomitant with the decreasing cost of capturing these images, there is a need to develop efficient techniques for storing, transmitting, and anal… ▽ More

    Submitted 8 February, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  6. arXiv:2203.02820  [pdf, other

    cs.CV

    Evaluation of Dirichlet Process Gaussian Mixtures for Segmentation on Noisy Hyperspectral Images

    Authors: Kiran Mantripragada, Faisal Z. Qureshi

    Abstract: Image segmentation is a fundamental step for the interpretation of Remote Sensing Images. Clustering or segmentation methods usually precede the classification task and are used as support tools for manual labeling. The most common algorithms, such as k-means, mean-shift, and MRS, require an extra manual step to find the scale parameter. The segmentation results are severely affected if the parame… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  7. Hyperspectral Pixel Unmixing with Latent Dirichlet Variational Autoencoder

    Authors: Kiran Mantripragada, Faisal Z. Qureshi

    Abstract: We present a method for hyperspectral pixel {\it unmixing}. The proposed method assumes that (1) {\it abundances} can be encoded as Dirichlet distributions and (2) spectra of {\it endmembers} can be represented as multivariate Normal distributions. The method solves the problem of abundance estimation and endmember extraction within a variational autoencoder setting where a Dirichlet bottleneck la… ▽ More

    Submitted 30 January, 2024; v1 submitted 2 March, 2022; originally announced March 2022.

  8. The Effects of Spectral Dimensionality Reduction on Hyperspectral Pixel Classification: A Case Study

    Authors: Kiran Mantripragada, Phuong D. Dao, Yuhong He, Faisal Z. Qureshi

    Abstract: This paper presents a systematic study of the effects of hyperspectral pixel dimensionality reduction on the pixel classification task. We use five dimensionality reduction methods -- PCA, KPCA, ICA, AE, and DAE -- to compress 301-dimensional hyperspectral pixels. Compressed pixels are subsequently used to perform pixel classifications. Pixel classification accuracies together with compression met… ▽ More

    Submitted 27 January, 2022; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: 15 pages

  9. Real-time Video Summarization on Commodity Hardware

    Authors: Wesley Taylor, Faisal Z. Qureshi

    Abstract: We present a method for creating video summaries in real-time on commodity hardware. Real-time here refers to the fact that the time required for video summarization is less than the duration of the input video. First, low-level features are use to discard undesirable frames. Next, video is divided into segments, and segment-level features are extracted for each segment. Tree-based models trained… ▽ More

    Submitted 26 January, 2019; originally announced January 2019.

    Comments: Appeared in Proc. 12th ACM International Conference on Distributed Smart Cameras (ICDSC 18), pages 8pp, Eidenhoven, September 2018

  10. arXiv:1901.05376  [pdf, other

    cs.CV

    Joint Spatial and Layer Attention for Convolutional Networks

    Authors: Tony Joseph, Konstantinos G. Derpanis, Faisal Z. Qureshi

    Abstract: In this paper, we propose a novel approach that learns to sequentially attend to different Convolutional Neural Networks (CNN) layers (i.e., ``what'' feature abstraction to attend to) and different spatial locations of the selected feature map (i.e., ``where'') to perform the task at hand. Specifically, at each Recurrent Neural Network (RNN) step, both a CNN layer and localized spatial region with… ▽ More

    Submitted 31 May, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

  11. arXiv:1901.00212  [pdf, other

    cs.CV

    EdgeConnect: Generative Image Inpainting with Adversarial Edge Learning

    Authors: Kamyar Nazeri, Eric Ng, Tony Joseph, Faisal Z. Qureshi, Mehran Ebrahimi

    Abstract: Over the last few years, deep learning techniques have yielded significant improvements in image inpainting. However, many of these techniques fail to reconstruct reasonable structures as they are commonly over-smoothed and/or blurry. This paper develops a new approach for image inpainting that does a better job of reproducing filled regions exhibiting fine details. We propose a two-stage adversar… ▽ More

    Submitted 11 January, 2019; v1 submitted 1 January, 2019; originally announced January 2019.

    Comments: Code and data: https://github.com/knazeri/edge-connect

  12. arXiv:1803.04969  [pdf, other

    cs.CV

    A Framework for Video-Driven Crowd Synthesis

    Authors: Jordan Stadler, Faisal Z. Qureshi

    Abstract: We present a framework for video-driven crowd synthesis. Motion vectors extracted from input crowd video are processed to compute global motion paths. These paths encode the dominant motions observed in the input video. These paths are then fed into a behavior-based crowd simulation framework, which is responsible for synthesizing crowd animations that respect the motion patterns observed in the v… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.