Showing 1–26 of 26 results for author: Jayasuriya, S

Searching in archive cs.
  1. arXiv:2408.16623  [pdf, other]

    cs.CV cs.LG eess.IV

    Turbulence Strength $C_n^2$ Estimation from Video using Physics-based Deep Learning

    Authors: Ripon Kumar Saha, Esen Salcin, Jihoo Kim, Joseph Smith, Suren Jayasuriya

    Abstract: Images captured from a long distance suffer from dynamic image distortion due to turbulent flow of air cells with random temperatures, and thus refractive indices. This phenomenon, known as image dancing, is commonly characterized by its refractive-index structure constant $C_n^2$ as a measure of the turbulence strength. For many applications such as atmospheric forecast model, long-range/astronom…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Code Available: https://github.com/Riponcs/Cn2Estimation

    Journal ref: Optics Express 30, 40854-40870 (2022)

  2. arXiv:2404.13605  [pdf, other]

    cs.CV eess.IV

    Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

    Authors: Ripon Kumar Saha, Dehao Qin, Nianyi Li, Jinwei Ye, Suren Jayasuriya

    Abstract: Tackling image degradation due to atmospheric turbulence, particularly in dynamic environment, remains a challenge for long-range imaging systems. Existing techniques have been primarily designed for static scenes or scenes with small motion. This paper presents the first segment-then-restore pipeline for restoring the videos of dynamic scenes in turbulent environment. We leverage mean optical flo…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Paper

  3. arXiv:2404.05024  [pdf, other]

    cs.CV cs.RO

    PathFinder: Attention-Driven Dynamic Non-Line-of-Sight Tracking with a Mobile Robot

    Authors: Shenbagaraj Kannapiran, Sreenithy Chandran, Suren Jayasuriya, Spring Berman

    Abstract: The study of non-line-of-sight (NLOS) imaging is growing due to its many potential applications, including rescue operations and pedestrian detection by self-driving cars. However, implementing NLOS imaging on a moving camera remains an open area of research. Existing NLOS imaging methods rely on time-resolved detectors and laser configurations that require precise optical alignment, making it dif…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: First two authors have equal contribution

  4. arXiv:2404.04687  [pdf, other]

    cs.CV cs.GR cs.LG

    Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion

    Authors: Ziyuan Qu, Omkar Vengurlekar, Mohamad Qadri, Kevin Zhang, Michael Kaess, Christopher Metzler, Suren Jayasuriya, Adithya Pediredla

    Abstract: Differentiable 3D-Gaussian splatting (GS) is emerging as a prominent technique in computer vision and graphics for reconstructing 3D scenes. GS represents a scene as a set of 3D Gaussians with varying opacities and employs a computationally efficient splatting operation along with analytical derivatives to compute the 3D Gaussian parameters given scene images captured from various viewpoints. Unfo…

    Submitted 5 July, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

  5. arXiv:2311.03572  [pdf, other]

    cs.CV

    Unsupervised Region-Growing Network for Object Segmentation in Atmospheric Turbulence

    Authors: Dehao Qin, Ripon Saha, Suren Jayasuriya, Jinwei Ye, Nianyi Li

    Abstract: Moving object segmentation in the presence of atmospheric turbulence is highly challenging due to turbulence-induced irregular and time-varying distortions. In this paper, we present an unsupervised approach for segmenting moving objects in videos downgraded by atmospheric turbulence. Our key approach is a detect-then-grow scheme: we first identify a small set of moving object pixels with high con…

    Submitted 4 August, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  6. arXiv:2310.17049  [pdf, other]

    cs.SD cs.AI eess.AS

    Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer

    Authors: Jianwei Zhang, Suren Jayasuriya, Visar Berisha

    Abstract: A good supervised embedding for a specific machine learning task is only sensitive to changes in the label of interest and is invariant to other confounding factors. We leverage the concept of repeatability from measurement theory to describe this property and propose to use the intra-class correlation coefficient (ICC) to evaluate the repeatability of embeddings. We then propose a novel regulariz…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  7. arXiv:2306.09909  [pdf, other]

    eess.IV cs.CV eess.SP

    Neural Volumetric Reconstruction for Coherent Synthetic Aperture Sonar

    Authors: Albert W. Reed, Juhyeon Kim, Thomas Blanford, Adithya Pediredla, Daniel C. Brown, Suren Jayasuriya

    Abstract: Synthetic aperture sonar (SAS) measures a scene from multiple views in order to increase the resolution of reconstructed imagery. Image reconstruction methods for SAS coherently combine measurements to focus acoustic energy onto the scene. However, image formation is typically under-constrained due to a limited number of measurements and bandlimited hardware, which limits the capabilities of exist…

    Submitted 16 June, 2023; originally announced June 2023.

  8. arXiv:2212.04923  [pdf, other]

    eess.SP cs.HC cs.LG

    Eulerian Phase-based Motion Magnification for High-Fidelity Vital Sign Estimation with Radar in Clinical Settings

    Authors: Md Farhan Tasnim Oshim, Toral Surti, Stephanie Carreiro, Deepak Ganesan, Suren Jayasuriya, Tauhidur Rahman

    Abstract: Efficient and accurate detection of subtle motion generated from small objects in noisy environments, as needed for vital sign monitoring, is challenging, but can be substantially improved with magnification. We developed a complex Gabor filter-based decomposition method to amplify phases at different spatial wavelength levels to magnify motion and extract 1D motion signals for fundamental frequen…

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted in IEEE Sensors 2022

  9. arXiv:2211.11836  [pdf]

    eess.IV cs.CV

    Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques

    Authors: David Ramirez, Suren Jayasuriya, Andreas Spanias

    Abstract: Mixed reality (MR) is a key technology which promises to change the future of warfare. An MR hybrid of physical outdoor environments and virtual military training will enable engagements with long distance enemies, both real and simulated. To enable this technology, a large-scale 3D model of a physical environment must be maintained based on live sensor observations. 3D reconstruction algorithms s…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to 2022 Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC), 13 pages

  10. arXiv:2211.09858  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection

    Authors: Jianwei Zhang, Julie Liss, Suren Jayasuriya, Visar Berisha

    Abstract: Approximately 1.2% of the world's population has impaired voice production. As a result, automatic dysphonic voice detection has attracted considerable academic and clinical interest. However, existing methods for automated voice assessment often fail to generalize outside the training conditions or to other related applications. In this paper, we propose a deep learning framework for generating a…

    Submitted 26 January, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: This manuscript was submitted on July 06, 2022 to IEEE/ACM Transactions on Audio, Speech, and Language Processing for peer review

  11. arXiv:2112.09775  [pdf, other]

    cs.CV cs.AR

    Adaptive Subsampling for ROI-based Visual Tracking: Algorithms and FPGA Implementation

    Authors: Odrika Iqbal, Victor Isaac Torres Muro, Sameeksha Katoch, Andreas Spanias, Suren Jayasuriya

    Abstract: There is tremendous scope for improving the energy efficiency of embedded vision systems by incorporating programmable region-of-interest (ROI) readout in the image sensor design. In this work, we study how ROI programmability can be leveraged for tracking applications by anticipating where the ROI will be located in future frames and switching pixels off outside of this region. We refer to this p…

    Submitted 17 January, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  12. arXiv:2112.08539  [pdf, other]

    cs.CV eess.IV

    Implicit Neural Representations for Deconvolving SAS Images

    Authors: Albert Reed, Thomas Blanford, Daniel C. Brown, Suren Jayasuriya

    Abstract: Synthetic aperture sonar (SAS) image resolution is constrained by waveform bandwidth and array geometry. Specifically, the waveform bandwidth determines a point spread function (PSF) that blurs the locations of point scatterers in the scene. In theory, deconvolving the reconstructed SAS image with the scene PSF restores the original distribution of scatterers and yields sharper reconstructions. Ho…

    Submitted 15 December, 2021; originally announced December 2021.

  13. arXiv:2108.05563  [pdf, other]

    cs.CV eess.IV

    Deep Camera Obscura: An Image Restoration Pipeline for Lensless Pinhole Photography

    Authors: Joshua D. Rego, Huaijin Chen, Shuai Li, Jinwei Gu, Suren Jayasuriya

    Abstract: The lensless pinhole camera is perhaps the earliest and simplest form of an imaging system using only a pinhole-sized aperture in place of a lens. They can capture an infinite depth-of-field and offer greater freedom from optical distortion over their lens-based counterparts. However, the inherent limitations of a pinhole system result in lower sharpness from blur caused by optical diffraction and…

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 11 pages, 10 figures

  14. Restoring degraded speech via a modified diffusion model

    Authors: Jianwei Zhang, Suren Jayasuriya, Visar Berisha

    Abstract: There are many deterministic mathematical operations (e.g. compression, clipping, downsampling) that degrade speech quality considerably. In this paper we introduce a neural network architecture, based on a modification of the DiffWave model, that aims to restore the original speech signal. DiffWave, a recently published diffusion-based vocoder, has shown state-of-the-art synthesized speech qualit…

    Submitted 2 September, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Journal ref: Proc. Interspeech 2021, 221-225, 2021

  15. arXiv:2007.05996  [pdf, other]

    cs.CV eess.IV physics.ao-ph

    Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model

    Authors: John Janiczek, Parth Thaker, Gautam Dasarathy, Christopher S. Edwards, Philip Christensen, Suren Jayasuriya

    Abstract: Hyperspectral unmixing is an important remote sensing task with applications including material identification and analysis. Characteristic spectral features make many pure materials identifiable from their visible-to-infrared spectra, but quantifying their presence within a mixture is a challenging task due to nonlinearities and factors of variation. In this paper, spectral variation is considere…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: 36 pages, 11 figures. Accepted to European Conference on Computer Vision (ECCV) 2020

  16. arXiv:1909.06436  [pdf, other]

    eess.IV cs.CV

    Coupling Rendering and Generative Adversarial Networks for Artificial SAS Image Generation

    Authors: Albert Reed, Isaac Gerg, John McKay, Daniel Brown, David Williams, Suren Jayasuriya

    Abstract: Acquisition of Synthetic Aperture Sonar (SAS) datasets is bottlenecked by the costly deployment of SAS imaging systems, and even when data acquisition is possible, the data is often skewed towards containing barren seafloor rather than objects of interest. We present a novel pipeline, called SAS GAN, which couples an optical renderer with a generative adversarial network (GAN) to synthesize realist…

    Submitted 2 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: 10 pages, 9 figures. Submitted to IEEE OCEANS 2019 (Seattle). Updated acknowledgements

  17. arXiv:1905.11595  [pdf, other]

    eess.IV cs.CV

    Adaptive Lighting for Data-Driven Non-Line-of-Sight 3D Localization and Object Identification

    Authors: Sreenithy Chandran, Suren Jayasuriya

    Abstract: Non-line-of-sight (NLOS) imaging of objects not visible to either the camera or illumination source is a challenging task with vital applications including surveillance and robotics. Recent NLOS reconstruction advances have been achieved using time-resolved measurements which requires expensive and specialized detectors and laser sources. In contrast, we propose a data-driven approach for NLOS 3D…

    Submitted 26 July, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

  18. arXiv:1905.07061  [pdf, other]

    cs.CV cs.LG

    Non-Parametric Priors For Generative Adversarial Networks

    Authors: Rajhans Singh, Pavan Turaga, Suren Jayasuriya, Ravi Garg, Martin W. Braun

    Abstract: The advent of generative adversarial networks (GAN) has enabled new capabilities in synthesis, interpolation, and data augmentation heretofore considered very challenging. However, one of the common assumptions in most GAN architectures is the assumption of simple parametric latent-space distributions. While easy to implement, a simple latent-space distribution can be problematic for uses such as…

    Submitted 16 May, 2019; originally announced May 2019.

    Journal ref: International Conference on Machine Learning (2019)

  19. arXiv:1806.03379  [pdf, other]

    cs.CV cs.AI

    CS-VQA: Visual Question Answering with Compressively Sensed Images

    Authors: Li-Chi Huang, Kuldeep Kulkarni, Anik Jha, Suhas Lohit, Suren Jayasuriya, Pavan Turaga

    Abstract: Visual Question Answering (VQA) is a complex semantic task requiring both natural language processing and visual recognition. In this paper, we explore whether VQA is solvable when images are captured in a sub-Nyquist compressive paradigm. We develop a series of deep-network architectures that exploit available compressive data to increasing degrees of accuracy, and show that VQA is indeed solvabl…

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: 5 pages, 2 figures, accepted to ICIP 2018

    MSC Class: 68

  20. arXiv:1803.06312  [pdf, other]

    cs.CV eess.IV

    EVA$^2$: Exploiting Temporal Redundancy in Live Computer Vision

    Authors: Mark Buckler, Philip Bedoukian, Suren Jayasuriya, Adrian Sampson

    Abstract: Hardware support for deep convolutional neural networks (CNNs) is critical to advanced computer vision in mobile and embedded devices. Current designs, however, accelerate generic CNNs; they do not exploit the unique characteristics of real-time vision. We propose to use the temporal redundancy in natural video to avoid unnecessary computation on most frames. A new algorithm, activation motion com…

    Submitted 16 April, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: Appears in ISCA 2018

  21. arXiv:1802.01722  [pdf, other]

    cs.CV

    Compressive Light Field Reconstructions using Deep Learning

    Authors: Mayank Gupta, Arjun Jauhari, Kuldeep Kulkarni, Suren Jayasuriya, Alyosha Molnar, Pavan Turaga

    Abstract: Light field imaging is limited in its computational processing demands of high sampling for both spatial and angular dimensions. Single-shot light field cameras sacrifice spatial resolution to sample angular viewpoints, typically by multiplexing incoming rays onto a 2D sensor array. While this resolution can be recovered using compressive sensing, these iterative solutions are slow in processing a…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: Published at CCD 2017 workshop held in conjunction with CVPR 2017

  22. arXiv:1705.04352  [pdf, other]

    cs.CV

    Reconfiguring the Imaging Pipeline for Computer Vision

    Authors: Mark Buckler, Suren Jayasuriya, Adrian Sampson

    Abstract: Advancements in deep learning have ignited an explosion of research on efficient hardware for embedded computer vision. Hardware vision acceleration, however, does not address the cost of capturing and processing the image data that feeds these algorithms. We examine the role of the image signal processing (ISP) pipeline in computer vision to identify opportunities to reduce computation and save e…

    Submitted 1 August, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

  23. arXiv:1612.00986  [pdf, other]

    cs.CV

    Deep Learning with Energy-efficient Binary Gradient Cameras

    Authors: Suren Jayasuriya, Orazio Gallo, Jinwei Gu, Jan Kautz

    Abstract: Power consumption is a critical factor for the deployment of embedded computer vision systems. We explore the use of computational cameras that directly output binary gradient images to reduce the portion of the power consumption allocated to image sensing. We survey the accuracy of binary gradient cameras on a number of computer vision tasks using deep learning. These include object recognition,…

    Submitted 3 December, 2016; originally announced December 2016.

  24. arXiv:1605.03621  [pdf, other]

    cs.CV

    ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels

    Authors: Huaijin Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha Molnar

    Abstract: Deep learning using convolutional neural networks (CNNs) is quickly becoming the state-of-the-art for challenging computer vision applications. However, deep learning's power consumption and bandwidth requirements currently limit its application in embedded and mobile systems with tight energy budgets. In this paper, we explore the energy savings of optically computing the first layer of CNNs. To…

    Submitted 16 November, 2016; v1 submitted 11 May, 2016; originally announced May 2016.

    Comments: Presented in CVPR 2016 (oral), 10 pages, 12 figures. This new version corrects the comparison between imaging power for ASPs and a regular image sensor

  25. arXiv:1509.00816  [pdf, other]

    cs.CV

    Depth Fields: Extending Light Field Techniques to Time-of-Flight Imaging

    Authors: Suren Jayasuriya, Adithya Pediredla, Sriram Sivaramakrishnan, Alyosha Molnar, Ashok Veeraraghavan

    Abstract: A variety of techniques such as light field, structured illumination, and time-of-flight (TOF) are commonly used for depth acquisition in consumer imaging, robotics and many other applications. Unfortunately, each technique suffers from its individual limitations preventing robust depth sensing. In this paper, we explore the strengths and weaknesses of combining light field and time-of-flight imag…

    Submitted 2 September, 2015; originally announced September 2015.

    Comments: 9 pages, 8 figures, Accepted to 3DV 2015

  26. arXiv:1503.01804  [pdf, other]

    cs.CV cs.GR

    Frequency Domain TOF: Encoding Object Depth in Modulation Frequency

    Authors: Achuta Kadambi, Vage Taamazyan, Suren Jayasuriya, Ramesh Raskar

    Abstract: Time of flight cameras may emerge as the 3-D sensor of choice. Today, time of flight sensors use phase-based sampling, where the phase delay between emitted and received, high-frequency signals encodes distance. In this paper, we present a new time of flight architecture that relies only on frequency---we refer to this technique as frequency-domain time of flight (FD-TOF). Inspired by optical cohe…

    Submitted 5 March, 2015; originally announced March 2015.

    Comments: 10 pages